Accendo Reliability

Your Reliability Engineering Professional Development Site

  • Home
  • About
    • Contributors
    • About Us
    • Colophon
    • Survey
  • Reliability.fm
  • Articles
    • CRE Preparation Notes
    • NoMTBF
    • on Leadership & Career
      • Advanced Engineering Culture
      • ASQR&R
      • Engineering Leadership
      • Managing in the 2000s
      • Product Development and Process Improvement
    • on Maintenance Reliability
      • Aasan Asset Management
      • AI & Predictive Maintenance
      • Asset Management in the Mining Industry
      • CMMS and Maintenance Management
      • CMMS and Reliability
      • Conscious Asset
      • EAM & CMMS
      • Everyday RCM
      • History of Maintenance Management
      • Life Cycle Asset Management
      • Maintenance and Reliability
      • Maintenance Management
      • Plant Maintenance
      • Process Plant Reliability Engineering
      • RCM Blitz®
      • ReliabilityXperience
      • Rob’s Reliability Project
      • The Intelligent Transformer Blog
      • The People Side of Maintenance
      • The Reliability Mindset
    • on Product Reliability
      • Accelerated Reliability
      • Achieving the Benefits of Reliability
      • Apex Ridge
      • Field Reliability Data Analysis
      • Metals Engineering and Product Reliability
      • Musings on Reliability and Maintenance Topics
      • Product Validation
      • Reliability by Design
      • Reliability Competence
      • Reliability Engineering Insights
      • Reliability in Emerging Technology
      • Reliability Knowledge
    • on Risk & Safety
      • CERM® Risk Insights
      • Equipment Risk and Reliability in Downhole Applications
      • Operational Risk Process Safety
    • on Systems Thinking
      • Communicating with FINESSE
      • The RCA
    • on Tools & Techniques
      • Big Data & Analytics
      • Experimental Design for NPD
      • Innovative Thinking in Reliability and Durability
      • Inside and Beyond HALT
      • Inside FMEA
      • Institute of Quality & Reliability
      • Integral Concepts
      • Learning from Failures
      • Progress in Field Reliability?
      • R for Engineering
      • Reliability Engineering Using Python
      • Reliability Reflections
      • Statistical Methods for Failure-Time Data
      • Testing 1 2 3
      • The Manufacturing Academy
  • eBooks
  • Resources
    • Accendo Authors
    • FMEA Resources
    • Glossary
    • Feed Forward Publications
    • Openings
    • Books
    • Webinar Sources
    • Podcasts
  • Courses
    • Your Courses
    • Live Courses
      • Introduction to Reliability Engineering & Accelerated Testings Course Landing Page
      • Advanced Accelerated Testing Course Landing Page
    • Integral Concepts Courses
      • Reliability Analysis Methods Course Landing Page
      • Applied Reliability Analysis Course Landing Page
      • Statistics, Hypothesis Testing, & Regression Modeling Course Landing Page
      • Measurement System Assessment Course Landing Page
      • SPC & Process Capability Course Landing Page
      • Design of Experiments Course Landing Page
    • The Manufacturing Academy Courses
      • An Introduction to Reliability Engineering
      • Reliability Engineering Statistics
      • An Introduction to Quality Engineering
      • Quality Engineering Statistics
      • FMEA in Practice
      • Process Capability Analysis course
      • Root Cause Analysis and the 8D Corrective Action Process course
      • Return on Investment online course
    • Industrial Metallurgist Courses
    • FMEA courses Powered by The Luminous Group
    • Foundations of RCM online course
    • Reliability Engineering for Heavy Industry
    • How to be an Online Student
    • Quondam Courses
  • Calendar
    • Call for Papers Listing
    • Upcoming Webinars
    • Webinar Calendar
  • Login
    • Member Home
  • Barringer Process Reliability Introduction Course Landing Page
  • Upcoming Live Events
You are here: Home / Articles / Estimating the Normal Distribution Parameters and Tolerance Limits

by Dennis Craggs 3 Comments

Estimating the Normal Distribution Parameters and Tolerance Limits

Estimating the Normal Distribution Parameters and Tolerance Limits

Frequently, data collection is the most time consuming and expensive part of a project. Consequently, people work with small sample data. There is too little data to plot a histogram, so the analysis assumes the underlying population is normally distributed.

A frequent error is to assume the sample average and standard deviation are the population normal mean and standard deviation. When small sample sizes are being analyzed, these assumptions lead to estimation errors.

Methods to make better estimates are discussed in this article.

The Data

If we have a sample of 5 measurements with an average of 100.08 and standard deviation of 2.75. Then how do we estimate confidence limits on the population mean $-\mu-$, confidence limits on the population standard deviation $-\sigma-$, and the statistical tolerance limits?

Confidence Limits on the Population Mean

The sample average is not the mean. Successive groups of measurements will have different sample averages and thus exhibit variation about the mean. In statistics, it can be proved that the averages drawn from a normally distributed random variable are themselves normally distributed. Therefore, rather than using a sample average, it is better to calculate confidence limits on the mean. Some use the normal distribution to calculate these limits. The problem is that the population standard deviation, $-\sigma-$, is unknown. It can be shown that difference between the population mean and the sample mean divided by the mean standard deviation follows the t-distribution, i.e.,

$$\frac{\bar{x}-\mu}{s/\sqrt{n}}{\sim}t(\nu)$$

(1)

where $-\nu=n-1-$, the degrees of freedom. Also, the $-\sim-$ symbol means “is distributed as”.

The t-distributions are a family of distributions differentiated by degrees of freedom. With increasing degrees of freedom, the t-distribution family approaches the normal distribution as shown in figure 1,

Figure 1

The curve shows t-distributions for degrees of freedom of 2, 3, 4, 5, 10, and 20 vs. the normal distribution. The t-distributions are symmetric so the probability density at t is equal to the probability density at –t. Note that below 20 degrees of freedom, the divergence from the normal is visually significant. In general, when compared to the normal distribution, the t-distributions are flatter. The center probability densities are lower and the tail probability densities are higher.

Let’s assume a confidence C=90% for our estimate of $-\mu-$. The probability of error is $-\alpha=1-C=0.1-$ and would be split equally between the upper and lower tails of the distribution. The general equation to calculate two-sided limits on the mean is equation 2,

$$\bar x + t_{\alpha/2,n-1}s/\sqrt{n}\lt\mu\lt\bar x + t_{1-\alpha/2,n-1}s/\sqrt{n}$$

(2)

The notation deserves some explanation. The t-subscripts indicate a cumulative probability and the degrees of freedom in the problem. This equation assumes the usage of a table of t-distribution cumulative percentiles, i.e., the lower tail, for different t-values. Alternatively, the t-value could come from the Excel function T.INV, which provides the t-value for a cumulative probability P with specified degrees of freedom. When P is less than 0.5, the t-values are negative.

With 5 samples and a 90% confidence is desired, $-t_{0.05,4}=-2.132-$. Using $-\bar x=100.08-$ and s= 2.75, then the 90% confidence interval on the normal mean is (97.46,102.70).

Confidence Limits on the Population Standard Deviation

A similar problem occurs estimating the population standard deviation, $-\sigma-$, using the sample size and the sample standard deviation, s. It can be shown that the degrees of freedom, $-\nu-$, times the square of the sample standard deviation ($-S^2-$), and divided by the population standard deviation follows a Chi-Square distribution with $-\nu=n-1-$ degrees of freedom,  formula 3:

$$(n-1)s^2/\sigma^2\sim\chi^2(\nu)$$

(3)

 A plot for 1, 2, 3, 4, and 5 degrees of freedom is shown in figure 2.

Figure 2

Note that the Chi-Square family pdf is skewed to the right and therefore not symmetric. Two-sided limits on s may be calculated using formula 4,

$$s\sqrt{(n-1)/\chi_{1-\alpha/2,n-1}^2}\lt\sigma\lt{s\sqrt{(n-1)/\chi_{\alpha/2,n-1}^2}}$$

(4)

Using a left-sided cumulative $-\chi^2-$ probability table and a sample size of 5, then $-\chi^2_{0.05,4}=0.711-$ and $-\chi^2_{0.95,4}=11.14-$. Substituting the sample standard deviation, the limits on the population standard deviation are (1.65,6.52). Note that while the sample standard deviation was 2.75, the population standard deviation could be as large as 6.52, a very large difference.

Tolerance Limits on the Population

Tolerance limits cannot be directly calculated using the normal distribution table. The normal table assumes that we know $-\mu-$ and $-\sigma-$. Calculating tolerance limits on a normal population from test statistics should use a non-central t-distribution. The theory is rather involved, but factors for one-sided and two-sided tolerance limits containing P% of the population with C% confidence are documented in Tables A6 and A7 of Experimental Statistics Handbook 91, United States Department of Commerce, National Bureau of Standards. One may select 75%, 90%, 95% and 99% confidence values for 75%, 90%, 95%, 99%, and 99.9% populations. Test sample sizes range from 2 to 1000. Other sources are available.

For two-sided tolerance limits, define LL as the lower limit and UL as the upper limit. Then use equations 5 and 6 and the two-sided table A6.

$$LL=\bar{x}-ks$$

(5)

$$UL=\bar{x}+ks$$

(6)

The factor k is sometimes called the k-factor.

If calculating a one-sided lower tolerance limit, use equation 5 and table A7 for the k-factor. Alternatively, if calculating a one-sided upper tolerance limit, use equation 6 and table A7 for the k-factor.

For the sample problem, let’s calculate the two-sided 99.9% tolerance limit at 90% confidence. From table A6, k=6.875, therefore, the tolerance limits are (81.17, 118.99). If the sample average ±3 sigma were used, then the limits of (91.83,108.33) would underestimate the tolerance limits. The k-factor approach is more accurate as it considers the variability in both the sample mean and standard deviation.

Conclusions

  • The t-distribution is symmetric and provides statistical estimates of the population mean.
  • The Chi-square distribution is right skewed and provides statistical estimates of the population standard deviation.
  • K-factors based on the non-central t-distribution compensate for sample variation and provide statistically valid estimates of the population spread.

If you have any questions on this material, please ask in the feedback section. I promise to get back to you.

Dennis Craggs

810-964-1529

dlcraggs@me.com

Filed Under: Articles, Big Data & Analytics, on Tools & Techniques

About Dennis Craggs

I am a mechanical engineer, programmer, and statistician. My career spanned the aerospace, NASA and Teledyne CAE, and automotive, Ford and Chrysler, industries. After retirement, I started consulting as a reliability engineer and am writing articles on big data analytics. My primary goal is to assist young engineers and consult on product development issues.

« Asset Healthcare Management
4 Considerations When Designing A Risk Management Program »

Comments

  1. Arin Holecek says

    February 18, 2020 at 2:07 PM

    Hello, I visited the online statistics handbook referenced, but I do not see the tables that are mentioned. Instead there are equations for tolerance factor k. I am writing an article that will reference your article as well as the online handbook, but if there are tables still available I’d like to provide a specific link to those. Do you mind pointing me in the right direction?
    Thank you so much,
    Arin Holecek

    Reply
    • Dennis Craggs says

      February 18, 2020 at 4:53 PM

      I was using my personal copy of the handbook. However, anybody can purchase a copy on line from Amazon or other sources. To see a scanned image of the publication, follow this link: Experimental Statistics Manual 91

      You browser window should show a table A-6, pages 446 and 447. Then, use the right arrow to shift the display to display the remaining pages of table A-6, pages 448 and 449. Right shift again to display table A-7, pages 450 and 451. Depending on your display, the table is very readable. To maximize the size use the button at the lower right.

      Reply
      • Arin Holecek says

        February 19, 2020 at 11:09 AM

        Thank you so much! I love your website! Many statistical subjects are elucidated beautifully.

        Reply

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Big Data & Analytics series Article by Dennis Craggs

Join Accendo

Receive information and updates about articles and many other resources offered by Accendo Reliability by becoming a member.

It’s free and only takes a minute.

Join Today

Recent Articles

  • Gremlins today
  • The Power of Vision in Leadership and Organizational Success
  • 3 Types of MTBF Stories
  • ALT: An in Depth Description
  • Project Email Economics

© 2025 FMS Reliability · Privacy Policy · Terms of Service · Cookies Policy