When Should I Use Confidence Intervals, Prediction Intervals, and ... [PDF]

Apr 18, 2013 - In this post, we'll take a look at the different types of intervals that are available in Minitab, their characteristics, and when you should use them. I'll cover ... A confidence interval is a range of values, derived from sample statistics, that is likely to contain the value of an unknown population parameter. Because ...

3 downloads 21 Views 140KB Size

Recommend Stories


Confidence Intervals
Goodbyes are only for those who love with their eyes. Because for those who love with heart and soul

Confidence Intervals
You can never cross the ocean unless you have the courage to lose sight of the shore. Andrè Gide

Confidence Intervals: Sampling Distribution [PDF]
Sep 13, 2012 - IMPORTANT POINTS. • Sample statistics vary from sample to sample. (they will not match the parameter exactly). • KEY QUESTION: For a given sample statistic, what are plausible values for the population parameter? How much uncertain

Bootstrap confidence intervals
I tried to make sense of the Four Books, until love arrived, and it all became a single syllable. Yunus

Estimating and Finding Confidence Intervals
If you are irritated by every rub, how will your mirror be polished? Rumi

Hypothesis Testing and Confidence Intervals
Where there is ruin, there is hope for a treasure. Rumi

Chebyshev's, CLT, and Confidence Intervals
If you are irritated by every rub, how will your mirror be polished? Rumi

Confidence Intervals for Proportions
Be like the sun for grace and mercy. Be like the night to cover others' faults. Be like running water

7.10 a comparison of confidence intervals and tolerance intervals
Be grateful for whoever comes, because each has been sent as a guide from beyond. Rumi

Scientific Evidence and Confidence Intervals
No matter how you feel: Get Up, Dress Up, Show Up, and Never Give Up! Anonymous

Idea Transcript


The Minitab Blog

(http://blog.minitab.com)

Data Analysis (http://blog.minitab.com/blog/data-analysis-2) Project Tools (http://blog.minitab.com/blog/project-tools-2)



Quality Improvement (http://blog.minitab.com/blog/quality-improvement-2) Industries keyboard_arrow_down

Minitab.com (http://www.minitab.com/)

When Should I Use Confidence Intervals, Prediction Intervals, and Tolerance Intervals Jim Frost (http://blog.minitab.com/blog/adventures-in-statistics-2) . 18 April, 2013 11

17

()

21

() (http://blog.minitab.com/blog/adventures-in-statistics-2/when-should-i-use-confidence-intervals-prediction-intervals-and-tolerance-intervals)

7 Deadly Statistical Sins Even the Experts Make

You Might Also Like: What's the

Do you know how to avoid them?

Get the facts > (http://ow.ly/JkKn30aUDAp)

In statistics, we use a variety of intervals to characterize the results. The most well-known of these are confidence intervals. However, confidence intervals are not always appropriate. In this post, we’ll take a look at the different types of intervals that are available in Minitab, their characteristics, and when you should use them. I’ll cover confidence intervals, prediction intervals, and tolerance intervals. Because tolerance intervals are the least-known, I’ll devote extra time to explaining how they work and when you’d want to use them.

What are Confidence Intervals? A confidence interval is a range of values, derived from sample statistics, that is likely to contain the value of an unknown population parameter. Because of their random nature, it is unlikely that two samples from a given population will yield identical confidence intervals. But if you repeated your sample many times, a certain percentage of the resulting confidence intervals would contain the unknown population parameter. The percentage of these confidence intervals that contain the parameter is the confidence level of the interval.

Difference between Confidence, Prediction, Most frequently, you’ll use confidence intervals to bound the mean or standard and Tolerance Intervals? deviation, but you can also obtain them for regression coefficients, proportions, rates of occurrence (Poisson), and for the differences between populations. Suppose that you randomly sample light bulbs and measure the burn time. Minitab calculates that the 95% confidence interval is 1230 – 1265 hours. The confidence interval indicates that you can be 95% confident that the mean for the entire population of light bulbs falls within this range. Confidence intervals only assess sampling error in relation to the parameter of interest. (Sampling error is simply the error inherent when trying to estimate the characteristic of an entire population from a sample.) Consequently, you should be aware of these important considerations: As you increase the sample size, the sampling error decreases and the intervals become narrower. If you could increase the sample size to equal the population, there would be no sampling error. In this case, the confidence interval would have a width of zero and be equal to the true population parameter. Confidence intervals only tell you about the parameter of interest and nothing about the distribution of individual values. In the light bulb example, we know that the mean is likely to fall within the range, but the 95% confidence interval does not predict that 95% of future observations will fall within the range. We’ll need to use a different type of interval to draw a conclusion like that. For more information about confidence intervals, please read my blog post: Understanding Hypothesis Tests: Confidence Intervals and Confidence Levels (http://blog.minitab.com/blog/adventures-in-statistics/understanding-hypothesis-tests%3A-confidenceintervals-and-confidence-levels).

What Are Prediction Intervals? A prediction interval is a type of confidence interval that you can use with predictions (http://blog.minitab.com/blog/adventures-instatistics/how-to-predict-with-minitab-using-bmi-to-predict-the-body-fat-percentage-part-1) from linear and nonlinear models (http://blog.minitab.com/blog/adventures-in-statistics/linear-or-nonlinear-regression-that-is-the-question). There are two types of prediction intervals that use predictor values entered into the model equation.

Confidence interval of the prediction A confidence interval of the prediction is a range that is likely to contain the mean response given specified settings of the predictors in your model. Just like the regular confidence intervals, the confidence interval of the prediction presents a range for the mean rather than the distribution of individual data points. Going back to our light bulb example, suppose we design an experiment to test how different production methods (Slow or Quick) and filament materials (A or B) affect the burn time. After we fit a model, statistical software (http://www.minitab.com/products/minitab) like Minitab can predict the response for specific settings. We want to predict the mean burn time for bulbs that are produced with the Quick method and filament type A. Minitab calculates a confidence interval of the prediction of 1400 – 1450 hours. We can be 95% confident that this range includes the mean burn time for light bulbs manufactured using these settings. However, it doesn’t tell us anything about the distribution of burn times for individual bulbs.

Prediction interval A prediction interval is a range that is likely to contain the response value of a single new observation given specified settings of the predictors in your model. We’ll use the same settings as above, and Minitab calculates a prediction interval of 1350 – 1500 hours. We can be 95% confident that this range includes the burn time of the next light bulb produced with these settings. The prediction interval is always wider than the corresponding confidence interval of the prediction because of the added uncertainty involved in predicting a single response versus the mean response. We’re getting down to determining where an individual observation is likely to fall, but you need a model for it to work.

What Are Tolerance Intervals? A tolerance interval is a range that is likely to contain a specified proportion of the population. To generate tolerance intervals, you must specify both the proportion of the population and a confidence level. The confidence level is the likelihood that the interval actually covers the proportion. Let’s look at an example, because that’s the easiest way to understand tolerance intervals.

Example of a tolerance interval The light bulb manufacturer is interested in how long their light bulbs burn. The analysts randomly sample 100 bulbs and record the burn time in this worksheet (//cdn2.content.compendiumblog.com/uploads/user/458939f4-fe08-4dbc-b271-efca0f5a2682/479b4fbdf8c0-4011-9409-f4109cc4c745/File/c4ab0558e6b5c4e7f6b759528067d9d0/lightbulb.MTW). In Minitab, go to Stat > Quality Tools > Tolerance Intervals. Under Data, choose Samples in columns. In the textbox, enter Hours. Click OK. (If you're not already using it, please download the free 30-day trial of Minitab (http://www.minitab.com/products/minitab/free-trial/) and play along!)

The normality test indicates that our data are normally distributed. Consequently, we can use the Normal interval (1060 1435). The manufacturer is 95% confident that at least 95% of all burn times will fall between 1060 to 1435 hours. If this range is wider than their clients' requirements, the process may produce excessive defects.

How tolerance intervals work compared to confidence intervals A confidence interval's width is due entirely to sampling error. As the sample size approaches the entire population, the width of the confidence interval approaches zero. In contrast, the width of a tolerance interval is due to both sampling error and variance in the population. As the sample size approaches the entire population, the sampling error diminishes and the estimated percentiles approach the true population percentiles. To determine where 95% of the population falls, Minitab calculates the data values that correspond to the estimated 2.5th and 97.5th percentiles (97.5 - 2.5 = 95). Read here (http://blog.minitab.com/blog/adventures-in-statistics/the-graphical-benefits-ofidentifying-the-distribution-of-your-data) for more information about percentiles and population proportions. Unfortunately, the percentile estimates will have error because we are working with a sample. We can’t be 100% confident that a tolerance interval truly contains the specified proportion. Consequently, tolerance intervals have a confidence level.

Uses for tolerance intervals In general, use tolerance intervals if you have sampled data and want to predict a range of likely outcomes. In the quality improvement field, Six Sigma analysts generally require that the output from a process have measurements (e.g., burn time, length, etc.) that fall within the specification limits. In this context, tolerance intervals can detect excessive variation by comparing client requirements to tolerance limits that cover a specified proportion of the population. If the tolerance interval is wider than the client's requirements, there may be too much product variation. With Minitab statistical software (http://it.minitab.com/en-us/products/minitab/free-trial.aspx), it’s easy to obtain all of these intervals for your data! You just need to be aware of what information each interval provides. (http://blog.minitab.com/blog/understanding-statistics/whats-the-difference-between-confidence-prediction-and-tolerance-intervals) Understanding Hypothesis Tests: Confidence Intervals and Confidence Levels (http://blog.minitab.com/blog/adventures-in-statistics-2/understanding-hypothesis-tests%3Aconfidence-intervals-and-confidence-levels) Tip 3: Gain Confidence with Confidence Intervals (http://blog.minitab.com/blog/statistics-tips-from-a-technical-trainer/tip-3-gain-confidence-with-confidence-intervals-v2) Applied Regression Analysis: How to Present and Use the Results to Avoid Costly Mistakes, part 2 (http://blog.minitab.com/blog/adventures-in-statistics-2/applied-regressionanalysis-how-to-present-and-use-the-results-to-avoid-costly-mistakes-part-2)

Comments Name: Jessica • Wednesday, March 19, 2014 Does this mean that confidence intervals give a range of MEANS while prediction intervals give a range of Y values?

Name: Jim Frost • Friday, March 21, 2014 Hi Jessica, Yes, that's it. :) Confidence intervals give a range for the mean of a population. Prediction intervals give a range for the y-value of the next observation given specific x-values. Jim



Who We Are

Authors

Minitab is the leading provider of software and services for quality improvement and statistics education. More than 90% of Fortune 100 companies use Minitab Statistical Software, our flagship product, and more students worldwide have used Minitab to learn statistics than any other package.

Eston Martz (http://blog.minitab.com/blog/understandingstatistics) Michelle Paret (http://blog.minitab.com/blog/michelleparet) Bonnie K. Stone (http://blog.minitab.com/blog/qualitybusiness) Marilyn Wheatley (http://blog.minitab.com/blog/marilynwheatleys-blog) Bruno Scibilia (http://blog.minitab.com/blog/applying-

Minitab Inc. is a privately owned company headquartered in State College, Pennsylvania, with subsidiaries in the United Kingdom, France, and Australia. Our global network of representatives serves more than 40 countries around the world.

Visit Us at Minitab.com Blog Map (http://blog.minitab.com/sitemap.html) | Legal (http://www.minitab.com/legal/) | Privacy Policy (http://www.minitab.com/legal/#privacypolicy) | Trademarks (http://www.minitab.com/legal/trademarks/) Copyright ©2017 Minitab Inc. All rights Reserved.

statistics-in-quality-projects)

Smile Life

When life gives you a hundred reasons to cry, show life that you have a thousand reasons to smile

Get in touch

© Copyright 2015 - 2024 PDFFOX.COM - All rights reserved.