STATISTICAL INTERVALS 2.pptx staticssmsokso

CONFIDENCE INTERVALS
• An interval estimate of a parameter is an interval or a range of
values used to estimate the parameter.This estimate may or may
not contain the value of the parameter being estimated.

INTRODUCTION
Stress and the College Student
 A recent poll conducted by the mtvU/Associated Press found that
85% of college students reported that they experience stress daily.
The study said,“It is clear that being stressed is a fact of life on college
campuses today.”
 The study also reports that 74% of students’ stress comes from
school work, 71% from grades, and 62% from financial woes.The
report stated that 2240 undergraduate students were selected and
that the poll has a margin of error of 3.0%.

CONFIDENCE INTERVALS
• The confidence level of an interval estimate of a
parameter is the probability that the interval estimate will
contain the parameter, assuming that a large number of
samples are selected and that the estimation process on the
same parameter is repeated.
• A confidence interval is a specific interval estimate of a
parameter determined by using data obtained from a
sample and by using the specific confidence level of the
estimate.

CONFIDENCE
INTERVALS:
SINGLE SAMPLE

MARGIN OF ERROR
• The margin of error,
also called the
maximum error of the
estimate, is the
maximum likely
difference between the
point estimate of a
parameter and the
actual value of the
parameter.

ASSUMPTIONS FOR FINDING A CONFIDENCE INTERVAL
FOR A MEAN WHEN σ IS KNOWN
1. The sample is a random sample.
2. Either n ≥ 30 or the population is normally distributed when n <
30.

ROUNDING RULE FOR A CONFIDENCE INTERVAL FOR
A MEAN
• When you are computing a confidence interval for a population mean by using
raw data, round off to one more decimal place than the number of decimal
places in the original data.
• When you are computing a confidence interval for a population mean by using
a sample mean and a standard deviation, round off to the same number of
decimal places as given for the mean.

SAMPLE PROBLEM: DAYS IT TAKES TO
SELL A CAMARO
• A researcher wishes to estimate the number of days it takes an automobile
dealer to sell a Chevrolet Camaro. A random sample of 50 cars had a mean
time on the dealer’s lot of 54 days.Assume the population standard deviation
to be 6.0 days. Find the best point estimate of the population mean and the
95% confidence interval of the population mean.
• Source: Based on information obtained from Power Information Network.
Ans: 52.3 <

SAMPLE PROBLEM: NUMBER OF
CUSTOMERS
• A large department store found that it averages 362 customers per
hour.Assume that the standard deviation is 29.6 and a random
sample of 40 hours was used to determine the average. Find the
99% confidence interval of the population mean.
Ans: 350 < < 374

95% CONFIDENCE INTERVALS FOR EACH SAMPLE MEAN

FINDING α/2 FOR A 98% CONFIDENCE
INTERVAL

SAMPLE PROBLEM
• The following data represent a random sample of the assets (in
millions of dollars) of 30 credit unions in southwestern Pennsylvania.
Assume the population standard deviation is 14.405. Find the 90%
confidence interval of the mean.
12.23 16.56 4.39 2.89 1.24 2.17
13.19 9.16 1.42 73.25 1.91 14.64
11.59 6.69 1.06 8.74 3.17 18.13
7.92 4.78 16.85 40.22 2.42 21.58
5.01 1.47 12.24 2.27 12.77 2.76
Ans: 6.752 < < 15.43

CONFIDENCE
INTERVALS FOR
THE MEAN WHEN σ
IS UNKNOWN

CHARACTERISTICS OF THE t - DISTRIBUTION
• The t distribution shares some characteristics of the
standard normal distribution and differs from it in others.
The t distribution is similar to the standard normal
distribution in these ways:
1. It is bell-shaped.
2. It is symmetric about the mean.
3. The mean, median, and mode are equal to 0 and are
located at the center of the distribution.
4. The curve approaches but never touches the x axis.

CHARACTERISTICS OF THE t - DISTRIBUTION
• The t distribution differs from the standard normal
distribution in the following ways:
1. The variance is greater than 1.
2. The t distribution is actually a family of curves based on
the concept of degrees of freedom, which is related to
sample size.
3. As the sample size increases, the t distribution approaches
the standard normal distribution.

DEGREES OF FREEDOM
• The degrees of freedom are the number of values that
are free to vary after a sample statistic has been computed,
and they tell the researcher which specific curve to use
when a distribution consists of a family of curves.

ASSUMPTIONS FOR FINDING A CONFIDENCE
INTERVAL FOR A MEAN WHEN σ IS UNKNOWN
1. The sample is a random sample.
2. Either n ≥ 30 or the population is normally distributed when n <
30.

SAMPLE PROBLEM: INFANT GROWTH
• A random sample of 10 children found that their average growth
for the first year was 9.8 inches.Assume the variable is normally
distributed and the sample standard deviation is 0.96 inch. Find the
95% confidence interval of the population mean for growth during
the first year.

SAMPLE PROBLEM: HOME FIRES
STARTED BY CANDLES
• The data represent a random sample of the number of home fires
started by candles for the past several years. (Data are from the
National Fire Protection Association.) Find the 99% confidence
interval for the mean number of home fires started by candles each
year.
5460 5900 6090 6310 7160 8440 9930

CONFIDENCE INTERVALS AND SAMPLE
SIZE FOR PROPORTIONS

STATISTICAL INTERVALS 2.pptx staticssmsokso

SAMPLE PROBLEM: COVERING COLLEGE COSTS
• A survey conducted by Sallie Mae and Gallup of 1404 respondents
found that 323 students paid for their education by student loans.
Find the 90% confidence interval of the true proportion of students
who paid for their education by student loans.

SAMPLE PROBLEM: LAWN WEEDS
•A survey of 1898 adults with lawns conducted by
Harris Interactive Poll found that 45% of the adults
said that dandelions were the toughest weeds to
control in their yards. Find the 95% confidence
interval of the true proportion who said that
dandelions were the toughest weeds to control in
their yards.

CONFIDENCE
INTERVALS FOR
VARIANCES AND
STANDARD
DEVIATIONS

SAMPLE PROBLEM: NICOTINE CONTENT
• Find the 95% confidence interval for the variance and
standard deviation of the nicotine content of cigarettes
manufactured if a random sample of 20 cigarettes has a
standard deviation of 1.6 milligrams.Assume the variable is
normally distributed.

SAMPLE PROBLEM: NAMED STORMS
• Find the 90% confidence interval for the variance and standard
deviation for the number of named storms per year in the Atlantic
basin.A random sample of 10 years has been used.Assume the
distribution is approximately normal.
10 5 12 11 13
15 19 18 14 16

PREDICTION INTERVALS
• Used to predict the possible value of a future observation
• Example: In quality control, an engineer may need to use the
observed data to predict a new observation.

Prediction Interval for Future Observation
The prediction interval for Xn+1 will always be longer than the confidence interval for .
40

EGR 252 Ch. 9 Lecture1 MDH 2015 9th edition Slide 41
PREDICTION INTERVAL
• For a normal distribution of unknown mean μ, and standard deviation σ, a
100(1-α)% prediction interval of a future observation, x0 is
if σ is known, and
if σ is unknown
n
z
X
x
n
z
X
1
1
1
1 2
/
0
2
/ 




 
 

n
s
t
X
x
n
s
t
X n
n
1
1
1
1 1
,
2
/
0
1
,
2
/ 




 
 


Consider the tensile adhesion tests on 22 specimens of U-700 alloy.
The load failure for the samples was observed and it was found that
the mean is 13.71 and the standard deviation is 3.55.We plan to test
a twenty third specimen. Find the load failure for this specimen at
95% prediction interval.
43

EXAMPLE 12
• Consider the following sample of fat content (in percentage) of n = 10 randomly selected hot
dogs (“Sensory and Mechanical Assessment of the Quality of Frankfurters,” J. ofTexture Studies,
1990: 395–409):
• Find the fat content of the 17th
sample at 90% prediction level.

TOLERANCE LIMITS (INTERVALS)
• What if you want to be 95% sure that the interval contains 95% of the values? Or 90% sure
that the interval contains 99% of the values?
• These questions are answered by a tolerance interval.To compute, or understand, a
tolerance interval you have to specify two different percentages. One expresses how sure
you want to be, and the other expresses what fraction of the values the interval will
contain.

Definition
8-7 TOLERANCE AND PREDICTION INTERVALS
8-7.2 Tolerance Interval for a Normal Distribution
47

9.7: TOLERANCE LIMITS
• For a normal distribution of unknown mean μ, and unknown standard
deviation σ, tolerance limits are given by
x + ks
where k is determined so that one can assert with 100(1-γ)%
confidence that the given limits contain at least the proportion 1-
α of the measurements.
• Table A.7 (page 745) gives values of k for (1-α) = 0.9,
0.95, or 0.99 and γ = 0.05 or 0.01 for selected
values of n.

TOLERANCE LIMITS
• How to determine 100(1-γ)% and 1-α.
For a sample size of 8, find the tolerance interval that gives two-sided 95%
bounds on 90% of the distribution or population. X is 15.6 and s is 1.4
From table on pg. 745, find the corresponding value:
n = 8, g = .05, a = 0.1 corresponding k…k = 3.136
x + ks = 15.6 + (3.136)(1.4)
Tolerance interval 19.99 – 11.21
We are 95% confident that 90% of the population falls within the limits
of 11.21 and 19.99
1-g (boundary or the limits) 1-a (proportion of the distribution)

CASE STUDY 9.1C (PAGE 281)
• Find the 99% tolerance limits that will contain 95% of
the metal pieces produced by the machine, given a
sample mean diameter of 1.0056 cm and a sample
standard deviation of 0.0246.
• Table A.7 (page 745)
– (1 - α ) = 0.95
– (1 – Ƴ ) = 0.99
– n = 9
– k = 4.550
– x ± ks = 1.0056 ± (4.550) (0.0246)
• We can assert with 99% confidence that the
tolerance interval from 0.894 to 1.117 cm will contain
95% of the metal pieces produced by the machine.

Example 8-10
8-7 TOLERANCE AND PREDICTION INTERVALS
51

TOLERANCE INTERVALS
• Consider a population of automobiles of a certain type, and suppose that under specified
conditions, fuel efficiency (mpg) has a normal distribution with  = 30 and  = 2.
Then since the interval from –1.645 to 1.645 captures 90% of the area under the z curve, 90%
of all these automobiles will have fuel efficiency values between  – 1.645 = 26.71 and  +
1.645 = 33.29.
But what if the values of  and  are not known? We can take a sample of size n, determine
the fuel efficiencies, and s, and form the interval whose lower limit is – 1.645s and whose
upper limit is + 1.645s.

TOLERANCE INTERVALS
• However, because of sampling variability in the estimates of  and , there is a good chance
that the resulting interval will include less than 90% of the population values.
Intuitively, to have an a priori 95% chance of the resulting interval including at least 90% of the
population values, when and s are used in place of  and  we should also replace 1.645 by
some larger number.
For example, when n = 20, the value 2.310 is such that we can be 95% confident that the
interval  2.310s will include at least 90% of the fuel efficiency values in the population.

TOLERANCE INTERVALS
• Let k be a number between 0 and 100.A tolerance interval for capturing at least k% of the
values in a normal population distribution with a confidence level 95% has the form
•  (tolerance critical value)  s
•
Tolerance critical values for k = 90, 95, and 99 in combination with various sample sizes are
given in Appendix Table A.6.This table also includes critical values for a confidence level of 99%
(these values are larger than the corresponding 95% values).

TOLERANCE INTERVALS
• Replacing  by + gives an upper tolerance bound, and using – in place of  results in a lower
tolerance bound. Critical values for obtaining these one-sided bounds also appear in Appendix
Table A.6.

EXAMPLE 14
• As part of a larger project to study the behavior of stressed-skin panels, a structural
component being used extensively in North America, the article “Time-Dependent Bending
Properties of Lumber” (J. ofTesting and Eval., 1996: 187–193) reported on various mechanical
properties of Scotch pine lumber specimens.
• Consider the following observations on modulus of elasticity (MPa) obtained 1 minute after
loading in a certain configuration:

EXAMPLE 14
• There is a pronounced linear pattern in a normal probability plot of the data. Relevant summary
quantities are n = 16,
= 14,532.5, s = 2055.67. For a confidence level of 95%, a two-sided tolerance interval for capturing
at least 95% of the modulus of elasticity values for specimens of lumber in the population sampled
uses the tolerance critical value of 2.903.
• The resulting interval is
• 14,532.5  (2.903)(2055.67) = 14,532.5  5967.6
• = (8,564.9, 20,500.1)
cont’d

EXAMPLE 14
• We can be highly confident that at least 95% of all lumber specimens have modulus of
elasticity values between 8,564.9 and 20,500.1.
• The 95% CI for  is (13,437.3, 15,627.7), and the 95% prediction interval for the modulus of
elasticity of a single lumber specimen is (10,017.0, 19,048.0).
•
Both the prediction interval and the tolerance interval are substantially wider than the
confidence interval.
cont’d

STATISTICAL INTERVALS 2.pptx staticssmsokso

More Related Content

Similar to STATISTICAL INTERVALS 2.pptx staticssmsokso (20)

More from JordanRonquillo3 (7)

Recently uploaded (20)

STATISTICAL INTERVALS 2.pptx staticssmsokso

Editor's Notes