SlideShare a Scribd company logo
Chapter 10 Introduction to Inference Section 10.1 Estimating with Confidence
Twenty five samples from the same population gave these 95% confidence intervals, 95% of all samples give an interval that contains the population mean u.
In 95% of all samples, x-bar lies within ± 9  of the unknown population mean µ. Also µ lies within ± 9 of x-bar of the samples.
To say that x-bar is a 95% confidence interval for the population mean µ is to say that in repeated samples, 95% of these intervals capture the µ
The central probability 0.8 under a standard normal curve lies between -1.28 and + 1.28.  That is, there is area .1 to the left and to the right of the curve.
The central probability C under a standard normal curve lies between –z* and + z* has area (1-C)/2 to its right under the curve, we call it the upper (1-C)/2 critical value.
As we learned in Chapter 9, because of sampling variability, the statistic calculated from a sample is rarely equal to the true parameter of interest.  Therefore, when we are trying to estimate a parameter, we must go beyond our statistic value to construct a reasonable range that captures the true parameter value.
Confidence Interval: The data must come from SRS of the population of interest The sampling distribution of x-bar is approximately normal
Sampling Distribution of p-hat Proportions: Let p-hat be the proportion of successes in a random sample of size n from a population whose proportion is p. The mean value of p-hat is µp-hat and standard deviation  σ -p-hat Rule 1: µ p-hat  = p Rule 2:  σ p-hat  =  Rule 3: when n is large and p is not very close to 0  or  1, the sampling distribution of p is  approximately normal.
Sampling Distribution of x-bar – Means Let x-bar be the mean of the observations in a random sample of size n having mean  µ  and standard deviation  σ .  The mean value of the  x-bar distribution  µ   and standard deviation of x-bar is  σ   Rule 1:  µ xbar  =  µ Rule 2:  σ   xbar  =  σ  /  as long as 10% of the population is in the sample. Rule 3: when the population distribution is normal, the sampling distribution of x-bar is also normal for any sample size n Rule 4: (Central Limit Theorem) CLT when n is sufficiently large, the sampling distribution of x-bar is well approximated by a normal curve even when the population is not itself normal.
Confidence Level C Single Tail Area (1-C)/2 Critical z value 90% .05 1.645 95% .025 1.960 99% .005 2.576
Estimating with Confidence – Interval Behavior Margin of Error   = (z critical value) σ /   What are some ways in which we could lower our margin of error? z get smaller – as smaller confidence level σ  gets smaller n gets larger
Confidence Interval for p Single Proportion Confidence Interval for µ Single Mean ( σ  known) Conditions Conditions If an SRS of size n is chosen from a population with unknown proportion p, then a level C Confidence Interval for p is: If an SRS of size n is chosen from a population with unknown proportion, µ then a level C Confidence Interval for µ is: p-hat ± z critical value ( ((p(1-p)/ n) X-bar ± z critical value ( σ  /
We will follow the 4 step process outlined below throughout the chapter.
Inference Toolbox:  Confidence Intervals To Construct a Confidence Interval for a parameter (1)Identify the population of interest and the parameter you want to draw conclusions about (2)Choose the appropriate inference procedure.  Verify the conditions for the selected parameter (3)If the conditions are met, carry out the inference procedures.  CI = estimate ± margin of error (4)Interpret the results in the context of the problem
Example Problem 1 A manufacturer of high resolution video terminals must control the tension of the viewing screen to avoid tears and wrinkles.  The tension is measured in millivolts (mV).  Careful study has shown that when the process is operating properly, the standard deviation of the tension reading is 43mV. (1) Given a sample of 20 screens, with xbar = 306.3 mV, from a single day’s production, construct a 90% confidence interval for the mean tension µ of all the screens produced on this day.  Follow the steps from the previous slide.
1)Identify the population of interest and the parameter you want to draw conclusions about  all video terminal produced on the same day Choose the appropriate inference procedure.   σ  is known use CI for µ of SRS of 20 Verify the conditions for the selected parameter  x-bar is approximately normal If the conditions are met, carry out the inference procedures.  CI = estimate ± margin of error CI = x-bar ± critical z value  ·  σ / CI = 306.3 ± 1.645  ·  43 /  = 306.3 ± 15.8 = (290.5, 322.1) Interpret the results in the context of the problem We are 90% confident that the true mean is between 290.5 and 322.1
2. Suppose the manufacturer wants 99% confidence rather than 90%.  Using your data from problem 1, construct a 99% CI.  How does it compare to the 90% CI? Why? 306.3 ± 2.576 (43/  ) = 306.3 ± 24.768 = (281.532, 331.068) the interval is larger. We have greater confidence that the interval captures the true value.
3. Company management wants to report the mean screen tension for the day’s production accurate within    5 mV with 95% confidence.  How large a sample of video monitors must be measured to comply with this request? 1.960(43)/ 84.28 = 16.856 = n = 284.12 approximately 285
Do problems 1,7 and 10 from The Practice of Statistics 2 nd  edition.
Section 10.2 Tests of Significance
Tests of Significance: We have learned that Confidence Intervals can be used to estimate a parameter.  Often in statistics we want to use sample data to determine whether or not a claim or hypothesis about a parameter is plausible.  A test of significance is a procedure in which we can use sample data to test such a claim.  We will focus on tests about a population mean  μ  and proportions p.
Note about Identifying  H 0  and  H A
Practice:  Identify the Null and Alternative Hypothesis. Express the corresponding null and alternative hypotheses in symbolic form. The proportion of drivers who admit to running red lights is greater than 0.5.
The proportion of drivers who admit to running red lights is greater than 0.5.  First we express the given claim as  p  > 0.5.  Next, we notice that the statement does not contain an equality so it has to be the alternative hypothesis. H 0  :  p  = 0.5 H A  :  p  > 0.5.
Practice:  Identify the Null and Alternative Hypothesis. Express the corresponding null and alternative hypotheses in symbolic form. b) The mean height of professional basketball players is at most 7 ft.
b) The mean height of professional basketball players is at most 7 ft.  Express “a mean of at most 7 ft” in symbols       7.  The expression µ     7 contains an equality so it is the null hypothesis. H 0 : µ    7 H A : µ > 0.5 .
c) The standard deviation of IQ scores of actors is equal to 15. Practice :  Identify the Null and Alternative Hypothesis. Express the corresponding null and alternative hypotheses in symbolic form.
c) The standard deviation of IQ scores of actors is equal to 15. Express the given claim as    = 15. This is the null hypothesis.  H o :    = 15 H A :      15
Example Problem 2 Spinifex pigeons, one of the few bird species that inhabit the desert of Western Australia, rely on seeds for food.  The article “Field metabolism and water requirements of Spinifex pigeons in Western Australia” reported the following Minitab analysis of the weight of seed in grams in the stomach contents of Spinifex pigeons.  Use the analysis to construct and interpret a 95% confidence interval for the average weight of seeds in this type of pigeon’s stomach and compare your results to the hypothesis that the mean seed amount is 1g for all Spinifex pigeons. TEST OF MU = 1.000 VS MU N.E. 1.000 N  MEAN  STDEV  SE MEAN  T  PVALUE WEIGHT  16  1.373  1.034  0.258  1.44  0.17
The Spinifex pigeon problem p (z = 1.373 =1.000)/.258 = p( z= 1.445) p = .1490 95% CI: 1.373    1.96 ( 1.373    1.96 (.258) .8664 <  μ < 1.880 95% CI confident the mean is between .8664 and 1.880 (.8664, 1.880)
Do problems 12, 30 and 42 from The Practice of Statistics 2 nd  edition
Section 10.3 Making Sense of Statistical Significance
Inference Toolbox: Test of Significance Identify the population of interest and the parameter you want to draw conclusions about State null and alternative hypotheses in words and symbols Choose the appropriate inference procedure Verify the conditions for using the procedure If the conditions are met, carry out the inference procedures Interpret the results in the context of the problem
1)  The sample observations are a simple random sample. 2) The conditions for a binomial experiment are  satisfied  The condition  np     5 and  n(1-p)   5 are satisfied,  so the binomial distribution of sample proportions can be approximated by a normal distribution with  µ  =  np  and    =  np(1-p)  . Assumptions for Testing  Claims About a  Population Proportion  p
To determine the significance at a set level, alpha, one needs to calculate the  P-value  for the observed statistic.  Careful consideration of the test statistic, z, can also be used to determine significance:
P -Value The  P -value  (or  p -value  or  probability value ) is the probability of getting a value of the test statistic that is  at least   as extreme  as the one representing the sample data, assuming that the null hypothesis is true.  The null hypothesis is rejected if the  P -value is very small, such as 0.05 or less.
 
Two-tailed Test H 0 :  =  H 1 :        is divided equally between  the two  tails of the critical  region Means less than or greater than
Right-tailed Test H 0 :  =  H 1 :  >  Points Right
Left-tailed Test H 0 : =  H 1 : <  Points Left
 
Critical Region The  critical region  (or  rejection region ) is the set of all values of the test statistic that cause us to reject the null hypothesis.
Practice:  Finding  P -values. First determine whether the given conditions result in a right-tailed test, a left-tailed test, or a two-tailed test, then find the  P -values and state a conclusion about the null hypothesis.  A significance level of    = 0.05 is used in testing the claim that  p   > 0.25 ,  and the sample data result in a test statistic of   z   = 1.18 .
Practice:  Finding  P -values. First determine whether the given conditions result in a right-tailed test, a left-tailed test, or a two-tailed test, then find the  P -values and state a conclusion about the null hypothesis.  a)  With a claim of  p   > 0.25, the test is right-tailed.  Because the test is right-tailed, the graph shows that the  P -value is the area to the right of the test statistic  z   = 1.18.  Using  Table A and find that the area to the  right  of  z   = 1.18 is 0.1190. The  P -value is 0.1190 is greater than the significance level    = 0.05, so we fail to reject the null hypothesis.
Practice:  Finding  P -values. First determine whether the given conditions result in a right-tailed test, a left-tailed test, or a two-tailed test, then find the  P -values and state a conclusion about the null hypothesis.  b)   A significance level of    = 0.05 is used in testing the claim that  p      0.25 ,  and the sample data result in a test statistic of   z   = 2.34 .
Practice:  Finding  P -values. First determine whether the given conditions result in a right-tailed test, a left-tailed test, or a two-tailed test, then find the  P -values and state a conclusion about the null hypothesis.  b)   With a claim of  p      0.25, the test is two-tailed.  Because the test is two-tailed, and because the test statistic of  z   = 2.34 is to the right of the center, the P-value is twice the area to the right of  z   = 2.34.  Using  Table A and find that the area to the right of  z   = 2.34 is 0.0096, so P-value = 2 x 0.0096 = 0.0192.  The P-value of 0.0192 is less than or equal to the significance level, so we reject the null hypothesis.
Assumptions for Testing  Claims About  Population Means   1)  The sample is a simple random    sample.   2)  The value of the population standard    deviation    is known. 3)  Either or both of these conditions is  satisfied: The population is normally  distributed or  n  > 30.
Example Problem 3 10.13 pg. 572 H o :  μ  = 128 H A :  μ   ≠ 128 σ  = 15, x-bar = 126.07, n = 72 z = x-bar  –   μ / ( σ /  ) 2 P (z  ≥  z |1.09|) = 2(1-.862) = .2758
Example Problem 3 #10.13 pg. 572 More than 27% of the time, an SRS of size 72 will have mean blood pressure at least a far away from 128 as the sample.  This is not good evidence that it differs.  We fail to reject the null hypothesis
Example 4:  We are given a data set of 106 body temperatures having a mean of 98.20°F.  Assume that the sample is a simple random sample and that the population standard deviation    is known to be 0.62°F.  Use a 0.05 significance level to test the common belief that the mean body temperature of healthy adults is equal to 98.6°F.  Use the  P -value method. This is a two-tailed test and the test statistic is to the left of the center, so the  P -value is twice the area to the left of  z  = –6.64.  We refer to Table A to find the area to the left of  z  = –6.64 is 0.0001, so the  P -value is 2(0.0001) = 0.0002. =   z   = x – µ x  n 98.2 – 98.6 = –6.64  106 H 0 :    = 98.6 H 1 :       98.6    = 0.05 x  = 98.2    = 0.62
Example 4: We are given a data set of 106 body temperatures having a mean of 98.20°F.  Assume that the sample is a simple random sample and that the population standard deviation    is known to be 0.62°F.  Use a 0.05 significance level to test the common belief that the mean body temperature of healthy adults is equal to 98.6°F.  Use the  P -value method. z   = –6.64 H 0 :    = 98.6 H 1 :       98.6    = 0.05 x  = 98.2    = 0.62
Example 4:  We are given a data set of 106 body temperatures having a mean of 98.20°F.  Assume that the sample is a simple random sample and that the population standard deviation    is known to be 0.62°F.  Use a 0.05 significance level to test the common belief that the mean body temperature of healthy adults is equal to 98.6°F.  Use the  P -value method. z   = –6.64 Because the  P -value of 0.0002 is less than the significance level of    = 0.05, we reject the null hypothesis. There is sufficient evidence to conclude that the mean body temperature of healthy adults differs from 98.6°F. H 0 :    = 98.6 H 1 :       98.6    = 0.05 x  = 98.2    = 0.62
Example 5:  A survey of  n  = 880 randomly  selected adult drivers showed that 56%(or  p-hat  = 0.56) of those respondents admitted to running red lights.  Find the value of the test statistic for the claim that the majority of all adult drivers admit to running red lights.
The claim is that the majority of all Americans run red lights.  That is,  p  > 0.5.  The sample data are  n  = 880, and  p = 0.56. np  = (880)(0.5) = 440    5 n(1-p)  = (880)(0.5) = 440    5
Example 5:  In an article distributed by the Associated Press included these results from a nationwide survey: Of 880 randomly selected drivers, 56% admitted that they run red lights.  The claim is that the majority of all Americans run red lights.  That is,  p  > 0.5.  The sample data are  n  = 880, and  p  = 0.56.  We will use the  P -value Method.  H 0 :  p  = 0.5 H 1 :  p  > 0.5    = 0.05 Referring to Table A, we see that for values of  z  = 3.50 and higher, we use 0.9999 for the cumulative area to the left of the test statistic.  The P-value is 1 – 0.9999 = 0.0001. p(1-p) n p –   p z   =  0.56 – 0.5 (0.5)(0.5) 880 = = 3.56
Interpretation:  In an article distributed by the Associated Press included these results from a nationwide survey: Of 880 randomly selected drivers, 56% admitted that they run red lights.  The claim is that the majority of all Americans run red lights.  That is,  p  > 0.5.  The sample data are  n  = 880, and  p  = 0.56.  We will use the  P -value Method.  H 0 :  p  = 0.5 H 1 :  p  > 0.5    = 0.05 We know from previous chapters that a  z  score of 3.56 is exceptionally large.   The corresponding  P -value of 0.0001 is less than the significance level of    = 0.05, we reject the null hypothesis. There is sufficient evidence to support the claim. p(1-p) n p –   p z   =  0.56 – 0.5 (0.5)(0.5) 880 = = 3.56
Interpretation :  In an article distributed by the Associated Press included these results from a nationwide survey: Of 880 randomly selected drivers, 56% admitted that they run red lights.  The claim is that the majority of all Americans run red lights.  That is,  p  > 0.5.  The sample data are  n  = 880, and  p  = 0.56.  We will use the  P -value Method.  H 0 :  p  = 0.5 H 1 :  p  > 0.5    = 0.05 z   =  3.56
Do problems 42 and 44
Section 10.4 Inference as Decision
Traditional method :  Reject H 0  if the test statistic falls within  the critical region. Fail to  reject   H 0  if the test statistic does  not fall within the critical region. Decision Criterion
P -value method :  Reject H 0  if  P -value       (where    is the  significance level, such as 0.05). Fail to  reject   H 0  if  P -value >   . ** this is the method you will use Decision Criterion**
Decision Criterion Confidence Intervals :  Because a confidence interval estimate of a population parameter contains the likely values of that parameter, reject a claim that the population parameter has a value that is not included in the confidence interval.
 
Wording of Final Conclusion
Accept versus  Fail to Reject Some texts use “accept the null hypothesis.” We will use the phrase “fail to reject” instead. We are not proving the null hypothesis. The sample evidence is not strong enough to warrant rejection (such as not enough evidence to convict a suspect).
Type I Error A  Type I error  is the mistake of rejecting the null hypothesis when it is true. The symbol    (alpha)  is used to represent the probability of a type I error.
Type II Error A  Type II error  is the mistake of failing to reject the null hypothesis when it is false . The symbol    (beta)  is used to represent the probability of a type II error .
 
Controlling Type I and  Type II Errors For any fixed   , an increase in the sample size  n   will cause a decrease in   For any fixed sample size  n   , a decrease in    will cause an increase in   .  Conversely, an increase in     will cause a decrease in    . To decrease both     and   , increase the sample size.
The ability to detect a false hypothesis is called the  power  of the test. The  power  of the test is the probability that it correctly rejects a false null hypothesis. Power  = 1 –  β Type II error β OK Fail to Reject H o Power  1- β Type I error  α Reject H o H o  False H o  True
When  power   is high we can be confident If we fail to reject the false null hypothesis,  the test’s  power  comes into question Was the sample big enough? Did we miss an effect? Did we fail to gather sufficient data? Was there too much variability? What  power   is needed? The choice of  power   is frequently determined by financial or scientific concerns.
Summary of the P-Value Method Identify the specific claim or hypothesis to be tested, and put it in symbolic form. Give the symbolic form that must be true when the original claim is false. Of the two symbolic expressions, let the alternative hypothesis H A  be the one not containing the equality, so that H A  uses the symbol > or < or ≠. Let the null hypothesis H 0  be the symbolic expression that the parameter equals the fixed value being considered. Select the significance level  α  based on the seriousness of a type 1 error.  Make  α  small if the consequences of rejecting a true H 0  are severe.  The values of 0.05 and 0.01 are most frequently used. Identify the statistic that is relevant to this test and determine its sampling distribution (such as normal, t, or chi square). Find the test statistic and find the P-value.  Draw a graph and show the test statistic and P-value. Reject H 0  if the P-value is less than or equal to the significance level  α  .  Fail to reject H 0  if the P-value is greater than  α  . Restate the decision in simple, non-technical terms and address the original claim.
Do problems 68 & 82 from The Practice of Statistics 2 nd  Edition
Baseball Problem Check conditions: (1) plausible independence condition (2) random condition (3) 10% condition (4) np > 10 and n(1-p)> 10 In 2002 in Major League Baseball there were 2425 regular season games.  Home teams won 1314 of 2425 or 54.2%.  Can this be explained by national sampling variability or home field advantage?  We want to know whether the home team in professional baseball is more likely to win.  The parameter of interest is the proportion of home team wins.  With no advantage, we’d expect that proportion to be .50.
p-hat = .542 z = p-hat – p/ sd phat =  p(z> 4.14) = p < .0001 There is a .0001 chance the home team is more likely to win.
CAUTION When the calculation of  p  results in a  decimal with many places, store the  number on your calculator and use all of the  decimals when evaluating the  z  test statistic. Large errors can result from rounding  p hat   too much.  

More Related Content

PDF
Practice test ch 9 inferences from two samples
PDF
Practice test ch 8 hypothesis testing ch 9 two populations
PDF
Practice Test 1
PDF
Practice Test Ch 8 Hypothesis Testing
PDF
Practice Test 4A Hypothesis Testing Solution
PDF
Solution to the practice test ch 8 hypothesis testing ch 9 two populations
PPTX
Estimating a Population Proportion
PPTX
Chapter 4 estimation of parameters
Practice test ch 9 inferences from two samples
Practice test ch 8 hypothesis testing ch 9 two populations
Practice Test 1
Practice Test Ch 8 Hypothesis Testing
Practice Test 4A Hypothesis Testing Solution
Solution to the practice test ch 8 hypothesis testing ch 9 two populations
Estimating a Population Proportion
Chapter 4 estimation of parameters

What's hot (20)

PPTX
Two Means, Two Dependent Samples, Matched Pairs
PDF
Practice test 4A Hypothesis testing
PPTX
Two Variances or Standard Deviations
PPTX
Addition Rule and Multiplication Rule
PPTX
Estimating a Population Mean
PPTX
1.1 statistical and critical thinking
PDF
Stat sample test ch 12
PPTX
Sampling Distributions and Estimators
PDF
Solution to the practice test ch 10 correlation reg ch 11 gof ch12 anova
PDF
Practice Test Chapter 6 (Normal Probability Distributions)
PPTX
Normal as Approximation to Binomial
PPTX
Sec 3.1 measures of center
PPTX
Testing a Claim About a Standard Deviation or Variance
PPTX
Basics of Hypothesis Testing
PPTX
Two Means, Independent Samples
PPTX
The Central Limit Theorem
PPT
Chapter15
PPTX
The Standard Normal Distribution
PPTX
Sec 1.3 collecting sample data
PPTX
Contingency Tables
Two Means, Two Dependent Samples, Matched Pairs
Practice test 4A Hypothesis testing
Two Variances or Standard Deviations
Addition Rule and Multiplication Rule
Estimating a Population Mean
1.1 statistical and critical thinking
Stat sample test ch 12
Sampling Distributions and Estimators
Solution to the practice test ch 10 correlation reg ch 11 gof ch12 anova
Practice Test Chapter 6 (Normal Probability Distributions)
Normal as Approximation to Binomial
Sec 3.1 measures of center
Testing a Claim About a Standard Deviation or Variance
Basics of Hypothesis Testing
Two Means, Independent Samples
The Central Limit Theorem
Chapter15
The Standard Normal Distribution
Sec 1.3 collecting sample data
Contingency Tables
Ad

Similar to Chapter10 Revised (20)

PPT
Telesidang 4 bab_8_9_10stst
PPTX
STATISTIC ESTIMATION
PDF
Unit 4b- Hypothesis testing and confidence intervals (Slides - up to slide 17...
PPT
Sampling theory
PPTX
Confidence intervals, hypothesis testing and statistical tests of significanc...
PPTX
Probalities, Estimations and Hypothesis Testing.pptx
PPT
Chapter09
PPTX
Stats chapter 11
PDF
Test of hypotheses part i
PDF
XỬ LÝ SỐ LIỆU TRONG PHÂN TÍCH DƯỢC_02.pdf
PPT
More Statistics
PPTX
standar error of proporton hypthesis testing standard error of diff t test.pptx
PPT
Chapter 8-hypothesis-testing-1211425712197151-9
PPT
Chapter 8 – Hypothesis Testing
PPT
Chapter 8 – Hypothesis Testing
PPT
Normal distribution
PPTX
Hypothesis_Testing_Statistic_Zscore.pptx
PPTX
Stats chapter 11
DOCX
Inferential statistics
PPT
Review Z Test Ci 1
Telesidang 4 bab_8_9_10stst
STATISTIC ESTIMATION
Unit 4b- Hypothesis testing and confidence intervals (Slides - up to slide 17...
Sampling theory
Confidence intervals, hypothesis testing and statistical tests of significanc...
Probalities, Estimations and Hypothesis Testing.pptx
Chapter09
Stats chapter 11
Test of hypotheses part i
XỬ LÝ SỐ LIỆU TRONG PHÂN TÍCH DƯỢC_02.pdf
More Statistics
standar error of proporton hypthesis testing standard error of diff t test.pptx
Chapter 8-hypothesis-testing-1211425712197151-9
Chapter 8 – Hypothesis Testing
Chapter 8 – Hypothesis Testing
Normal distribution
Hypothesis_Testing_Statistic_Zscore.pptx
Stats chapter 11
Inferential statistics
Review Z Test Ci 1
Ad

Recently uploaded (20)

PDF
BP 704 T. NOVEL DRUG DELIVERY SYSTEMS (UNIT 1)
PPTX
Chinmaya Tiranga Azadi Quiz (Class 7-8 )
PDF
Vision Prelims GS PYQ Analysis 2011-2022 www.upscpdf.com.pdf
PPTX
TNA_Presentation-1-Final(SAVE)) (1).pptx
DOC
Soft-furnishing-By-Architect-A.F.M.Mohiuddin-Akhand.doc
PPTX
CHAPTER IV. MAN AND BIOSPHERE AND ITS TOTALITY.pptx
PDF
MBA _Common_ 2nd year Syllabus _2021-22_.pdf
PDF
Empowerment Technology for Senior High School Guide
PDF
HVAC Specification 2024 according to central public works department
PDF
advance database management system book.pdf
PPTX
Virtual and Augmented Reality in Current Scenario
PPTX
B.Sc. DS Unit 2 Software Engineering.pptx
PDF
1.3 FINAL REVISED K-10 PE and Health CG 2023 Grades 4-10 (1).pdf
PDF
RTP_AR_KS1_Tutor's Guide_English [FOR REPRODUCTION].pdf
PDF
FOISHS ANNUAL IMPLEMENTATION PLAN 2025.pdf
PDF
medical_surgical_nursing_10th_edition_ignatavicius_TEST_BANK_pdf.pdf
PDF
Chinmaya Tiranga quiz Grand Finale.pdf
PDF
1_English_Language_Set_2.pdf probationary
PDF
Paper A Mock Exam 9_ Attempt review.pdf.
PDF
OBE - B.A.(HON'S) IN INTERIOR ARCHITECTURE -Ar.MOHIUDDIN.pdf
BP 704 T. NOVEL DRUG DELIVERY SYSTEMS (UNIT 1)
Chinmaya Tiranga Azadi Quiz (Class 7-8 )
Vision Prelims GS PYQ Analysis 2011-2022 www.upscpdf.com.pdf
TNA_Presentation-1-Final(SAVE)) (1).pptx
Soft-furnishing-By-Architect-A.F.M.Mohiuddin-Akhand.doc
CHAPTER IV. MAN AND BIOSPHERE AND ITS TOTALITY.pptx
MBA _Common_ 2nd year Syllabus _2021-22_.pdf
Empowerment Technology for Senior High School Guide
HVAC Specification 2024 according to central public works department
advance database management system book.pdf
Virtual and Augmented Reality in Current Scenario
B.Sc. DS Unit 2 Software Engineering.pptx
1.3 FINAL REVISED K-10 PE and Health CG 2023 Grades 4-10 (1).pdf
RTP_AR_KS1_Tutor's Guide_English [FOR REPRODUCTION].pdf
FOISHS ANNUAL IMPLEMENTATION PLAN 2025.pdf
medical_surgical_nursing_10th_edition_ignatavicius_TEST_BANK_pdf.pdf
Chinmaya Tiranga quiz Grand Finale.pdf
1_English_Language_Set_2.pdf probationary
Paper A Mock Exam 9_ Attempt review.pdf.
OBE - B.A.(HON'S) IN INTERIOR ARCHITECTURE -Ar.MOHIUDDIN.pdf

Chapter10 Revised

  • 1. Chapter 10 Introduction to Inference Section 10.1 Estimating with Confidence
  • 2. Twenty five samples from the same population gave these 95% confidence intervals, 95% of all samples give an interval that contains the population mean u.
  • 3. In 95% of all samples, x-bar lies within ± 9 of the unknown population mean µ. Also µ lies within ± 9 of x-bar of the samples.
  • 4. To say that x-bar is a 95% confidence interval for the population mean µ is to say that in repeated samples, 95% of these intervals capture the µ
  • 5. The central probability 0.8 under a standard normal curve lies between -1.28 and + 1.28. That is, there is area .1 to the left and to the right of the curve.
  • 6. The central probability C under a standard normal curve lies between –z* and + z* has area (1-C)/2 to its right under the curve, we call it the upper (1-C)/2 critical value.
  • 7. As we learned in Chapter 9, because of sampling variability, the statistic calculated from a sample is rarely equal to the true parameter of interest. Therefore, when we are trying to estimate a parameter, we must go beyond our statistic value to construct a reasonable range that captures the true parameter value.
  • 8. Confidence Interval: The data must come from SRS of the population of interest The sampling distribution of x-bar is approximately normal
  • 9. Sampling Distribution of p-hat Proportions: Let p-hat be the proportion of successes in a random sample of size n from a population whose proportion is p. The mean value of p-hat is µp-hat and standard deviation σ -p-hat Rule 1: µ p-hat = p Rule 2: σ p-hat = Rule 3: when n is large and p is not very close to 0 or 1, the sampling distribution of p is approximately normal.
  • 10. Sampling Distribution of x-bar – Means Let x-bar be the mean of the observations in a random sample of size n having mean µ and standard deviation σ . The mean value of the x-bar distribution µ and standard deviation of x-bar is σ Rule 1: µ xbar = µ Rule 2: σ xbar = σ / as long as 10% of the population is in the sample. Rule 3: when the population distribution is normal, the sampling distribution of x-bar is also normal for any sample size n Rule 4: (Central Limit Theorem) CLT when n is sufficiently large, the sampling distribution of x-bar is well approximated by a normal curve even when the population is not itself normal.
  • 11. Confidence Level C Single Tail Area (1-C)/2 Critical z value 90% .05 1.645 95% .025 1.960 99% .005 2.576
  • 12. Estimating with Confidence – Interval Behavior Margin of Error = (z critical value) σ / What are some ways in which we could lower our margin of error? z get smaller – as smaller confidence level σ gets smaller n gets larger
  • 13. Confidence Interval for p Single Proportion Confidence Interval for µ Single Mean ( σ known) Conditions Conditions If an SRS of size n is chosen from a population with unknown proportion p, then a level C Confidence Interval for p is: If an SRS of size n is chosen from a population with unknown proportion, µ then a level C Confidence Interval for µ is: p-hat ± z critical value ( ((p(1-p)/ n) X-bar ± z critical value ( σ /
  • 14. We will follow the 4 step process outlined below throughout the chapter.
  • 15. Inference Toolbox: Confidence Intervals To Construct a Confidence Interval for a parameter (1)Identify the population of interest and the parameter you want to draw conclusions about (2)Choose the appropriate inference procedure. Verify the conditions for the selected parameter (3)If the conditions are met, carry out the inference procedures. CI = estimate ± margin of error (4)Interpret the results in the context of the problem
  • 16. Example Problem 1 A manufacturer of high resolution video terminals must control the tension of the viewing screen to avoid tears and wrinkles. The tension is measured in millivolts (mV). Careful study has shown that when the process is operating properly, the standard deviation of the tension reading is 43mV. (1) Given a sample of 20 screens, with xbar = 306.3 mV, from a single day’s production, construct a 90% confidence interval for the mean tension µ of all the screens produced on this day. Follow the steps from the previous slide.
  • 17. 1)Identify the population of interest and the parameter you want to draw conclusions about all video terminal produced on the same day Choose the appropriate inference procedure. σ is known use CI for µ of SRS of 20 Verify the conditions for the selected parameter x-bar is approximately normal If the conditions are met, carry out the inference procedures. CI = estimate ± margin of error CI = x-bar ± critical z value · σ / CI = 306.3 ± 1.645 · 43 / = 306.3 ± 15.8 = (290.5, 322.1) Interpret the results in the context of the problem We are 90% confident that the true mean is between 290.5 and 322.1
  • 18. 2. Suppose the manufacturer wants 99% confidence rather than 90%. Using your data from problem 1, construct a 99% CI. How does it compare to the 90% CI? Why? 306.3 ± 2.576 (43/ ) = 306.3 ± 24.768 = (281.532, 331.068) the interval is larger. We have greater confidence that the interval captures the true value.
  • 19. 3. Company management wants to report the mean screen tension for the day’s production accurate within  5 mV with 95% confidence. How large a sample of video monitors must be measured to comply with this request? 1.960(43)/ 84.28 = 16.856 = n = 284.12 approximately 285
  • 20. Do problems 1,7 and 10 from The Practice of Statistics 2 nd edition.
  • 21. Section 10.2 Tests of Significance
  • 22. Tests of Significance: We have learned that Confidence Intervals can be used to estimate a parameter. Often in statistics we want to use sample data to determine whether or not a claim or hypothesis about a parameter is plausible. A test of significance is a procedure in which we can use sample data to test such a claim. We will focus on tests about a population mean μ and proportions p.
  • 23. Note about Identifying H 0 and H A
  • 24. Practice: Identify the Null and Alternative Hypothesis. Express the corresponding null and alternative hypotheses in symbolic form. The proportion of drivers who admit to running red lights is greater than 0.5.
  • 25. The proportion of drivers who admit to running red lights is greater than 0.5. First we express the given claim as p > 0.5. Next, we notice that the statement does not contain an equality so it has to be the alternative hypothesis. H 0 : p = 0.5 H A : p > 0.5.
  • 26. Practice: Identify the Null and Alternative Hypothesis. Express the corresponding null and alternative hypotheses in symbolic form. b) The mean height of professional basketball players is at most 7 ft.
  • 27. b) The mean height of professional basketball players is at most 7 ft. Express “a mean of at most 7 ft” in symbols   7. The expression µ  7 contains an equality so it is the null hypothesis. H 0 : µ  7 H A : µ > 0.5 .
  • 28. c) The standard deviation of IQ scores of actors is equal to 15. Practice : Identify the Null and Alternative Hypothesis. Express the corresponding null and alternative hypotheses in symbolic form.
  • 29. c) The standard deviation of IQ scores of actors is equal to 15. Express the given claim as  = 15. This is the null hypothesis. H o :  = 15 H A :   15
  • 30. Example Problem 2 Spinifex pigeons, one of the few bird species that inhabit the desert of Western Australia, rely on seeds for food. The article “Field metabolism and water requirements of Spinifex pigeons in Western Australia” reported the following Minitab analysis of the weight of seed in grams in the stomach contents of Spinifex pigeons. Use the analysis to construct and interpret a 95% confidence interval for the average weight of seeds in this type of pigeon’s stomach and compare your results to the hypothesis that the mean seed amount is 1g for all Spinifex pigeons. TEST OF MU = 1.000 VS MU N.E. 1.000 N MEAN STDEV SE MEAN T PVALUE WEIGHT 16 1.373 1.034 0.258 1.44 0.17
  • 31. The Spinifex pigeon problem p (z = 1.373 =1.000)/.258 = p( z= 1.445) p = .1490 95% CI: 1.373  1.96 ( 1.373  1.96 (.258) .8664 < μ < 1.880 95% CI confident the mean is between .8664 and 1.880 (.8664, 1.880)
  • 32. Do problems 12, 30 and 42 from The Practice of Statistics 2 nd edition
  • 33. Section 10.3 Making Sense of Statistical Significance
  • 34. Inference Toolbox: Test of Significance Identify the population of interest and the parameter you want to draw conclusions about State null and alternative hypotheses in words and symbols Choose the appropriate inference procedure Verify the conditions for using the procedure If the conditions are met, carry out the inference procedures Interpret the results in the context of the problem
  • 35. 1) The sample observations are a simple random sample. 2) The conditions for a binomial experiment are satisfied The condition np  5 and n(1-p)  5 are satisfied,  so the binomial distribution of sample proportions can be approximated by a normal distribution with µ = np and  = np(1-p) . Assumptions for Testing Claims About a Population Proportion p
  • 36. To determine the significance at a set level, alpha, one needs to calculate the P-value for the observed statistic. Careful consideration of the test statistic, z, can also be used to determine significance:
  • 37. P -Value The P -value (or p -value or probability value ) is the probability of getting a value of the test statistic that is at least as extreme as the one representing the sample data, assuming that the null hypothesis is true. The null hypothesis is rejected if the P -value is very small, such as 0.05 or less.
  • 38.  
  • 39. Two-tailed Test H 0 : = H 1 :   is divided equally between the two tails of the critical region Means less than or greater than
  • 40. Right-tailed Test H 0 : = H 1 : > Points Right
  • 41. Left-tailed Test H 0 : = H 1 : < Points Left
  • 42.  
  • 43. Critical Region The critical region (or rejection region ) is the set of all values of the test statistic that cause us to reject the null hypothesis.
  • 44. Practice: Finding P -values. First determine whether the given conditions result in a right-tailed test, a left-tailed test, or a two-tailed test, then find the P -values and state a conclusion about the null hypothesis. A significance level of  = 0.05 is used in testing the claim that p > 0.25 , and the sample data result in a test statistic of z = 1.18 .
  • 45. Practice: Finding P -values. First determine whether the given conditions result in a right-tailed test, a left-tailed test, or a two-tailed test, then find the P -values and state a conclusion about the null hypothesis. a) With a claim of p > 0.25, the test is right-tailed. Because the test is right-tailed, the graph shows that the P -value is the area to the right of the test statistic z = 1.18. Using Table A and find that the area to the right of z = 1.18 is 0.1190. The P -value is 0.1190 is greater than the significance level  = 0.05, so we fail to reject the null hypothesis.
  • 46. Practice: Finding P -values. First determine whether the given conditions result in a right-tailed test, a left-tailed test, or a two-tailed test, then find the P -values and state a conclusion about the null hypothesis. b) A significance level of  = 0.05 is used in testing the claim that p  0.25 , and the sample data result in a test statistic of z = 2.34 .
  • 47. Practice: Finding P -values. First determine whether the given conditions result in a right-tailed test, a left-tailed test, or a two-tailed test, then find the P -values and state a conclusion about the null hypothesis. b) With a claim of p  0.25, the test is two-tailed. Because the test is two-tailed, and because the test statistic of z = 2.34 is to the right of the center, the P-value is twice the area to the right of z = 2.34. Using Table A and find that the area to the right of z = 2.34 is 0.0096, so P-value = 2 x 0.0096 = 0.0192. The P-value of 0.0192 is less than or equal to the significance level, so we reject the null hypothesis.
  • 48. Assumptions for Testing Claims About Population Means 1) The sample is a simple random   sample. 2) The value of the population standard deviation  is known. 3) Either or both of these conditions is satisfied: The population is normally distributed or n > 30.
  • 49. Example Problem 3 10.13 pg. 572 H o : μ = 128 H A : μ ≠ 128 σ = 15, x-bar = 126.07, n = 72 z = x-bar – μ / ( σ / ) 2 P (z ≥ z |1.09|) = 2(1-.862) = .2758
  • 50. Example Problem 3 #10.13 pg. 572 More than 27% of the time, an SRS of size 72 will have mean blood pressure at least a far away from 128 as the sample. This is not good evidence that it differs. We fail to reject the null hypothesis
  • 51. Example 4: We are given a data set of 106 body temperatures having a mean of 98.20°F. Assume that the sample is a simple random sample and that the population standard deviation  is known to be 0.62°F. Use a 0.05 significance level to test the common belief that the mean body temperature of healthy adults is equal to 98.6°F. Use the P -value method. This is a two-tailed test and the test statistic is to the left of the center, so the P -value is twice the area to the left of z = –6.64. We refer to Table A to find the area to the left of z = –6.64 is 0.0001, so the P -value is 2(0.0001) = 0.0002. = z = x – µ x  n 98.2 – 98.6 = –6.64  106 H 0 :  = 98.6 H 1 :   98.6  = 0.05 x = 98.2  = 0.62
  • 52. Example 4: We are given a data set of 106 body temperatures having a mean of 98.20°F. Assume that the sample is a simple random sample and that the population standard deviation  is known to be 0.62°F. Use a 0.05 significance level to test the common belief that the mean body temperature of healthy adults is equal to 98.6°F. Use the P -value method. z = –6.64 H 0 :  = 98.6 H 1 :   98.6  = 0.05 x = 98.2  = 0.62
  • 53. Example 4: We are given a data set of 106 body temperatures having a mean of 98.20°F. Assume that the sample is a simple random sample and that the population standard deviation  is known to be 0.62°F. Use a 0.05 significance level to test the common belief that the mean body temperature of healthy adults is equal to 98.6°F. Use the P -value method. z = –6.64 Because the P -value of 0.0002 is less than the significance level of  = 0.05, we reject the null hypothesis. There is sufficient evidence to conclude that the mean body temperature of healthy adults differs from 98.6°F. H 0 :  = 98.6 H 1 :   98.6  = 0.05 x = 98.2  = 0.62
  • 54. Example 5: A survey of n = 880 randomly selected adult drivers showed that 56%(or p-hat = 0.56) of those respondents admitted to running red lights. Find the value of the test statistic for the claim that the majority of all adult drivers admit to running red lights.
  • 55. The claim is that the majority of all Americans run red lights. That is, p > 0.5. The sample data are n = 880, and p = 0.56. np = (880)(0.5) = 440  5 n(1-p) = (880)(0.5) = 440  5
  • 56. Example 5: In an article distributed by the Associated Press included these results from a nationwide survey: Of 880 randomly selected drivers, 56% admitted that they run red lights. The claim is that the majority of all Americans run red lights. That is, p > 0.5. The sample data are n = 880, and p = 0.56. We will use the P -value Method.  H 0 : p = 0.5 H 1 : p > 0.5  = 0.05 Referring to Table A, we see that for values of z = 3.50 and higher, we use 0.9999 for the cumulative area to the left of the test statistic. The P-value is 1 – 0.9999 = 0.0001. p(1-p) n p – p z =  0.56 – 0.5 (0.5)(0.5) 880 = = 3.56
  • 57. Interpretation: In an article distributed by the Associated Press included these results from a nationwide survey: Of 880 randomly selected drivers, 56% admitted that they run red lights. The claim is that the majority of all Americans run red lights. That is, p > 0.5. The sample data are n = 880, and p = 0.56. We will use the P -value Method.  H 0 : p = 0.5 H 1 : p > 0.5  = 0.05 We know from previous chapters that a z score of 3.56 is exceptionally large. The corresponding P -value of 0.0001 is less than the significance level of  = 0.05, we reject the null hypothesis. There is sufficient evidence to support the claim. p(1-p) n p – p z =  0.56 – 0.5 (0.5)(0.5) 880 = = 3.56
  • 58. Interpretation : In an article distributed by the Associated Press included these results from a nationwide survey: Of 880 randomly selected drivers, 56% admitted that they run red lights. The claim is that the majority of all Americans run red lights. That is, p > 0.5. The sample data are n = 880, and p = 0.56. We will use the P -value Method.  H 0 : p = 0.5 H 1 : p > 0.5  = 0.05 z = 3.56
  • 59. Do problems 42 and 44
  • 60. Section 10.4 Inference as Decision
  • 61. Traditional method : Reject H 0 if the test statistic falls within the critical region. Fail to reject H 0 if the test statistic does not fall within the critical region. Decision Criterion
  • 62. P -value method : Reject H 0 if P -value   (where  is the significance level, such as 0.05). Fail to reject H 0 if P -value >  . ** this is the method you will use Decision Criterion**
  • 63. Decision Criterion Confidence Intervals : Because a confidence interval estimate of a population parameter contains the likely values of that parameter, reject a claim that the population parameter has a value that is not included in the confidence interval.
  • 64.  
  • 65. Wording of Final Conclusion
  • 66. Accept versus Fail to Reject Some texts use “accept the null hypothesis.” We will use the phrase “fail to reject” instead. We are not proving the null hypothesis. The sample evidence is not strong enough to warrant rejection (such as not enough evidence to convict a suspect).
  • 67. Type I Error A Type I error is the mistake of rejecting the null hypothesis when it is true. The symbol   (alpha) is used to represent the probability of a type I error.
  • 68. Type II Error A Type II error is the mistake of failing to reject the null hypothesis when it is false . The symbol  (beta) is used to represent the probability of a type II error .
  • 69.  
  • 70. Controlling Type I and Type II Errors For any fixed  , an increase in the sample size n will cause a decrease in  For any fixed sample size n , a decrease in  will cause an increase in  . Conversely, an increase in  will cause a decrease in  . To decrease both  and  , increase the sample size.
  • 71. The ability to detect a false hypothesis is called the power of the test. The power of the test is the probability that it correctly rejects a false null hypothesis. Power = 1 – β Type II error β OK Fail to Reject H o Power 1- β Type I error α Reject H o H o False H o True
  • 72. When power is high we can be confident If we fail to reject the false null hypothesis, the test’s power comes into question Was the sample big enough? Did we miss an effect? Did we fail to gather sufficient data? Was there too much variability? What power is needed? The choice of power is frequently determined by financial or scientific concerns.
  • 73. Summary of the P-Value Method Identify the specific claim or hypothesis to be tested, and put it in symbolic form. Give the symbolic form that must be true when the original claim is false. Of the two symbolic expressions, let the alternative hypothesis H A be the one not containing the equality, so that H A uses the symbol > or < or ≠. Let the null hypothesis H 0 be the symbolic expression that the parameter equals the fixed value being considered. Select the significance level α based on the seriousness of a type 1 error. Make α small if the consequences of rejecting a true H 0 are severe. The values of 0.05 and 0.01 are most frequently used. Identify the statistic that is relevant to this test and determine its sampling distribution (such as normal, t, or chi square). Find the test statistic and find the P-value. Draw a graph and show the test statistic and P-value. Reject H 0 if the P-value is less than or equal to the significance level α . Fail to reject H 0 if the P-value is greater than α . Restate the decision in simple, non-technical terms and address the original claim.
  • 74. Do problems 68 & 82 from The Practice of Statistics 2 nd Edition
  • 75. Baseball Problem Check conditions: (1) plausible independence condition (2) random condition (3) 10% condition (4) np > 10 and n(1-p)> 10 In 2002 in Major League Baseball there were 2425 regular season games. Home teams won 1314 of 2425 or 54.2%. Can this be explained by national sampling variability or home field advantage? We want to know whether the home team in professional baseball is more likely to win. The parameter of interest is the proportion of home team wins. With no advantage, we’d expect that proportion to be .50.
  • 76. p-hat = .542 z = p-hat – p/ sd phat = p(z> 4.14) = p < .0001 There is a .0001 chance the home team is more likely to win.
  • 77. CAUTION When the calculation of p results in a decimal with many places, store the number on your calculator and use all of the decimals when evaluating the z test statistic. Large errors can result from rounding p hat too much.  

Editor's Notes

  • #67: page 374 of text The term ‘accept’ is somewhat misleading, implying incorrectly that the null has been proven. The phrase ‘fail to reject’ represents the result more correctly.
  • #68: Example on page 375 of text
  • #71: page 377 in text