SlideShare a Scribd company logo
7
Most read
8
Most read
14
Most read
Hypothesis Testing
Same idea, different angle
In brief
Interval Estimation
• You collect sample(s)
• Make a guess about the range of
population parameters – lowest
and highest values
• μ ε [μL, μH]. What are μL and μH?
• μ1 – μ2 ε [a, b]. What are a and b?
• Talk about range of μ
Hypothesis Testing
• You collect sample(s)
• You want make (more) precise
statements about the parameter
values
• Can μ > 50?
• Is μ1 > μ2?
• Talk about probabilities of certain μ
Statistical Hypothesis - I
• An assertion or conjecture
• About distribution on one or more random variables
• μ = 7, μ 9,
• A hypothesis, if true, might completely specify the distribution,
then it is simple hypothesis. (μ = 7)
• If not, composite hypothesis (μ 9)
Statistical Hypothesis - II
• Importance of alternative hypothesis
• X ≈ N(μ, 1). Sample.
H0: μ = 50, HA: μ ≠ 50
• X ≈ N(μ, 1). Sample.
H0: , HA: μ < 50
Null and Alternative Hypotheses
H0 HA
Assumption or status
quo, nothing new
Rejection of an assumption
Assumed to be true
or given
Rejection of an assumption or
given
Negation of the
research question
Research q. needs to be proven
Always contains an
equality
Does not contain an equality
Null and Alternative Statements
• All statistical statements are made in relation to the null hyp.
• As researchers, we either reject the null hypothesis or fail to reject
the null hypothesis. We do not accept the null hypothesis.
• This is because the null is assumed to be true from the start.
• If we reject the null hypothesis, we conclude the data supports
the alternative hypothesis.
• However, if we fail to reject the null, that does not prove the null
is “true”.
• We only set up an assumption to either reject or fail to reject.
Example
• During the 2010-11 English Premier League season, Manchester United
home matches had an average attendance of 74,961. A club
marketing analyst would like to see if attendance decreased during
the most recent season. Establish a null and alternative hypothesis for
this analysis.
• What is our assumption?
• We can only assume that the attendance remained the same.
• Marketing analyst: interested in knowing if the attendance decreased.
• Which hypothesis format should we choose?
• I would choose: H0: 74961 and Ha < 74961
Thinking about hypotheses
• When formulating a statistical hypothesis:
• Ask: am I testing an assumption, or the status quo, that already
exists? Or am I testing a claim or assertion beyond what I already
know or can know?
• The null and the alternative are ALWAYS in opposition to each
other; cannot both be true.
Significance levels
• Consider a population with distribution where is unknown.
• We want to test a hypothesis about .
• Suppose F is a normal dist. With mean and variance =1.
• H0: (simple hyp.) OR H0 : (composite)
• To test this hyp, we observe a sample, and based on this, we
have to decide whether or not to accept H0.
• We define a region “critical region” with the proviso that the
hypothesis is to be rejected if the value is in the critical region.
Significance levels and errors
• In our example, variance =1, and sd =1.
• SE of mean =
• 95% CI =>
• Reject the null ( when sample average differs from 1 by more
than 1.96 divided by sq. root of the sample size.
• Type I error: rejecting null when it is supported by data.
• Type II errors: fail to reject the null when it is false.
Significance levels
• We are not determining is H0 is ”true” but only if its validity is
consistent with the resultant data.
• Thus, H0 is rejected if the resultant data are unlikely when H0 is
true.
• Specify , and then require the test to have the property that
whenever H0 is true, its probability of being rejected is never
greater than .
• Value of is the level of significance of the test.
• It is usually set in advance, common values: 0.1; 0.05; 0.01
Basic Method
• Suppose
• To develop a test of , at the level of significance is to start by
determining a point estimate of say d(X).
• The hypothesis is rejected if d(X) is “far away” from the region .
• To determine how “far away” it needs to be for us to reject , we
need to determine the probability dist of d(X) when is true.
• This will give us the critical region to make the test have the
required significance level
Die example
• 600 rolls of the die
• H0 : die is fair
• Ha : die is NOT fair
• In plain English: is the variation in outcomes due to chance, or is
the variation beyond what random chance would allow?
• How much should our data vary for us to conclude that our die is
not fair? i.e. we reject the null?
Errors Possible
To test H0, set , and then require the test to have the probability of Type
I error occurring can never be greater than
To be more precise, this is what we mean by α and β
Critical Value
• If we desire that the test has significance level then we must
determine the critical value c that will make the type I error =
• We can determine whether or not to accept the null hypothesis
by computing, first, the value of the test statistic,
• And second, the probability that a unit normal would (in
absolute value) exceed that quantity.
• This probability is called the p-value of the test – gives the critical
significance level.
• Relationship between alpha and p: reject null if p- value < alpha
Type I and Type II errors again
• : probability of committing a Type 1 error.
• : probability of committing a Type II error
• As decreases (level of significance increases), Type I error
decreases
• As decreases, probability of Type II error increases.
• Delicate balance!
Central Idea
Type I Error: reject H0 when it is correct.
Type II Error: accept H0 when it is false.
Which is in your control or smaller?
Type I and Type II errors
What is the null hypothesis here?
ONE-SIDED AND TWO-SIDED TESTS
t-Test
• When mean and std. dev. are both unknown
F TEST
• 5 cans of tuna filled by machines.The quality assurance manager wishes to test the variability
of two machines.
• Machine 1: n=25; mean: 5.0492
• Machine 2: n=22; mean: 4.9808
• Variance 1: 0.1130
• Variance 2: 0.0137 oz.
• Question: is this difference due to sampling error or is it statistically significant?
• Use F test to compare variances.
REGRESSION- ECOTRIX
• In this course we have learnt to measure effects. Students understand more of statistics concepts in evening classes
then morning classes. But does “evening” open up students’ brains? Does “evening” or “moon light” improve
students’ comprehension skills?
• Causal Effects – Next leap in data comprehension!
• Regress Wage Education:
– Does education affect wage?
– Wagei = α + β * Educationi + εi
• Regress Wage Education Gender:
– For a given gender, does education affect wage?
– For a given education, does gender affect wage?
– Wagei = α + β1 * Educationi + β2 * Genderi + εi

More Related Content

PPTX
Statistical inference concept, procedure of hypothesis testing
PPTX
hypothesis testing
PDF
Statistical Distributions
PPTX
Hypothesis testing
PPTX
The Chi-Square Statistic: Tests for Goodness of Fit and Independence
PPT
Hypothesis Testing
PDF
Chi square test
PPTX
Normal distribution
Statistical inference concept, procedure of hypothesis testing
hypothesis testing
Statistical Distributions
Hypothesis testing
The Chi-Square Statistic: Tests for Goodness of Fit and Independence
Hypothesis Testing
Chi square test
Normal distribution

What's hot (20)

PPT
Basics of Confidence Intervals
PPT
Inferential Statistics
PPTX
Hypothesis testing
PPTX
Chi square test
PPTX
Testing of hypotheses
PPT
Testing Hypothesis
PPT
Hypothesis testing and p-value, www.eyenirvaan.com
PPTX
Chi square test
PDF
Hypothesis testing
PPT
PPTX
Point Estimation
PDF
Hypothesis Testing
PPTX
Hypothesis testing
PPTX
F test and ANOVA
PPT
5 6 probability and odds lesson
PPTX
Hypothesis and Hypothesis Testing
PPTX
Two Variances or Standard Deviations
PPTX
Chi square
PDF
Research method ch07 statistical methods 1
PPT
Quartile in Statistics
Basics of Confidence Intervals
Inferential Statistics
Hypothesis testing
Chi square test
Testing of hypotheses
Testing Hypothesis
Hypothesis testing and p-value, www.eyenirvaan.com
Chi square test
Hypothesis testing
Point Estimation
Hypothesis Testing
Hypothesis testing
F test and ANOVA
5 6 probability and odds lesson
Hypothesis and Hypothesis Testing
Two Variances or Standard Deviations
Chi square
Research method ch07 statistical methods 1
Quartile in Statistics
Ad

Similar to Hypothesis testing - college hypothesis testing (20)

PPT
teast mean one and two sample
PPTX
HYPOTHESIS TESTING 20200702.pptx
PPT
Chapter 9 Fundamental of Hypothesis Testing.ppt
PDF
Test of hypotheses part i
PPT
Hypothesis_Testing.ppt
PPTX
Edexcel Hypothesis Testing A-Level Statistics
PPTX
TOPIC- HYPOTHESIS TESTING RMS.pptx
PPTX
Hypothesis 151221131534
PPTX
Hypothesis
PPTX
Topic 7 stat inference
PPTX
Hypothesis testing- Fundamentals to upload.pptx
PPTX
Hypothesis
PPTX
hypothesis testing and statistical infernce.pptx
PPTX
Confidence intervals, hypothesis testing and statistical tests of significanc...
PPTX
30043005-Hypothesis-Test-Full chapter.pptx
PPTX
Hypothesis testing1
PPTX
7 hypothesis testing
PPTX
Hypothesis testing
PPT
More Statistics
PPT
Morestatistics22 091208004743-phpapp01
teast mean one and two sample
HYPOTHESIS TESTING 20200702.pptx
Chapter 9 Fundamental of Hypothesis Testing.ppt
Test of hypotheses part i
Hypothesis_Testing.ppt
Edexcel Hypothesis Testing A-Level Statistics
TOPIC- HYPOTHESIS TESTING RMS.pptx
Hypothesis 151221131534
Hypothesis
Topic 7 stat inference
Hypothesis testing- Fundamentals to upload.pptx
Hypothesis
hypothesis testing and statistical infernce.pptx
Confidence intervals, hypothesis testing and statistical tests of significanc...
30043005-Hypothesis-Test-Full chapter.pptx
Hypothesis testing1
7 hypothesis testing
Hypothesis testing
More Statistics
Morestatistics22 091208004743-phpapp01
Ad

Recently uploaded (20)

PPTX
Introduction to Essence of Indian traditional knowledge.pptx
PPTX
EABDM Slides for Indifference curve.pptx
PPTX
Session 11-13. Working Capital Management and Cash Budget.pptx
PDF
ECONOMICS AND ENTREPRENEURS LESSONSS AND
PDF
Bladex Earnings Call Presentation 2Q2025
PDF
Chapter 9 IFRS Ed-Ed4_2020 Intermediate Accounting
DOCX
marketing plan Elkhabiry............docx
PPTX
kyc aml guideline a detailed pt onthat.pptx
PPTX
Antihypertensive_Drugs_Presentation_Poonam_Painkra.pptx
PDF
illuminati Uganda brotherhood agent in Kampala call 0756664682,0782561496
PPTX
Session 3. Time Value of Money.pptx_finance
PDF
Dr Tran Quoc Bao the first Vietnamese speaker at GITEX DigiHealth Conference ...
PDF
Mathematical Economics 23lec03slides.pdf
PDF
Understanding University Research Expenditures (1)_compressed.pdf
PPTX
4.5.1 Financial Governance_Appropriation & Finance.pptx
PDF
Copia de Minimal 3D Technology Consulting Presentation.pdf
PPTX
The discussion on the Economic in transportation .pptx
PPT
E commerce busin and some important issues
PDF
financing insitute rbi nabard adb imf world bank insurance and credit gurantee
PDF
ABriefOverviewComparisonUCP600_ISP8_URDG_758.pdf
Introduction to Essence of Indian traditional knowledge.pptx
EABDM Slides for Indifference curve.pptx
Session 11-13. Working Capital Management and Cash Budget.pptx
ECONOMICS AND ENTREPRENEURS LESSONSS AND
Bladex Earnings Call Presentation 2Q2025
Chapter 9 IFRS Ed-Ed4_2020 Intermediate Accounting
marketing plan Elkhabiry............docx
kyc aml guideline a detailed pt onthat.pptx
Antihypertensive_Drugs_Presentation_Poonam_Painkra.pptx
illuminati Uganda brotherhood agent in Kampala call 0756664682,0782561496
Session 3. Time Value of Money.pptx_finance
Dr Tran Quoc Bao the first Vietnamese speaker at GITEX DigiHealth Conference ...
Mathematical Economics 23lec03slides.pdf
Understanding University Research Expenditures (1)_compressed.pdf
4.5.1 Financial Governance_Appropriation & Finance.pptx
Copia de Minimal 3D Technology Consulting Presentation.pdf
The discussion on the Economic in transportation .pptx
E commerce busin and some important issues
financing insitute rbi nabard adb imf world bank insurance and credit gurantee
ABriefOverviewComparisonUCP600_ISP8_URDG_758.pdf

Hypothesis testing - college hypothesis testing

  • 2. In brief Interval Estimation • You collect sample(s) • Make a guess about the range of population parameters – lowest and highest values • μ ε [μL, μH]. What are μL and μH? • μ1 – μ2 ε [a, b]. What are a and b? • Talk about range of μ Hypothesis Testing • You collect sample(s) • You want make (more) precise statements about the parameter values • Can μ > 50? • Is μ1 > μ2? • Talk about probabilities of certain μ
  • 3. Statistical Hypothesis - I • An assertion or conjecture • About distribution on one or more random variables • μ = 7, μ 9, • A hypothesis, if true, might completely specify the distribution, then it is simple hypothesis. (μ = 7) • If not, composite hypothesis (μ 9)
  • 4. Statistical Hypothesis - II • Importance of alternative hypothesis • X ≈ N(μ, 1). Sample. H0: μ = 50, HA: μ ≠ 50 • X ≈ N(μ, 1). Sample. H0: , HA: μ < 50
  • 5. Null and Alternative Hypotheses H0 HA Assumption or status quo, nothing new Rejection of an assumption Assumed to be true or given Rejection of an assumption or given Negation of the research question Research q. needs to be proven Always contains an equality Does not contain an equality
  • 6. Null and Alternative Statements • All statistical statements are made in relation to the null hyp. • As researchers, we either reject the null hypothesis or fail to reject the null hypothesis. We do not accept the null hypothesis. • This is because the null is assumed to be true from the start. • If we reject the null hypothesis, we conclude the data supports the alternative hypothesis. • However, if we fail to reject the null, that does not prove the null is “true”. • We only set up an assumption to either reject or fail to reject.
  • 7. Example • During the 2010-11 English Premier League season, Manchester United home matches had an average attendance of 74,961. A club marketing analyst would like to see if attendance decreased during the most recent season. Establish a null and alternative hypothesis for this analysis. • What is our assumption? • We can only assume that the attendance remained the same. • Marketing analyst: interested in knowing if the attendance decreased. • Which hypothesis format should we choose? • I would choose: H0: 74961 and Ha < 74961
  • 8. Thinking about hypotheses • When formulating a statistical hypothesis: • Ask: am I testing an assumption, or the status quo, that already exists? Or am I testing a claim or assertion beyond what I already know or can know? • The null and the alternative are ALWAYS in opposition to each other; cannot both be true.
  • 9. Significance levels • Consider a population with distribution where is unknown. • We want to test a hypothesis about . • Suppose F is a normal dist. With mean and variance =1. • H0: (simple hyp.) OR H0 : (composite) • To test this hyp, we observe a sample, and based on this, we have to decide whether or not to accept H0. • We define a region “critical region” with the proviso that the hypothesis is to be rejected if the value is in the critical region.
  • 10. Significance levels and errors • In our example, variance =1, and sd =1. • SE of mean = • 95% CI => • Reject the null ( when sample average differs from 1 by more than 1.96 divided by sq. root of the sample size. • Type I error: rejecting null when it is supported by data. • Type II errors: fail to reject the null when it is false.
  • 11. Significance levels • We are not determining is H0 is ”true” but only if its validity is consistent with the resultant data. • Thus, H0 is rejected if the resultant data are unlikely when H0 is true. • Specify , and then require the test to have the property that whenever H0 is true, its probability of being rejected is never greater than . • Value of is the level of significance of the test. • It is usually set in advance, common values: 0.1; 0.05; 0.01
  • 12. Basic Method • Suppose • To develop a test of , at the level of significance is to start by determining a point estimate of say d(X). • The hypothesis is rejected if d(X) is “far away” from the region . • To determine how “far away” it needs to be for us to reject , we need to determine the probability dist of d(X) when is true. • This will give us the critical region to make the test have the required significance level
  • 13. Die example • 600 rolls of the die • H0 : die is fair • Ha : die is NOT fair • In plain English: is the variation in outcomes due to chance, or is the variation beyond what random chance would allow? • How much should our data vary for us to conclude that our die is not fair? i.e. we reject the null?
  • 14. Errors Possible To test H0, set , and then require the test to have the probability of Type I error occurring can never be greater than
  • 15. To be more precise, this is what we mean by α and β
  • 16. Critical Value • If we desire that the test has significance level then we must determine the critical value c that will make the type I error = • We can determine whether or not to accept the null hypothesis by computing, first, the value of the test statistic, • And second, the probability that a unit normal would (in absolute value) exceed that quantity. • This probability is called the p-value of the test – gives the critical significance level. • Relationship between alpha and p: reject null if p- value < alpha
  • 17. Type I and Type II errors again • : probability of committing a Type 1 error. • : probability of committing a Type II error • As decreases (level of significance increases), Type I error decreases • As decreases, probability of Type II error increases. • Delicate balance!
  • 18. Central Idea Type I Error: reject H0 when it is correct. Type II Error: accept H0 when it is false. Which is in your control or smaller?
  • 19. Type I and Type II errors What is the null hypothesis here?
  • 21. t-Test • When mean and std. dev. are both unknown
  • 22. F TEST • 5 cans of tuna filled by machines.The quality assurance manager wishes to test the variability of two machines. • Machine 1: n=25; mean: 5.0492 • Machine 2: n=22; mean: 4.9808 • Variance 1: 0.1130 • Variance 2: 0.0137 oz. • Question: is this difference due to sampling error or is it statistically significant? • Use F test to compare variances.
  • 23. REGRESSION- ECOTRIX • In this course we have learnt to measure effects. Students understand more of statistics concepts in evening classes then morning classes. But does “evening” open up students’ brains? Does “evening” or “moon light” improve students’ comprehension skills? • Causal Effects – Next leap in data comprehension! • Regress Wage Education: – Does education affect wage? – Wagei = α + β * Educationi + εi • Regress Wage Education Gender: – For a given gender, does education affect wage? – For a given education, does gender affect wage? – Wagei = α + β1 * Educationi + β2 * Genderi + εi