SlideShare a Scribd company logo
INTRODUCTION TO ANALYTICS
Part 9
STATISTICAL CONCEPTS AND THEIR
APPLICATIONS IN BUSINESS
TESTS OF SIGNIFICANCE
One sample z-test
Two sample z-test
One sample t-test
Two sample t-test
Paired t-test
Chi – Squaredtest
F test - Analysis of Variance (ANOVA)
F test - Regression
CHI- SQUARED TESTS
• Compare the observed result against an expected result based on a hypothesis
• Steps:
• State the null hypothesis
• Prepare the contingency table for the variable
• Determine the expected results
• Calculate the chi-squared value
• Calculate the degrees of freedom
• Based on the above, calculate the p-value
• If p-value < 0.05, reject the null hypothesis.
• Test of independence:
• Verify if two variables are independent
• Same steps as above.
CASE STUDY—CHI-SQUARED TEST
• A city has a newly opened nuclear plant, and there are families staying dangerously close to the
plant. A health safety officer wants to take this case up to provide relocation for the families that
live in the surrounding area. To make a strong case, he wants to prove with numbers that an
exposure to radiation levels is leading to an increase in diseased population. He formulates a
contingency table of exposure and disease.
• Does the data suggest an association between the disease and exposure?
Disease Total
Exposure Yes No
Yes 37 13 50
No 17 53 70
Total 54 66 120
Steps:
• Calculate the number of individuals of exposed and unexposed groups expected in each disease
category (yes and no) if the probabilities were the same.
• If there were no effect of exposure, the probabilities should be same and the chi-squared statistic would
have a very low value.
Proportion of population exposed = (50/120) = 0.42
Proportion of population not exposed = (70/120) = 0.58
Thus, expected values: Population
with disease = 54 Exposure Yes : 54
* 0.42 = 22.5
Exposure No : 54 * 0.58 = 31.5
Population without disease = 66
Exposure Yes : 66 * 0.42 = 27.5
Exposure No : 66 * 0.58 =38.5
CASE STUDY—CHI-SQUARED TEST (CONTD.)
• Calculate the Chi-squared statistic χ2=Σ
= 29.1
• Calculate the degrees of freedom :
(Number of rows – 1) X (Number of columns – 1)
df = (2 – 1) X (2 – 1) =1
• Calculate the p-value from the chi-squared table
For chi-squared value 29.1 and degrees of freedom = 1, from the table, p-value is < 0.001
• Interpretation: There is 0.001 chance of obtaining such discrepancies between expected and
observed values if there is no association
• Conclusion : There is an association between the exposure and disease.
CASE STUDY—CHI-SQUARED TEST (CONTD.)
ANOVA
• Analysis of Variance – used to compare more than two groups
• Extension of the independent t-tests
• Factor variable – variable defining the groups
• Response variable – variable being compared
• One wayANOVA
• Groups of a single variable
• E.g. : Is there a difference in student’s scores based on the row he is seated –
front/middle/back?
• Two wayANOVA
• Two independent variables
• E.g. : Does the race and gender affect a person’s yearly income?
• Marks obtained in the same subject by 3 students belonging to three different schools are given
below.
• Does the data suggest any association between schools and marks?
CASE STUDY—ONE WAY ANOVA
Basic Idea : Partition the total variation in the data into the variance between groups and variance
within groups.
School A B C
Marks
82 83 38
83 78 59
97 68 55
Steps:
• Calculate the means
• School A : mean(82,83,97) =87.3
• School B : mean(83,78,68) =76.3
• School C : mean(38,59,55) =50.6
• Calculate the grand mean
• Grand mean = mean(82,83,97,83,78,68,39,59,55) = 71.4
• Calculating the variations
• Sum of Squared Deviations about the grand mean, across all observed values : SSTotal = 2630.2
• Sum of Squared Deviations of group mean about the grand mean – three group means against
the grand mean : SSBetween = 2124.2
• Sum of Squared Deviations of observations within a group about their group mean; added
across all groups : SSWithin = 506
CASE STUDY—ONE WAY ANOVA (CONTD.)
• Calculate the degrees of freedom for every variance
• dfTotal = Number of observations – 1 = 9 -1 =8
• dfBetween = Number of groups -1 = 3-1 = 2
• dfWithin = Number of observations – number of groups = 9-3 = 6
• Calculate the Mean Squared Variances
• Mean Squared variance between groups : MSBetween= SSBetween /dfBetween = 2124.2/2 = 1062.1
• Mean Squared variance within groups : MSWithin= SSWithin /dfWithin = 506/6 = 84.3
• Calculate the f-statistic
• F-value : MSBetween /MSWithin= 1062.1/84.3 = 12.59
• Calculate the p-value from the F-table
• p-value for given f-value 12.59 and degrees of freedom 2 and 6 is 0.007
• Conclusion : Since the p-value is less than alpha, we can conclude by rejecting the null hypothesis,
that there is a difference in the marks obtained by students belonging to different groups.
CASE STUDY—ONE WAY ANOVA (CONTD.)
Thank You
If you are looking for business analytics training in Bangalore / Bengaluru then
visit: http://beamsync.com/business-analytics-training-bangalore/
Next Part We will Publish Soon.

More Related Content

PPT
PPTX
Introduction to Business Analytics Course Part 10
DOCX
Parametric Statistics
PPTX
Non parametric test
PDF
Nonparametric Statistics
PDF
Repeated Measures ANOVA
PPT
Non-parametric analysis: Wilcoxon, Kruskal Wallis & Spearman
PPTX
One-Sample Hypothesis Tests
Introduction to Business Analytics Course Part 10
Parametric Statistics
Non parametric test
Nonparametric Statistics
Repeated Measures ANOVA
Non-parametric analysis: Wilcoxon, Kruskal Wallis & Spearman
One-Sample Hypothesis Tests

What's hot (20)

PPTX
Non parametric-tests
PPT
Non parametric tests by meenu
PPTX
Repeated Measures ANOVA
PPTX
Chi square Test
PPT
Nonparametric and Distribution- Free Statistics
PPTX
t distribution, paired and unpaired t-test
PPTX
Statistics-Non parametric test
PPTX
non parametric statistics
PPTX
Data Analysis
PPTX
Statistical test
PPT
Parametric and non parametric test
PPTX
Chi square test final
PPTX
Non-parametric Statistical tests for Hypotheses testing
PDF
Statistical methods for the life sciences lb
PPTX
Parametric vs Nonparametric Tests: When to use which
PPTX
Fishers test
PPTX
Parametric tests
PPTX
Non parametric test
PPTX
Chisquare Test
PPTX
Non parametric study; Statistical approach for med student
Non parametric-tests
Non parametric tests by meenu
Repeated Measures ANOVA
Chi square Test
Nonparametric and Distribution- Free Statistics
t distribution, paired and unpaired t-test
Statistics-Non parametric test
non parametric statistics
Data Analysis
Statistical test
Parametric and non parametric test
Chi square test final
Non-parametric Statistical tests for Hypotheses testing
Statistical methods for the life sciences lb
Parametric vs Nonparametric Tests: When to use which
Fishers test
Parametric tests
Non parametric test
Chisquare Test
Non parametric study; Statistical approach for med student
Ad

Viewers also liked (9)

PDF
R - Basic Introduction
PPTX
Basic Analytic Techniques - Using R Tool - Part 1
PDF
COMPANY PROFILE
PPTX
Big Data solution for multi-national Bank
PDF
Data Analytics using R
PPTX
Introduction to Business Analytics Course Part 7
PPTX
Twitter Text Mining with Web scraping, R, Shiny and Hadoop - Richard Sheng
PPTX
Smart Meter Data Analytic using Hadoop
PDF
R Markdown Tutorial For Beginners
R - Basic Introduction
Basic Analytic Techniques - Using R Tool - Part 1
COMPANY PROFILE
Big Data solution for multi-national Bank
Data Analytics using R
Introduction to Business Analytics Course Part 7
Twitter Text Mining with Web scraping, R, Shiny and Hadoop - Richard Sheng
Smart Meter Data Analytic using Hadoop
R Markdown Tutorial For Beginners
Ad

Similar to Introduction to Business Analytics Course Part 9 (20)

PPT
lecture12.ppt
PPT
lecture12.ppt
PPTX
PPTX
Analysis of variance (ANOVA)
PPTX
Measures of Dispersion
PPT
Chi square
PPTX
Analysis of variance
PPTX
PPTX
Tests of Significance.pptx powerpoint presentation
PPTX
Chi square test
PDF
Anova one way sem 1 20142015 dk
PPTX
One way AVOVA
PDF
Analysis of Variance (ANOVA)
PPTX
Significance Tests
PPT
anova & analysis of variance pearson.ppt
PPTX
Chi-Square power point presentation .ppt
PPTX
Chi Square & Anova
PPT
Anova.ppt
PPT
6Tests of significance Parametric and Non Parametric tests.ppt
PDF
Chisquare Test of Association.pdf in biostatistics
lecture12.ppt
lecture12.ppt
Analysis of variance (ANOVA)
Measures of Dispersion
Chi square
Analysis of variance
Tests of Significance.pptx powerpoint presentation
Chi square test
Anova one way sem 1 20142015 dk
One way AVOVA
Analysis of Variance (ANOVA)
Significance Tests
anova & analysis of variance pearson.ppt
Chi-Square power point presentation .ppt
Chi Square & Anova
Anova.ppt
6Tests of significance Parametric and Non Parametric tests.ppt
Chisquare Test of Association.pdf in biostatistics

Recently uploaded (20)

PDF
Sports Quiz easy sports quiz sports quiz
PDF
Anesthesia in Laparoscopic Surgery in India
PDF
O7-L3 Supply Chain Operations - ICLT Program
PDF
Black Hat USA 2025 - Micro ICS Summit - ICS/OT Threat Landscape
PDF
Microbial disease of the cardiovascular and lymphatic systems
PPTX
PPH.pptx obstetrics and gynecology in nursing
PDF
01-Introduction-to-Information-Management.pdf
PDF
TR - Agricultural Crops Production NC III.pdf
PDF
Computing-Curriculum for Schools in Ghana
PPTX
Cell Types and Its function , kingdom of life
PDF
FourierSeries-QuestionsWithAnswers(Part-A).pdf
PPTX
master seminar digital applications in india
PDF
VCE English Exam - Section C Student Revision Booklet
PDF
Classroom Observation Tools for Teachers
PDF
The Lost Whites of Pakistan by Jahanzaib Mughal.pdf
PDF
Insiders guide to clinical Medicine.pdf
PDF
Physiotherapy_for_Respiratory_and_Cardiac_Problems WEBBER.pdf
PPTX
Pharmacology of Heart Failure /Pharmacotherapy of CHF
PDF
Complications of Minimal Access Surgery at WLH
PDF
2.FourierTransform-ShortQuestionswithAnswers.pdf
Sports Quiz easy sports quiz sports quiz
Anesthesia in Laparoscopic Surgery in India
O7-L3 Supply Chain Operations - ICLT Program
Black Hat USA 2025 - Micro ICS Summit - ICS/OT Threat Landscape
Microbial disease of the cardiovascular and lymphatic systems
PPH.pptx obstetrics and gynecology in nursing
01-Introduction-to-Information-Management.pdf
TR - Agricultural Crops Production NC III.pdf
Computing-Curriculum for Schools in Ghana
Cell Types and Its function , kingdom of life
FourierSeries-QuestionsWithAnswers(Part-A).pdf
master seminar digital applications in india
VCE English Exam - Section C Student Revision Booklet
Classroom Observation Tools for Teachers
The Lost Whites of Pakistan by Jahanzaib Mughal.pdf
Insiders guide to clinical Medicine.pdf
Physiotherapy_for_Respiratory_and_Cardiac_Problems WEBBER.pdf
Pharmacology of Heart Failure /Pharmacotherapy of CHF
Complications of Minimal Access Surgery at WLH
2.FourierTransform-ShortQuestionswithAnswers.pdf

Introduction to Business Analytics Course Part 9

  • 2. STATISTICAL CONCEPTS AND THEIR APPLICATIONS IN BUSINESS
  • 3. TESTS OF SIGNIFICANCE One sample z-test Two sample z-test One sample t-test Two sample t-test Paired t-test Chi – Squaredtest F test - Analysis of Variance (ANOVA) F test - Regression
  • 4. CHI- SQUARED TESTS • Compare the observed result against an expected result based on a hypothesis • Steps: • State the null hypothesis • Prepare the contingency table for the variable • Determine the expected results • Calculate the chi-squared value • Calculate the degrees of freedom • Based on the above, calculate the p-value • If p-value < 0.05, reject the null hypothesis. • Test of independence: • Verify if two variables are independent • Same steps as above.
  • 5. CASE STUDY—CHI-SQUARED TEST • A city has a newly opened nuclear plant, and there are families staying dangerously close to the plant. A health safety officer wants to take this case up to provide relocation for the families that live in the surrounding area. To make a strong case, he wants to prove with numbers that an exposure to radiation levels is leading to an increase in diseased population. He formulates a contingency table of exposure and disease. • Does the data suggest an association between the disease and exposure? Disease Total Exposure Yes No Yes 37 13 50 No 17 53 70 Total 54 66 120
  • 6. Steps: • Calculate the number of individuals of exposed and unexposed groups expected in each disease category (yes and no) if the probabilities were the same. • If there were no effect of exposure, the probabilities should be same and the chi-squared statistic would have a very low value. Proportion of population exposed = (50/120) = 0.42 Proportion of population not exposed = (70/120) = 0.58 Thus, expected values: Population with disease = 54 Exposure Yes : 54 * 0.42 = 22.5 Exposure No : 54 * 0.58 = 31.5 Population without disease = 66 Exposure Yes : 66 * 0.42 = 27.5 Exposure No : 66 * 0.58 =38.5 CASE STUDY—CHI-SQUARED TEST (CONTD.)
  • 7. • Calculate the Chi-squared statistic χ2=Σ = 29.1 • Calculate the degrees of freedom : (Number of rows – 1) X (Number of columns – 1) df = (2 – 1) X (2 – 1) =1 • Calculate the p-value from the chi-squared table For chi-squared value 29.1 and degrees of freedom = 1, from the table, p-value is < 0.001 • Interpretation: There is 0.001 chance of obtaining such discrepancies between expected and observed values if there is no association • Conclusion : There is an association between the exposure and disease. CASE STUDY—CHI-SQUARED TEST (CONTD.)
  • 8. ANOVA • Analysis of Variance – used to compare more than two groups • Extension of the independent t-tests • Factor variable – variable defining the groups • Response variable – variable being compared • One wayANOVA • Groups of a single variable • E.g. : Is there a difference in student’s scores based on the row he is seated – front/middle/back? • Two wayANOVA • Two independent variables • E.g. : Does the race and gender affect a person’s yearly income?
  • 9. • Marks obtained in the same subject by 3 students belonging to three different schools are given below. • Does the data suggest any association between schools and marks? CASE STUDY—ONE WAY ANOVA Basic Idea : Partition the total variation in the data into the variance between groups and variance within groups. School A B C Marks 82 83 38 83 78 59 97 68 55
  • 10. Steps: • Calculate the means • School A : mean(82,83,97) =87.3 • School B : mean(83,78,68) =76.3 • School C : mean(38,59,55) =50.6 • Calculate the grand mean • Grand mean = mean(82,83,97,83,78,68,39,59,55) = 71.4 • Calculating the variations • Sum of Squared Deviations about the grand mean, across all observed values : SSTotal = 2630.2 • Sum of Squared Deviations of group mean about the grand mean – three group means against the grand mean : SSBetween = 2124.2 • Sum of Squared Deviations of observations within a group about their group mean; added across all groups : SSWithin = 506 CASE STUDY—ONE WAY ANOVA (CONTD.)
  • 11. • Calculate the degrees of freedom for every variance • dfTotal = Number of observations – 1 = 9 -1 =8 • dfBetween = Number of groups -1 = 3-1 = 2 • dfWithin = Number of observations – number of groups = 9-3 = 6 • Calculate the Mean Squared Variances • Mean Squared variance between groups : MSBetween= SSBetween /dfBetween = 2124.2/2 = 1062.1 • Mean Squared variance within groups : MSWithin= SSWithin /dfWithin = 506/6 = 84.3 • Calculate the f-statistic • F-value : MSBetween /MSWithin= 1062.1/84.3 = 12.59 • Calculate the p-value from the F-table • p-value for given f-value 12.59 and degrees of freedom 2 and 6 is 0.007 • Conclusion : Since the p-value is less than alpha, we can conclude by rejecting the null hypothesis, that there is a difference in the marks obtained by students belonging to different groups. CASE STUDY—ONE WAY ANOVA (CONTD.)
  • 12. Thank You If you are looking for business analytics training in Bangalore / Bengaluru then visit: http://beamsync.com/business-analytics-training-bangalore/ Next Part We will Publish Soon.