D I S C R I M I N AT E
A N A L Y S I S
P R E S E N T E D B Y:
B H O J R A J G A U TA M
R O S H A N T H A PA
TABLE OF CONTENT
• Introduction
• Assumptions of discriminate analysis
• Discriminate Analysis Model
• Types of DA
• Steps of Analysis
• Hypothesis
• Strengths and Weakness
• References
INTRODUCTION
• Discriminate analysis is a statistical method used for
Classifying a set of observation into predefined groups
• In simple, DA helps to understand the relationship between a “dependent variable” and one/
more “independent variables”
• It is used to analyze and interpret the differences between two or more groups or categories
based on multiple predictor variables
DA is sometimes also called as:
Discriminate factor analysis
Canonical discriminate analysis
CONTD…
• DA is used when the data are normally distributed whereas the logistic regression is used when
the data are not normally distributed
Dependent variable – categorical
Independent variable – intervals or ratio scale
5
EXAMPLE
Examples includes;
i. Dependent variable is person diseased status (1 yes, 2 no) and independent variables are
plasma lipid profile, that is total cholesterol (X1) , triglycerides (X2), HDL (X3) and LDL
(X4). Since dependent variable is having two categories, so it it an example of two-group
DA. Here independent variables are continuous in nature.
ii. We want to know whether somebody has lung cancer. Hence, we wish to predict a yes or no
outcome.
• Possible predictor variables: number of cigarettes smoked a day, coughing frequency and
intensity etc.
WHY DO WE USE DA?
• DA has various benefits as a statistical tool and is quite similar to regression analysis
• It can be used to determine
Which predictor variables are related to the dependent variable
To predict the value of the dependent variable given certain values of the predictor variables
WHEN TO USE DA?
• Data must be from different groups
• To analysis of differences in groups
• For classification of new objects
ASSUMPTIONS OF DISCRIMINATE ANALYSIS
Homogeneous
within-group
variances
Multivariate
normality
within groups
No multi-
collinearity
Prior
probabilities
DISCRIMINATE ANALYSIS MODEL
The discriminate analysis model involves linear combination of the following form:
where
D = discriminate score
b’s = discriminate coefficient or weight
X’s = predictor or independent variable
• The coefficient, or weight (b), are estimated so that the groups differ as much as possible on the
values of the discriminate function
• Discriminate analysis– creates an equation which will minimize the possibility of
misclassifying cases into their respective groups or categories
D = b0+ b1X1+ b2X2+ b3X3+…...+ bkXk
TYPES OF DA
i. Linear discriminate analysis:
When the criterion/dependent variable has two categories eg: adopters and non-adopters
ii. Multiple discriminate analysis:
When three or more categories of dependent variable are involved
LINEAR DISCRIMINATE ANALYSIS
• Given by Ronald Fisher in 1936
• This methods group images of the same classes and separate images of the different classes
• This classification involves 2 target categories and 2 predictor variables. These features divide or
characterize two or more than two objects or events.
MULTIPLE DISCRIMINATE ANALYSIS
• To discriminate among more than 2 groups
• It requires
STEPS IN ANALYSIS
• STEP 1
In step one the
independent variables
which have the
discriminating power are
being chosen
• STEP 2
A discriminant function
model is developed by
using the coefficients of
independent variables
STEPS IN ANALYSIS CONTD…
• STEP 3
In step three Wilk’s
Lambda is computed for
testing the significance of
discriminant function
• STEP 4
In the step four the
independent variables
which possess
importance in
discriminating the groups
are being found
• STEP 5
In step five classification of
subjects to their respective
group is being made
STEPS OF DISCRIMINANT ANALYSIS IN SPSS
1. Analyze >> Classify >> Discriminant
2. Select ‘dependent variable’ as your grouping variable and enter it into the Grouping Variable Box
3. Click Define Range button and enter the lowest and highest code for your groups
4. Click Continue.
5. Select your predictors (IV’s) and enter into Independents box and select Enter Independents
Together. If you planned a stepwise analysis you would at this point select Use Stepwise Method
and not the previous instruction.
6. Click on Statistics button and select Means, Univariate Anovas, Box’s M, Unstandardized and
Within-Groups Correlation
7. Continue >> Classify. Select Compute From Group Sizes, Summary Table, Leave One Out
Classification, Within Groups, and all Plots
8. Continue >> Save and select Predicted Group Membership and Discriminant Scores
9. OK.
STATISTICS ASSOCIATED WITH DISCRIMINANT ANALYSIS
•Eigen values:
• It is the ratio of between groups to within group sum of squares.
Higher the eigen value, more will be the appropriateness of the
discriminant function. For a two-group DA, there will be one
function and one eigen value. It accounts for explained variance in
the model. In case of multiple DA, the eigen values will be more but
the values will gradually decline. The first eigen value will be
largest and most important, the second will be less than the firstone
having less explanatory power.
• Box’s M statistic: It is a test for equality of the covariance
matrices of the independent variables across the groups.
Here the null hypothesis (Ho) is that the observed
covariance matrices are equal across groups. So, a
nonsignificant test result (i.e., one with a large p-value) will
indicate that the covariance matrices are equal.
• Wilks’ Lambda: It provides a statistical test to assess
discriminating power of the independent variables. If it is
significant (revealed by significance of Chisquare), we
reject the “HN: No group separability”, and conclude that
the discriminant function is statistically significant.
discriminate analysis of Biostatistics ppt for MPH Students
HYPOTHESIS
Discriminant analysis tests the following hypothesis:
H0: the group means of a set of independent variable for two or more groups are
equal
H1: the group means for two or more groups are not equal
This group means is referred to as a centroid
STRENGTHS
1. Classification accuracy: discriminate analysis can be effective when the data is well-separated, and the
assumptions of the technique are met. It can produce accurate results when the classes are well-defined and
distinct
2. Multiclass classification: Unlike some other classification methods, discriminate analysis can handle
multiple classes efficiently. It extends naturally to more than two classes without requiring significant
modifications
3. Dimensionally reduction: Discriminant analysis can be used to reduce the number of predictors
(independent variables) by creating new variables (discriminant function) that capture most of the
information about the group differences
4. Normality assumptions: Discriminant analysis can still perform reasonably well when the normality
assumptions is slightly violated, especially with large sample sizes
WEAKNESS/LIMITATIONS
1. Sensitivity to assumption:
2. Overfitting:
3. Outliers:
4. Non-linear boundaries:
SIMILARITIES AND DIFFERENCES AMONG ANOVA, REGRESSION AND
DISCRIMINATE ANALYSIS
ANOVA Regression Discriminate
Analysis
Similarities
Number of dependent variables One One One
Number of independent variables Multiple Multiple Multiple
Differences
Nature of dependent variable Metric Metric Categorical Binary
Nature of independent variable Categorical Metric Metric
REFERENCES
• https://www.statisticssolutions.com/discriminant-analysis/
• https://en.wikipedia.org/wiki/Linear_discriminant_analysis
• https://www.slideshare.net/slideshow/discriminant-function-analysis-dfa/238649297
• https://www.slideshare.net/slideshow/discriminantfunctionanalysisdfa2009261213041pptx/256616478
• https://www.sciencedirect.com/topics/agricultural-and-biological-sciences/discriminant-analysis
• https://www.wallstreetmojo.com/discriminant-analysis/
• https://stats.oarc.ucla.edu/stata/dae/discriminant-function-analysis/

More Related Content

PPTX
Discriminate Analysis for health statistics .pptx
PDF
Discrimination Analysis and working on SPSS assignment
PPTX
Discriminant analysis
PPTX
Discriminant analysis
PPTX
Discriminant analysis.pptx
PPTX
Discriminant Analysis in Sports
PPTX
diiscriminant analysis1.pptx
PDF
discriminant analysis.pdf
Discriminate Analysis for health statistics .pptx
Discrimination Analysis and working on SPSS assignment
Discriminant analysis
Discriminant analysis
Discriminant analysis.pptx
Discriminant Analysis in Sports
diiscriminant analysis1.pptx
discriminant analysis.pdf

Similar to discriminate analysis of Biostatistics ppt for MPH Students (20)

PPTX
Discriminant Analysis
PPTX
Discriminant analysis-Business Intelligence tool
DOCX
simple discriminant
PPTX
discriminant analysis
PDF
Discriminate analysis
PPTX
Discriminant Analysis in Multivariate data analysis.pptx
PPT
Discriminant analysis
PPTX
discriminantfunctionanalysisdfa-200926121304(1).pptx
PPTX
discriminantfunctionanalysisdfa-200926121304.pptx
PPTX
MRA - Session 12 - Discriminant Analysis.pptx
PDF
An Overview and Application of Discriminant Analysis in Data Analysis
PPTX
Discriminant function analysis (DFA)
PPTX
R studio.pptx
PPTX
DISCRIMINANT ANALYSIS.pptx
PPTX
DISCRIMINABLE CLUSTER ANALYSIS.pptx
PPTX
Discriminant Analysis.pptx
PPTX
Discrimination Analysis
PPTX
discrimination analysis
PPT
Discriminant analysis group no. 4
Discriminant Analysis
Discriminant analysis-Business Intelligence tool
simple discriminant
discriminant analysis
Discriminate analysis
Discriminant Analysis in Multivariate data analysis.pptx
Discriminant analysis
discriminantfunctionanalysisdfa-200926121304(1).pptx
discriminantfunctionanalysisdfa-200926121304.pptx
MRA - Session 12 - Discriminant Analysis.pptx
An Overview and Application of Discriminant Analysis in Data Analysis
Discriminant function analysis (DFA)
R studio.pptx
DISCRIMINANT ANALYSIS.pptx
DISCRIMINABLE CLUSTER ANALYSIS.pptx
Discriminant Analysis.pptx
Discrimination Analysis
discrimination analysis
Discriminant analysis group no. 4
Ad

More from B. R Gautam (6)

PPTX
Critical-overview-of-municipal-food-protection-and-inspection[3584].pptx
PPTX
Emerging & Re-emerging presentataion .pptx
PPTX
approaches_in_health_promotion. pptx
PPTX
FBIMNCI.pptx session 1
PPTX
Session 1 FBIMNCI.pptx
PPTX
1. General Principles for management of sick Children.pptx
Critical-overview-of-municipal-food-protection-and-inspection[3584].pptx
Emerging & Re-emerging presentataion .pptx
approaches_in_health_promotion. pptx
FBIMNCI.pptx session 1
Session 1 FBIMNCI.pptx
1. General Principles for management of sick Children.pptx
Ad

Recently uploaded (20)

PDF
Calcified coronary lesions management tips and tricks
DOCX
PEADIATRICS NOTES.docx lecture notes for medical students
PPTX
thio and propofol mechanism and uses.pptx
PPTX
preoerative assessment in anesthesia and critical care medicine
PDF
04 dr. Rahajeng - dr.rahajeng-KOGI XIX 2025-ed1.pdf
PDF
Plant-Based Antimicrobials: A New Hope for Treating Diarrhea in HIV Patients...
PDF
OSCE Series Set 1 ( Questions & Answers ).pdf
PDF
AGE(Acute Gastroenteritis)pdf. Specific.
PDF
B C German Homoeopathy Medicineby Dr Brij Mohan Prasad
PPTX
y4d nutrition and diet in pregnancy and postpartum
PPTX
Effects of lipid metabolism 22 asfelagi.pptx
PPTX
HYPERSENSITIVITY REACTIONS - Pathophysiology Notes for Second Year Pharm D St...
PPTX
IMAGING EQUIPMENiiiiìiiiiiTpptxeiuueueur
PDF
Transcultural that can help you someday.
PPTX
Neoplasia III.pptxjhghgjhfj fjfhgfgdfdfsrbvhv
PDF
focused on the development and application of glycoHILIC, pepHILIC, and comm...
PPTX
Acute Coronary Syndrome for Cardiology Conference
PPT
Dermatology for member of royalcollege.ppt
PPTX
Manage HIV exposed child and a child with HIV infection.pptx
PPTX
Post Op complications in general surgery
Calcified coronary lesions management tips and tricks
PEADIATRICS NOTES.docx lecture notes for medical students
thio and propofol mechanism and uses.pptx
preoerative assessment in anesthesia and critical care medicine
04 dr. Rahajeng - dr.rahajeng-KOGI XIX 2025-ed1.pdf
Plant-Based Antimicrobials: A New Hope for Treating Diarrhea in HIV Patients...
OSCE Series Set 1 ( Questions & Answers ).pdf
AGE(Acute Gastroenteritis)pdf. Specific.
B C German Homoeopathy Medicineby Dr Brij Mohan Prasad
y4d nutrition and diet in pregnancy and postpartum
Effects of lipid metabolism 22 asfelagi.pptx
HYPERSENSITIVITY REACTIONS - Pathophysiology Notes for Second Year Pharm D St...
IMAGING EQUIPMENiiiiìiiiiiTpptxeiuueueur
Transcultural that can help you someday.
Neoplasia III.pptxjhghgjhfj fjfhgfgdfdfsrbvhv
focused on the development and application of glycoHILIC, pepHILIC, and comm...
Acute Coronary Syndrome for Cardiology Conference
Dermatology for member of royalcollege.ppt
Manage HIV exposed child and a child with HIV infection.pptx
Post Op complications in general surgery

discriminate analysis of Biostatistics ppt for MPH Students

  • 1. D I S C R I M I N AT E A N A L Y S I S P R E S E N T E D B Y: B H O J R A J G A U TA M R O S H A N T H A PA
  • 2. TABLE OF CONTENT • Introduction • Assumptions of discriminate analysis • Discriminate Analysis Model • Types of DA • Steps of Analysis • Hypothesis • Strengths and Weakness • References
  • 3. INTRODUCTION • Discriminate analysis is a statistical method used for Classifying a set of observation into predefined groups • In simple, DA helps to understand the relationship between a “dependent variable” and one/ more “independent variables” • It is used to analyze and interpret the differences between two or more groups or categories based on multiple predictor variables DA is sometimes also called as: Discriminate factor analysis Canonical discriminate analysis
  • 4. CONTD… • DA is used when the data are normally distributed whereas the logistic regression is used when the data are not normally distributed Dependent variable – categorical Independent variable – intervals or ratio scale
  • 5. 5 EXAMPLE Examples includes; i. Dependent variable is person diseased status (1 yes, 2 no) and independent variables are plasma lipid profile, that is total cholesterol (X1) , triglycerides (X2), HDL (X3) and LDL (X4). Since dependent variable is having two categories, so it it an example of two-group DA. Here independent variables are continuous in nature. ii. We want to know whether somebody has lung cancer. Hence, we wish to predict a yes or no outcome. • Possible predictor variables: number of cigarettes smoked a day, coughing frequency and intensity etc.
  • 6. WHY DO WE USE DA? • DA has various benefits as a statistical tool and is quite similar to regression analysis • It can be used to determine Which predictor variables are related to the dependent variable To predict the value of the dependent variable given certain values of the predictor variables
  • 7. WHEN TO USE DA? • Data must be from different groups • To analysis of differences in groups • For classification of new objects
  • 8. ASSUMPTIONS OF DISCRIMINATE ANALYSIS Homogeneous within-group variances Multivariate normality within groups No multi- collinearity Prior probabilities
  • 9. DISCRIMINATE ANALYSIS MODEL The discriminate analysis model involves linear combination of the following form: where D = discriminate score b’s = discriminate coefficient or weight X’s = predictor or independent variable • The coefficient, or weight (b), are estimated so that the groups differ as much as possible on the values of the discriminate function • Discriminate analysis– creates an equation which will minimize the possibility of misclassifying cases into their respective groups or categories D = b0+ b1X1+ b2X2+ b3X3+…...+ bkXk
  • 10. TYPES OF DA i. Linear discriminate analysis: When the criterion/dependent variable has two categories eg: adopters and non-adopters ii. Multiple discriminate analysis: When three or more categories of dependent variable are involved
  • 11. LINEAR DISCRIMINATE ANALYSIS • Given by Ronald Fisher in 1936 • This methods group images of the same classes and separate images of the different classes • This classification involves 2 target categories and 2 predictor variables. These features divide or characterize two or more than two objects or events.
  • 12. MULTIPLE DISCRIMINATE ANALYSIS • To discriminate among more than 2 groups • It requires
  • 13. STEPS IN ANALYSIS • STEP 1 In step one the independent variables which have the discriminating power are being chosen • STEP 2 A discriminant function model is developed by using the coefficients of independent variables
  • 14. STEPS IN ANALYSIS CONTD… • STEP 3 In step three Wilk’s Lambda is computed for testing the significance of discriminant function • STEP 4 In the step four the independent variables which possess importance in discriminating the groups are being found • STEP 5 In step five classification of subjects to their respective group is being made
  • 15. STEPS OF DISCRIMINANT ANALYSIS IN SPSS 1. Analyze >> Classify >> Discriminant 2. Select ‘dependent variable’ as your grouping variable and enter it into the Grouping Variable Box 3. Click Define Range button and enter the lowest and highest code for your groups 4. Click Continue. 5. Select your predictors (IV’s) and enter into Independents box and select Enter Independents Together. If you planned a stepwise analysis you would at this point select Use Stepwise Method and not the previous instruction. 6. Click on Statistics button and select Means, Univariate Anovas, Box’s M, Unstandardized and Within-Groups Correlation 7. Continue >> Classify. Select Compute From Group Sizes, Summary Table, Leave One Out Classification, Within Groups, and all Plots 8. Continue >> Save and select Predicted Group Membership and Discriminant Scores 9. OK.
  • 16. STATISTICS ASSOCIATED WITH DISCRIMINANT ANALYSIS •Eigen values: • It is the ratio of between groups to within group sum of squares. Higher the eigen value, more will be the appropriateness of the discriminant function. For a two-group DA, there will be one function and one eigen value. It accounts for explained variance in the model. In case of multiple DA, the eigen values will be more but the values will gradually decline. The first eigen value will be largest and most important, the second will be less than the firstone having less explanatory power.
  • 17. • Box’s M statistic: It is a test for equality of the covariance matrices of the independent variables across the groups. Here the null hypothesis (Ho) is that the observed covariance matrices are equal across groups. So, a nonsignificant test result (i.e., one with a large p-value) will indicate that the covariance matrices are equal.
  • 18. • Wilks’ Lambda: It provides a statistical test to assess discriminating power of the independent variables. If it is significant (revealed by significance of Chisquare), we reject the “HN: No group separability”, and conclude that the discriminant function is statistically significant.
  • 20. HYPOTHESIS Discriminant analysis tests the following hypothesis: H0: the group means of a set of independent variable for two or more groups are equal H1: the group means for two or more groups are not equal This group means is referred to as a centroid
  • 21. STRENGTHS 1. Classification accuracy: discriminate analysis can be effective when the data is well-separated, and the assumptions of the technique are met. It can produce accurate results when the classes are well-defined and distinct 2. Multiclass classification: Unlike some other classification methods, discriminate analysis can handle multiple classes efficiently. It extends naturally to more than two classes without requiring significant modifications 3. Dimensionally reduction: Discriminant analysis can be used to reduce the number of predictors (independent variables) by creating new variables (discriminant function) that capture most of the information about the group differences 4. Normality assumptions: Discriminant analysis can still perform reasonably well when the normality assumptions is slightly violated, especially with large sample sizes
  • 22. WEAKNESS/LIMITATIONS 1. Sensitivity to assumption: 2. Overfitting: 3. Outliers: 4. Non-linear boundaries:
  • 23. SIMILARITIES AND DIFFERENCES AMONG ANOVA, REGRESSION AND DISCRIMINATE ANALYSIS ANOVA Regression Discriminate Analysis Similarities Number of dependent variables One One One Number of independent variables Multiple Multiple Multiple Differences Nature of dependent variable Metric Metric Categorical Binary Nature of independent variable Categorical Metric Metric
  • 24. REFERENCES • https://www.statisticssolutions.com/discriminant-analysis/ • https://en.wikipedia.org/wiki/Linear_discriminant_analysis • https://www.slideshare.net/slideshow/discriminant-function-analysis-dfa/238649297 • https://www.slideshare.net/slideshow/discriminantfunctionanalysisdfa2009261213041pptx/256616478 • https://www.sciencedirect.com/topics/agricultural-and-biological-sciences/discriminant-analysis • https://www.wallstreetmojo.com/discriminant-analysis/ • https://stats.oarc.ucla.edu/stata/dae/discriminant-function-analysis/

Editor's Notes

  • #8: Homogeneous within group variances- variances among group variables are the same levels of predictors Multivariate normality within groups- when all other independent variables are held constant, the independent variable under examination should have a normal distribution No multi-collinearity- predictive power can decrease with an increased correlation between predictor variables Prior probabilities- the probability of an observation coming from a particular group in a simple random sample with replacement