SlideShare a Scribd company logo
Canonical Correlation
Introduction If we have two sets of variables, x1,...., xn and y1,….., ym, and there are correlations among the variables, then canonical correlation analysis will enable us to find linear combinations of the x's and the y's which have maximum correlation with each other.Canonical correlation begin with the observed values of two sets of variables relating to the same set of areas, and a theory or hypothesis that suggests that the two are interrelated.The overriding concern is with the structural relationship between the two sets of data  as a whole, rather than the associations between individual variables
Canonical correlation is the most general form of correlation.Multiple regression analysis is a more specific case in which  one of the sets of data contains only one variable, while product moment correlation is the most specific case in that both sets of data contain only one variable.Canonical correlation analysis is not related to factor/principal components  analysis despite certain conceptual and terminological similarities. Canonical correlation analysis is used to investigate the inter-correlation between two sets of variables, whereas factor/principal components analysis identifies the patterns of relationship within one set of data.
Difficulties in Canonical CorrelationCanonical correlation is not the easiest of techniques to follow, though the problems of comprehension are conceptual rather than mathematical.Unlike multiple regression and principal components analysis, we cannot provide a graphic device to illustrate even the simplest form. For with canonical correlation analysis we are dealing with  two sets of data. Even the most elementary example must, therefore, have at least two variables on each side and so we require 2 + 2 = 4 dimensions. Tied as we are, however, to a three dimensional world, a true understanding of the technique in the conventional cognitive/visual sense of the term, is beyond our grasp.
Conceptual OverviewData InputThe size of the matrices : There is no requirement in canonical analysis that there must be the same number of variables (columns) in each matrix, though there must be the same number of areas (rows). (There must of course be more than one variable in each set otherwise we would be dealing with multiple regression analysis)The order of the matrices :  Neither set of data is given priority in the analysis so it does not matter which we term the criteria and which the predictors. Unlike simple linear regression there is no concept of a 'dependent' set or an 'independent' set. But in practice the smaller set is always taken second as this simplifies the calculation enormously
AdvantagesUseful and powerful technique for exploring the relationships among multiple dependent and independent variables. Results obtained from a canonical analysis should suggest answers to questions concerning the number of ways in which the two sets of multiple variables are related, the strengths of the relationships.Multiple regressions are used for many-to-one relationships, canonical correlation is used for many-to-many relationships.                                       Canonical Correlation- More than one such linear correlation                                                                               relating the two sets of variables, with each                                                                             such correlation representing a different                                                                             dimension by which the independent set of                                                                             variables is related to the dependent set.
Interpretability:Although mathematically elegant, canonical  solutions are often un-           interpretable.  Furthermore, the rotation of canonical variates to           improve  interpretability is not a common practice in research, even            though it is commonplace to do this for factor analysis and principle            components analysis.Linear relationship:Another problem using canonical correlation for research is that           the algorithm used emphasizes the linear relationship between            two sets of variables. If the relationship between variables is not           linear, then using a canonical correlation for the analysis may           miss some or most of the relationship between variables.
The Canonical ProblemLatent Roots and weights Canonical ScoresResults and InterpretationLatent RootsCanonical Weights Canonical Scores
Mathematical ModelThe partitioned intercorrelation matrixwhere 	R11 is the matrix of intercorrelations among the p criteria variables	R22 is the matrix of intercorrelations among the q predictor variables	R12 is the matrix of intercorrelations of the p criteria with the q predictors	R21 is the transpose of R12
The Canonical EquationThe product matrix
The canonical rootsThe significance of the roots:Wilk’s Lambda (ᴧ) :  Bartlett’s chi squared:
The canonical vectors Weights B for the predictor variables are given by : 	Weights A for the criteria variables are given by :
The canonical scores	The scores Sa for the criteria are given by  Sa = Zp A    The scores Sb for the predictors are given by Sb = Zq B where Zp and Zq are the standardized raw data
Canonical correlation analysis-promotion bias scoring detector(a case study of American university of Nigeria(AUN))Researchers-A. O. Unegbu &James J. Adefila`
IntroductionProblem: AUN bids to keep with her value statement 		i.e. highest standards of integrity, 				transparency and academic honest.Solution: Appraise & select Faculties for promotion   		based on various promotion committees’ 			scores.Issues      : Dwindling funding, 			need for a bias free selection technique,
Research HypothesesH01 : CCA cannot detect bias scoring for any of the       	  	  candidates from any  of the named 			  	  committees with 90% confidence level.
H02: CCA cannot detect significantly whether or 		  not score-weights of each of the Promotion 	 	  Assessors have over bearing influence on the 	 	  promotability of candidates.
H03: CCA cannot at 90% level of certainity 	 	  	  discriminate between candidates that have 	 	  earned promotion scores and those that could not 	  from various promotion committees of the 	 	  university.Research objectives To test the efficacy of Canonical Correlation Analysis as a relevant statistical tool for adaption  in bias free promotion score processing and promotion bias scoring detector so as to ensure fairness, integrity, transparency and academic honest in analysis of applicants’ score and in reaching Faculties’ promotion decision.
Steps of the ResearchData collectionManual computationsSPSS analysisTest the Hypothesis
AUN promotion procedure Weights:The benchmark for promotion is securing a weighted average  score  should be  more than 65%age.
Each of the Committee’s point allocation will be based on the below criteria
Supporting documents for Teaching Effectiveness Peer evaluation  Student evaluation Course Syllabi Record of participation in teaching seminars, workshops, etcContributions  to  the  development  of  new  academic  programsFaculty awards for excellence in teaching
Scholarship, Research and Creative WorksTerminal degrees/Professional qualificationsAt  least  Five  publications,  three  of  which shall be journal articlesComputer  Software and Program  developmentCreative  work  in  the  areas  of  advertising, public  relations,  layout  design,  photography  and graphics, visual arts etc.
Service to the University, Profession and                  CommunityMembership/leadership     in     departmental,  school-wide  or university-wide committeesPlanning  or  participation  in  workshops, conferences, seminars .Evidence  of  participation  in  mentoring  or career counseling of students.Membership in Civil Society organizationsEvidence  of  service  as  external  assessor  or      external examiner on examination committees
Raw Scores of Candidates
Processed scores of the Candidates
Scores of Promotable and Non-promotable Candidates
Data InputThe data input view containing the three groups of assessors and individual assessors
SPSS Results Analyze ⇒General Linear Model⇒MultivariateSPSS classified candidates into two groups of promotable and non promotable of 5 and 9 respectively.The result leads to the rejection of Null hypothesis Ho3 which states that Canonical Correlation Analysis cannot with 90% confidence level discriminate between promotable and non promotable candidates
Cannonical correlation
Multivariate TestThe Multivariate tests indicate the effect of scores of the group and individual assessors both on status determination and bias impact on such status. The figure shows that the computed values and critical table values differences are very insignificant.
Candidate’s status determination resulting from scores across the assessors and those that might result from bias scoring are very insignificant(Wilk’s lambda value =0.041)
There is no between-status differences in the scores between assessors of both group and individuals
Rejection of Null hypothesis (Ho1) which states that Canonical Correlation Analysis cannot detect bias
The results of the table show that the scores of each assessor had a significant effect on the determination of each Candidate Status as the significance is 0.135.
Test for homogeneity of varianceOverbearing score weight influence test hypothesis is aimed at detecting across the individual assessors’ mark allocations and weights assigned to each.In this test, the assessors having low significance value mean that there is homogeneity of variance.
Cannonical correlation
This Leads to rejection of null hypothesis (Ho2) which states that Canonical Correlation Analysis cannot detect significantly whether or not score-weights of each of the promotion assessors has overbearing influence on the promotability of candidates.
Shortcomings and limitations of the process Procedures that maximize correlation between canonical variate pairs do not   necessarily lead to solutions that make logical sense. it is the canonical variates  that are actually being  interpreted and they are interpreted in pairs. a variate is interpreted by considering the pattern of variables that are highly correlated (loaded) with it. variables in one set of the solution can be very sensitive to the identity of the variables in the other set.

More Related Content

PPTX
Logistic regression with SPSS
PPTX
Path analysis
PPTX
Correlation & Regression Analysis using SPSS
ODP
Multiple Linear Regression II and ANOVA I
DOCX
Binary Logistic Regression
PPTX
What is a partial correlation?
PDF
Correlation and Simple Regression
Logistic regression with SPSS
Path analysis
Correlation & Regression Analysis using SPSS
Multiple Linear Regression II and ANOVA I
Binary Logistic Regression
What is a partial correlation?
Correlation and Simple Regression

What's hot (20)

PPTX
Canonical analysis
PPTX
Regression analysis on SPSS
PDF
Ordinal logistic regression
PDF
Logistic Regression Analysis
PPTX
Spearman's Rank order Correlation
PPTX
Logistic regression
PPTX
ANCOVA-Analysis-of-Covariance.pptx
PPTX
Factor Analysis in Research
PDF
Introduction to Generalized Linear Models
PPTX
Regression analysis
PPTX
Factor analysis
PDF
Ordinal Logistic Regression
PPT
Confidence Intervals
PPT
Regression analysis
PPTX
Correlation and regression
PPT
Logistic regression
PPTX
Cluster analysis
PDF
Multivariate Analysis
PPT
Factor analysis
PDF
7. logistics regression using spss
Canonical analysis
Regression analysis on SPSS
Ordinal logistic regression
Logistic Regression Analysis
Spearman's Rank order Correlation
Logistic regression
ANCOVA-Analysis-of-Covariance.pptx
Factor Analysis in Research
Introduction to Generalized Linear Models
Regression analysis
Factor analysis
Ordinal Logistic Regression
Confidence Intervals
Regression analysis
Correlation and regression
Logistic regression
Cluster analysis
Multivariate Analysis
Factor analysis
7. logistics regression using spss
Ad

Viewers also liked (9)

PDF
Canonical Correlation Analysis
PDF
Correspondentie Analyse
PPT
Correspondence Analysis
PPTX
Exploratory factor analysis
PPT
Research Methology -Factor Analyses
ODP
Exploratory factor analysis
PPTX
Factor analysis (fa)
Canonical Correlation Analysis
Correspondentie Analyse
Correspondence Analysis
Exploratory factor analysis
Research Methology -Factor Analyses
Exploratory factor analysis
Factor analysis (fa)
Ad

Similar to Cannonical correlation (20)

PDF
cannonicalpresentation-110505114327-phpapp01.pdf
PDF
Canonical correlation
PPTX
Canonical Correlation in SPSS - Merging Multiple Variables for Deeper Insights
PPTX
An-Introduction-to-Correlation-and-Linear-Regression FYBSc(IT) SNK.pptx
PPTX
Canonical correlation analysis()
PPT
Correlational research
PPT
Sumit presentation
PPTX
Multivariate Variate Techniques
PPTX
Module 4- Correlation & Regression Analysis.pptx
DOCX
Data Analytics Notes
PPTX
Statistical testing.pptxstatisctics bachelors
PPTX
Statistical testing.pptxhsnskemenemkwkwmwjnw
PDF
Deepak_DAI101_Data_Anal_lecture6 (1).pdf
PPTX
Data processing
PPTX
Regression analysis
PPT
Canonical Correlation Analysis
PDF
Aronchpt3correlation
PPTX
Multivariate Analysis Degree of association between two variable - Test of Ho...
PPT
Econometrics
cannonicalpresentation-110505114327-phpapp01.pdf
Canonical correlation
Canonical Correlation in SPSS - Merging Multiple Variables for Deeper Insights
An-Introduction-to-Correlation-and-Linear-Regression FYBSc(IT) SNK.pptx
Canonical correlation analysis()
Correlational research
Sumit presentation
Multivariate Variate Techniques
Module 4- Correlation & Regression Analysis.pptx
Data Analytics Notes
Statistical testing.pptxstatisctics bachelors
Statistical testing.pptxhsnskemenemkwkwmwjnw
Deepak_DAI101_Data_Anal_lecture6 (1).pdf
Data processing
Regression analysis
Canonical Correlation Analysis
Aronchpt3correlation
Multivariate Analysis Degree of association between two variable - Test of Ho...
Econometrics

More from domsr (20)

PDF
performance of banks india
PPTX
Performance of banks in india 2011
PPT
Uzbekistan Analysis
PPT
Switzerland Analysis
PPTX
Main
PDF
Brazil country analysis report [team 2]
PPT
Time value of money
PPTX
Supply
PPT
Structure
DOC
Report
PPT
Pricecontrol
PDF
Perfectcompetition
PPT
National income
PPT
National income & related concepts
PPT
Price Control
PPT
Gdp+deflator+vs+cpi
PPT
Foreign exchange
PPT
Economics intro
PPTX
Demand analysis
PPT
Comp mono
performance of banks india
Performance of banks in india 2011
Uzbekistan Analysis
Switzerland Analysis
Main
Brazil country analysis report [team 2]
Time value of money
Supply
Structure
Report
Pricecontrol
Perfectcompetition
National income
National income & related concepts
Price Control
Gdp+deflator+vs+cpi
Foreign exchange
Economics intro
Demand analysis
Comp mono

Recently uploaded (20)

PDF
Electronic commerce courselecture one. Pdf
PDF
Machine learning based COVID-19 study performance prediction
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PDF
Encapsulation_ Review paper, used for researhc scholars
PDF
Network Security Unit 5.pdf for BCA BBA.
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PDF
cuic standard and advanced reporting.pdf
PPTX
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
DOCX
The AUB Centre for AI in Media Proposal.docx
PPTX
Cloud computing and distributed systems.
PDF
CIFDAQ's Market Insight: SEC Turns Pro Crypto
PDF
Spectral efficient network and resource selection model in 5G networks
PDF
Encapsulation theory and applications.pdf
PPT
Teaching material agriculture food technology
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PPTX
MYSQL Presentation for SQL database connectivity
Electronic commerce courselecture one. Pdf
Machine learning based COVID-19 study performance prediction
Reach Out and Touch Someone: Haptics and Empathic Computing
Encapsulation_ Review paper, used for researhc scholars
Network Security Unit 5.pdf for BCA BBA.
Per capita expenditure prediction using model stacking based on satellite ima...
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
cuic standard and advanced reporting.pdf
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
Digital-Transformation-Roadmap-for-Companies.pptx
The AUB Centre for AI in Media Proposal.docx
Cloud computing and distributed systems.
CIFDAQ's Market Insight: SEC Turns Pro Crypto
Spectral efficient network and resource selection model in 5G networks
Encapsulation theory and applications.pdf
Teaching material agriculture food technology
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
Mobile App Security Testing_ A Comprehensive Guide.pdf
MYSQL Presentation for SQL database connectivity

Cannonical correlation

  • 2. Introduction If we have two sets of variables, x1,...., xn and y1,….., ym, and there are correlations among the variables, then canonical correlation analysis will enable us to find linear combinations of the x's and the y's which have maximum correlation with each other.Canonical correlation begin with the observed values of two sets of variables relating to the same set of areas, and a theory or hypothesis that suggests that the two are interrelated.The overriding concern is with the structural relationship between the two sets of data as a whole, rather than the associations between individual variables
  • 3. Canonical correlation is the most general form of correlation.Multiple regression analysis is a more specific case in which one of the sets of data contains only one variable, while product moment correlation is the most specific case in that both sets of data contain only one variable.Canonical correlation analysis is not related to factor/principal components analysis despite certain conceptual and terminological similarities. Canonical correlation analysis is used to investigate the inter-correlation between two sets of variables, whereas factor/principal components analysis identifies the patterns of relationship within one set of data.
  • 4. Difficulties in Canonical CorrelationCanonical correlation is not the easiest of techniques to follow, though the problems of comprehension are conceptual rather than mathematical.Unlike multiple regression and principal components analysis, we cannot provide a graphic device to illustrate even the simplest form. For with canonical correlation analysis we are dealing with two sets of data. Even the most elementary example must, therefore, have at least two variables on each side and so we require 2 + 2 = 4 dimensions. Tied as we are, however, to a three dimensional world, a true understanding of the technique in the conventional cognitive/visual sense of the term, is beyond our grasp.
  • 5. Conceptual OverviewData InputThe size of the matrices : There is no requirement in canonical analysis that there must be the same number of variables (columns) in each matrix, though there must be the same number of areas (rows). (There must of course be more than one variable in each set otherwise we would be dealing with multiple regression analysis)The order of the matrices : Neither set of data is given priority in the analysis so it does not matter which we term the criteria and which the predictors. Unlike simple linear regression there is no concept of a 'dependent' set or an 'independent' set. But in practice the smaller set is always taken second as this simplifies the calculation enormously
  • 6. AdvantagesUseful and powerful technique for exploring the relationships among multiple dependent and independent variables. Results obtained from a canonical analysis should suggest answers to questions concerning the number of ways in which the two sets of multiple variables are related, the strengths of the relationships.Multiple regressions are used for many-to-one relationships, canonical correlation is used for many-to-many relationships. Canonical Correlation- More than one such linear correlation relating the two sets of variables, with each such correlation representing a different dimension by which the independent set of variables is related to the dependent set.
  • 7. Interpretability:Although mathematically elegant, canonical solutions are often un- interpretable. Furthermore, the rotation of canonical variates to improve interpretability is not a common practice in research, even though it is commonplace to do this for factor analysis and principle components analysis.Linear relationship:Another problem using canonical correlation for research is that the algorithm used emphasizes the linear relationship between two sets of variables. If the relationship between variables is not linear, then using a canonical correlation for the analysis may miss some or most of the relationship between variables.
  • 8. The Canonical ProblemLatent Roots and weights Canonical ScoresResults and InterpretationLatent RootsCanonical Weights Canonical Scores
  • 9. Mathematical ModelThe partitioned intercorrelation matrixwhere R11 is the matrix of intercorrelations among the p criteria variables R22 is the matrix of intercorrelations among the q predictor variables R12 is the matrix of intercorrelations of the p criteria with the q predictors R21 is the transpose of R12
  • 10. The Canonical EquationThe product matrix
  • 11. The canonical rootsThe significance of the roots:Wilk’s Lambda (ᴧ) : Bartlett’s chi squared:
  • 12. The canonical vectors Weights B for the predictor variables are given by : Weights A for the criteria variables are given by :
  • 13. The canonical scores The scores Sa for the criteria are given by Sa = Zp A The scores Sb for the predictors are given by Sb = Zq B where Zp and Zq are the standardized raw data
  • 14. Canonical correlation analysis-promotion bias scoring detector(a case study of American university of Nigeria(AUN))Researchers-A. O. Unegbu &James J. Adefila`
  • 15. IntroductionProblem: AUN bids to keep with her value statement i.e. highest standards of integrity, transparency and academic honest.Solution: Appraise & select Faculties for promotion based on various promotion committees’ scores.Issues : Dwindling funding, need for a bias free selection technique,
  • 16. Research HypothesesH01 : CCA cannot detect bias scoring for any of the candidates from any of the named committees with 90% confidence level.
  • 17. H02: CCA cannot detect significantly whether or not score-weights of each of the Promotion Assessors have over bearing influence on the promotability of candidates.
  • 18. H03: CCA cannot at 90% level of certainity discriminate between candidates that have earned promotion scores and those that could not from various promotion committees of the university.Research objectives To test the efficacy of Canonical Correlation Analysis as a relevant statistical tool for adaption in bias free promotion score processing and promotion bias scoring detector so as to ensure fairness, integrity, transparency and academic honest in analysis of applicants’ score and in reaching Faculties’ promotion decision.
  • 19. Steps of the ResearchData collectionManual computationsSPSS analysisTest the Hypothesis
  • 20. AUN promotion procedure Weights:The benchmark for promotion is securing a weighted average score should be more than 65%age.
  • 21. Each of the Committee’s point allocation will be based on the below criteria
  • 22. Supporting documents for Teaching Effectiveness Peer evaluation Student evaluation Course Syllabi Record of participation in teaching seminars, workshops, etcContributions to the development of new academic programsFaculty awards for excellence in teaching
  • 23. Scholarship, Research and Creative WorksTerminal degrees/Professional qualificationsAt least Five publications, three of which shall be journal articlesComputer Software and Program developmentCreative work in the areas of advertising, public relations, layout design, photography and graphics, visual arts etc.
  • 24. Service to the University, Profession and CommunityMembership/leadership in departmental, school-wide or university-wide committeesPlanning or participation in workshops, conferences, seminars .Evidence of participation in mentoring or career counseling of students.Membership in Civil Society organizationsEvidence of service as external assessor or external examiner on examination committees
  • 25. Raw Scores of Candidates
  • 26. Processed scores of the Candidates
  • 27. Scores of Promotable and Non-promotable Candidates
  • 28. Data InputThe data input view containing the three groups of assessors and individual assessors
  • 29. SPSS Results Analyze ⇒General Linear Model⇒MultivariateSPSS classified candidates into two groups of promotable and non promotable of 5 and 9 respectively.The result leads to the rejection of Null hypothesis Ho3 which states that Canonical Correlation Analysis cannot with 90% confidence level discriminate between promotable and non promotable candidates
  • 31. Multivariate TestThe Multivariate tests indicate the effect of scores of the group and individual assessors both on status determination and bias impact on such status. The figure shows that the computed values and critical table values differences are very insignificant.
  • 32. Candidate’s status determination resulting from scores across the assessors and those that might result from bias scoring are very insignificant(Wilk’s lambda value =0.041)
  • 33. There is no between-status differences in the scores between assessors of both group and individuals
  • 34. Rejection of Null hypothesis (Ho1) which states that Canonical Correlation Analysis cannot detect bias
  • 35. The results of the table show that the scores of each assessor had a significant effect on the determination of each Candidate Status as the significance is 0.135.
  • 36. Test for homogeneity of varianceOverbearing score weight influence test hypothesis is aimed at detecting across the individual assessors’ mark allocations and weights assigned to each.In this test, the assessors having low significance value mean that there is homogeneity of variance.
  • 38. This Leads to rejection of null hypothesis (Ho2) which states that Canonical Correlation Analysis cannot detect significantly whether or not score-weights of each of the promotion assessors has overbearing influence on the promotability of candidates.
  • 39. Shortcomings and limitations of the process Procedures that maximize correlation between canonical variate pairs do not necessarily lead to solutions that make logical sense. it is the canonical variates that are actually being interpreted and they are interpreted in pairs. a variate is interpreted by considering the pattern of variables that are highly correlated (loaded) with it. variables in one set of the solution can be very sensitive to the identity of the variables in the other set.
  • 40. The pairings of canonical variates must be independent of all other pairs.Conclusion from research analysis:From Table it can be seen that the order of promotable rankings but application of Canonical Correlation Analysis results produced different ranking of candidates.Rejection of Null Hypothesis(H03):The results as shown in tables indicate the Canonical Correlation Analysis status discriminatory ability of grouping Candidates into promotable and Non-promotable status. The result leads to the rejection of Null hypothesis Ho3 which states that Canonical Correlation Analysis cannot with 90% confidence level discriminate between promotable and nonpromotable candidates based on their earned scores.
  • 41. Continued………….Rejection of Null Hypothesis(Ho1):Pillar’s trace of 0.041, Wilk’s Lambda of 0.041, Hotelling’s trace of 0.041 and Roy’s Largest Root of 0.041 - all of them showed that p<0.05, it means that there is no between-status differences in the scores between assessors of both group and individuals, thereby leading to the rejection of Null hypothesis (Ho1) which states that Canonical Correlation Analysis cannot detect bias.Rejection of Null Hypothesis(Ho2):For Group Assessors - Internal Assessors with p=0.096, External Academic Assessors with p=0.526 and The President’s Assessment with p=0.0001, shows that except that of the President, the weight assigned to scores of other two are group assessors are insignificant- lead us to reject the Null hypothesis (Ho2) which states that Canonical Correlation Analysis cannot detect significantly whether or not score-weights of each of the promotion assessors has overbearing influence on the promotability of candidates.

Editor's Notes

  • #3: In employment example the area was different zones, and in another example the area were particular people ( 3 psychological variables , 4 academic variables and 1 gender variable and area were 600 students )