SlideShare a Scribd company logo
Statistical analysis
Mean, standard deviation, reliability,
correlation, and regression
Data entry in SPSS
• SPSS Statistics is a software package used for logical
batched and non-batched statistical analysis.
• The data entry in SPSS is crucial for smoother
analysis.
• Refer to this link for the entry of data in SPSS
https://www.youtube.com/watch?v=BvwNPRy6HJU
Descriptive analysis
• Mean : The mean is the average of all numbers
and is sometimes called the arithmetic mean.
• Standard deviation : a quantity expressing by
how much the members of a group differ from
the mean value for the group.
Statistical analysis in SPSS_
Statistical analysis in SPSS_
Statistical analysis in SPSS_
Statistical analysis in SPSS_
Total value
Statistical analysis in SPSS_
Statistical analysis in SPSS_
Statistical analysis in SPSS_
Statistical analysis in SPSS_
Statistical analysis in SPSS_
• Divide the outcome of mean and standard deviation
by the number of items for each scale.
• Like: If value given in table for TC is 12.47 and
number of items is 5, then 12.47/5 = 2.494
Interpretation:
This indicate that respondents considered their training
as less helpful as the mean estimate is low on the scale
of 7 point Likert scale.
Reliability test
• Reliability -A test is considered reliable if we get the same result
repeatedly.
• Cronbach’s alpha, α (or coefficient alpha), developed by Lee
Cronbach in 1951, measures reliability, or internal consistency.
“Reliability” is how well a test measures what it should. For example,
a company might give a job satisfaction survey to their employees.
High reliability means it measures job satisfaction, while low
reliability means it measures something else (or possibly nothing at
all).
• Cronbach’s alpha tests to see if multiple-question Likert scale surveys
are reliable. These questions measure latent variables — hidden or
unobservable variables like: a person’s conscientiousness, neurosis or
openness. These are very difficult to measure in real life. Cronbach’s
alpha will tell you if the test you have designed is accurately
measuring the variable of interest.
Statistical analysis in SPSS_
Statistical analysis in SPSS_
Statistical analysis in SPSS_
Statistical analysis in SPSS_
Statistical analysis in SPSS_
Interpretation
• Reliability above 0.70 is acceptable level to
indicate that the scale used to collect data
provides consistent results and thus is reliable
for further analysis
Correlation
• Correlation analysis is a method of statistical
evaluation used to study the strength of a relationship
between two, numerically measured, continuous
variables (e.g. height and weight).
• Pearson’s product-moment coefficient is the
measurement of correlation and ranges (depending on
the correlation) between +1 and -1. +1 indicates the
strongest positive correlation possible, and -1
indicates the strongest negative correlation possible.
• Put all the total values in SPSS
Statistical analysis in SPSS_
Statistical analysis in SPSS_
Statistical analysis in SPSS_
Correlation
estimates
P values
• As correlation estimates between each
variable is positive and significant (p value
<0.001), this indicates all the variables are
related to each other.
• This gives basis for further regression analysis
to understand the causal relationship between
variables.
Regression analysis
• Regression analysis is used to model the
relationship between a response variable and
one or more predictor variables.
• Eg:
IV DV
Steps
1. Standardize the variable data
In statistics, standardized coefficients or beta
coefficients are the estimates resulting from
a regression analysis that have been
standardized so that the variances of
dependent and independent variables are 1.
Statistical analysis in SPSS_
Statistical analysis in SPSS_
Standardized
estimates
CONTD…
2. Put the data in regression model
IV: Independent variable = TC
M: Mediator = SE
DV: Dependent variable = CAA
Mediation analysis
• First enter IV in independent variable and
mediator as dependent variable
• Then, put DV as dependent variable and
IV as independent variable.
• Then click on ‘next’
• And put mediator as independent variable
TC
SE
CAA
H1
H2
H3
HYPOTHESISED MODEL
Statistical analysis in SPSS_
MEDIATOR
INDEPEDENT
VARIABLE
Beta
coefficient
P value
R square
estimate
Dependent
variable
Independen
variable
Statistical analysis in SPSS_
Mediator
Dependent
variable
R square
estimate
P value
Beta
estimate
R square estimate
• R-squared is a statistical measure of how close the
data are to the fitted regression line. It is also known
as the coefficient of determination, or the coefficient
of multiple determination for multiple regression.
• R-squared is always between 0 and 100%:
• 0% indicates that the model explains none of the
variability of the response data around its mean.
• 100% indicates that the model explains all the
variability of the response data around its mean.
• In general, the higher the R-squared, the better the
model fits your data.
P value
• The p-value for each term tests the null hypothesis
that the coefficient is equal to zero (no effect). A low
p-value (< 0.05) indicates that you can reject the null
hypothesis. In other words, a predictor that has a low
p-value is likely to be a meaningful addition to your
model because changes in the predictor's value are
related to changes in the response variable.
• Conversely, a larger (insignificant) p-value suggests
that changes in the predictor are not associated with
changes in the response.
Beta coefficient
A standardized beta coefficient compares the strength of the effect of
each individual independent variable to the dependent variable. The
higher the absolute value of the beta coefficient, the stronger the
effect.
For example, a beta of -.9 has a stronger effect than a beta of +.8.
Standardized beta coefficients have standard deviations as their units.
This means the variables can be easily compared to each other. In
other words, standardized beta coefficients are the coefficients that
you would get if the variables in the regression were all converted
to z-scores before running the analysis.
Interpretation
• As R square changed from 0.250 to 0.299, this
indicate addition of mediator in equation
contributes towards relationship between IV
and DV.
• As P value is below 0.05, this indicate chances
of Type I and type II error is less than 5%.
Contd..
• Effect of TC on SE = 0.526, p < 0.05
• Effect of SE on CAA = 0.261, P< 0.05
• Effect of TC on CAA = 0.362, P< 0.05
• As all the relationship is significant, this indicate the
SE has a significant mediating role between TC and
CAA.
TC
SE
CAA
Β = 0.362***
Β = 0.526***
Β = 0.261***
• Note that a mediational model is a causal
model.
• For example, the mediator is presumed to
cause the outcome and not vice versa. If the
presumed causal model is not correct, the
results from the mediational analysis are likely
of little value.
• Mediation is not defined statistically; rather
statistics can be used to evaluate a presumed
mediational model.
Baron and Kenny mediation steps
• The above steps of mediation is based on the four step mediation
analysis test proposed by Baron and Kenny (1986), Judd and Kenny
(1981), and James and Brett (1984).
• Thus, to indicate mediation four steps are to be analyzed-
Step 1: Show that the causal variable is correlated with the
outcome. Use Y as the criterion variable in a regression equation and X
as a predictor (estimate and test path c in the above figure). This step
establishes that there is an effect that may be mediated.
Step 2: Show that the causal variable is correlated with the
mediator. Use M as the criterion variable in the regression equation and
X as a predictor (estimate and test path a). This step essentially involves
treating the mediator as if it were an outcome variable.
Step 3: Show that the mediator affects the outcome variable. Use
Y as the criterion variable in a regression equation and X and M
as predictors (estimate and test path b). It is not sufficient just to
correlate the mediator with the outcome because the mediator and
the outcome may be correlated because they are both caused by
the causal variable X. Thus, the causal variable must be
controlled in establishing the effect of the mediator on the
outcome.
Step 4: To establish that M completely mediates the X-Y
relationship, the effect of X on Y controlling for M (path c')
should be zero (see discussion below on significance
testing). The effects in both Steps 3 and 4 are estimated in the
same equation.
Final mediation decision
• If all four of these steps are met, then the data are
consistent with the hypothesis that variable
M completely mediates the X-Y relationship, and if
the first three steps are met but the Step 4 is not,
then partial mediation is indicated. Meeting these
steps does not, however, conclusively establish that
mediation has occurred because there are other
(perhaps less plausible) models that are consistent
with the data. Some of these models are considered
later in the Specification Error section.
Statistical analysis in SPSS_

More Related Content

ODP
ANOVA II
PPTX
One Way ANOVA and Two Way ANOVA using R
PPT
Estimation and hypothesis testing 1 (graduate statistics2)
PPT
Factor analysis
PDF
Binary OR Binomial logistic regression
PPT
Factor analysis
PPTX
Statistical analysis and interpretation
PPTX
Inferential statistics powerpoint
ANOVA II
One Way ANOVA and Two Way ANOVA using R
Estimation and hypothesis testing 1 (graduate statistics2)
Factor analysis
Binary OR Binomial logistic regression
Factor analysis
Statistical analysis and interpretation
Inferential statistics powerpoint

What's hot (20)

PDF
Ordinal logistic regression
PDF
Ordinal Logistic Regression
PPTX
Statistical inference concept, procedure of hypothesis testing
PPTX
5 numerical descriptive statitics
PPTX
Factor Analysis
PPTX
PROCEDURE FOR TESTING HYPOTHESIS
PDF
Logistic Ordinal Regression
PPTX
05 confidence interval & probability statements
PPT
Quantitative analysis
PPTX
Factor analysis
PDF
Difference-in-Difference Methods
PPTX
Hypothesis testing
PDF
Confirmatory Factor Analysis
PPTX
Analysis of variance (ANOVA)
PPT
Discriminant analysis
PPT
Simple Linier Regression
PPTX
Scatter plot- Complete
PDF
Survival analysis 1
PPT
Data Analysis with SPSS : One-way ANOVA
PPT
Chapter 15
Ordinal logistic regression
Ordinal Logistic Regression
Statistical inference concept, procedure of hypothesis testing
5 numerical descriptive statitics
Factor Analysis
PROCEDURE FOR TESTING HYPOTHESIS
Logistic Ordinal Regression
05 confidence interval & probability statements
Quantitative analysis
Factor analysis
Difference-in-Difference Methods
Hypothesis testing
Confirmatory Factor Analysis
Analysis of variance (ANOVA)
Discriminant analysis
Simple Linier Regression
Scatter plot- Complete
Survival analysis 1
Data Analysis with SPSS : One-way ANOVA
Chapter 15
Ad

Similar to Statistical analysis in SPSS_ (20)

PPTX
Regression &amp; correlation coefficient
PPT
Correlation Research_Arslan Sheikh_PhD Scholar
PDF
hành-trangFFFFFFFFFFFFFFFFFFFFFFD-BA.pdf
PPTX
Measure of Association
PDF
Kendall's ,partial correlation and scatter plot
PDF
Dr. A Sumathi - LINEARITY CONCEPT OF SIGNIFICANCE.pdf
PDF
Section 5 - Improve Phase pdf Lean Six sigma
PPTX
Chapter no. 05.pptx statistic analysis for masnagers
PPTX
Correlation and Regression
PPTX
6 the six uContinuous data analysis.pptx
PPTX
correlation Types in statistical Education
PPTX
Hm306 week 5
PPTX
Hm306 week 5
PPTX
PPTX
s.analysis
PPTX
Correlation research
PPTX
What is Simple Linear Regression and How Can an Enterprise Use this Technique...
PPTX
Pearson's correlation coefficient 
PPTX
Regression presentation
PPT
statistics in nursing
Regression &amp; correlation coefficient
Correlation Research_Arslan Sheikh_PhD Scholar
hành-trangFFFFFFFFFFFFFFFFFFFFFFD-BA.pdf
Measure of Association
Kendall's ,partial correlation and scatter plot
Dr. A Sumathi - LINEARITY CONCEPT OF SIGNIFICANCE.pdf
Section 5 - Improve Phase pdf Lean Six sigma
Chapter no. 05.pptx statistic analysis for masnagers
Correlation and Regression
6 the six uContinuous data analysis.pptx
correlation Types in statistical Education
Hm306 week 5
Hm306 week 5
s.analysis
Correlation research
What is Simple Linear Regression and How Can an Enterprise Use this Technique...
Pearson's correlation coefficient 
Regression presentation
statistics in nursing
Ad

More from Dr. Anugamini Priya (11)

PPTX
Introduction to Industrial Relations
PPTX
Data collection
PPTX
Questionnaire development
PPT
Types of Business organisation
PPTX
Organisational culture and change management
PPT
EXTRA ROLE BEHAVIOR
PPT
Japan culture business environment
PPT
MOA AND AOA
PPTX
Organizational justice
PPT
need for teachers - learning attitude and internal motivation
PPT
Arcs strategies
Introduction to Industrial Relations
Data collection
Questionnaire development
Types of Business organisation
Organisational culture and change management
EXTRA ROLE BEHAVIOR
Japan culture business environment
MOA AND AOA
Organizational justice
need for teachers - learning attitude and internal motivation
Arcs strategies

Recently uploaded (20)

PDF
SAP S4 Hana Brochure 3 (PTS SYSTEMS AND SOLUTIONS)
PDF
medical staffing services at VALiNTRY
PPTX
Agentic AI : A Practical Guide. Undersating, Implementing and Scaling Autono...
PDF
How to Migrate SBCGlobal Email to Yahoo Easily
PDF
Upgrade and Innovation Strategies for SAP ERP Customers
PDF
Raksha Bandhan Grocery Pricing Trends in India 2025.pdf
PDF
Adobe Illustrator 28.6 Crack My Vision of Vector Design
PDF
Design an Analysis of Algorithms I-SECS-1021-03
PPTX
Essential Infomation Tech presentation.pptx
PDF
top salesforce developer skills in 2025.pdf
PDF
Design an Analysis of Algorithms II-SECS-1021-03
PDF
Which alternative to Crystal Reports is best for small or large businesses.pdf
PDF
EN-Survey-Report-SAP-LeanIX-EA-Insights-2025.pdf
PDF
System and Network Administraation Chapter 3
PDF
Understanding Forklifts - TECH EHS Solution
PDF
Audit Checklist Design Aligning with ISO, IATF, and Industry Standards — Omne...
PDF
wealthsignaloriginal-com-DS-text-... (1).pdf
PDF
Claude Code: Everyone is a 10x Developer - A Comprehensive AI-Powered CLI Tool
PDF
Why TechBuilder is the Future of Pickup and Delivery App Development (1).pdf
PPTX
ai tools demonstartion for schools and inter college
SAP S4 Hana Brochure 3 (PTS SYSTEMS AND SOLUTIONS)
medical staffing services at VALiNTRY
Agentic AI : A Practical Guide. Undersating, Implementing and Scaling Autono...
How to Migrate SBCGlobal Email to Yahoo Easily
Upgrade and Innovation Strategies for SAP ERP Customers
Raksha Bandhan Grocery Pricing Trends in India 2025.pdf
Adobe Illustrator 28.6 Crack My Vision of Vector Design
Design an Analysis of Algorithms I-SECS-1021-03
Essential Infomation Tech presentation.pptx
top salesforce developer skills in 2025.pdf
Design an Analysis of Algorithms II-SECS-1021-03
Which alternative to Crystal Reports is best for small or large businesses.pdf
EN-Survey-Report-SAP-LeanIX-EA-Insights-2025.pdf
System and Network Administraation Chapter 3
Understanding Forklifts - TECH EHS Solution
Audit Checklist Design Aligning with ISO, IATF, and Industry Standards — Omne...
wealthsignaloriginal-com-DS-text-... (1).pdf
Claude Code: Everyone is a 10x Developer - A Comprehensive AI-Powered CLI Tool
Why TechBuilder is the Future of Pickup and Delivery App Development (1).pdf
ai tools demonstartion for schools and inter college

Statistical analysis in SPSS_

  • 1. Statistical analysis Mean, standard deviation, reliability, correlation, and regression
  • 2. Data entry in SPSS • SPSS Statistics is a software package used for logical batched and non-batched statistical analysis. • The data entry in SPSS is crucial for smoother analysis. • Refer to this link for the entry of data in SPSS https://www.youtube.com/watch?v=BvwNPRy6HJU
  • 3. Descriptive analysis • Mean : The mean is the average of all numbers and is sometimes called the arithmetic mean. • Standard deviation : a quantity expressing by how much the members of a group differ from the mean value for the group.
  • 14. • Divide the outcome of mean and standard deviation by the number of items for each scale. • Like: If value given in table for TC is 12.47 and number of items is 5, then 12.47/5 = 2.494 Interpretation: This indicate that respondents considered their training as less helpful as the mean estimate is low on the scale of 7 point Likert scale.
  • 15. Reliability test • Reliability -A test is considered reliable if we get the same result repeatedly. • Cronbach’s alpha, α (or coefficient alpha), developed by Lee Cronbach in 1951, measures reliability, or internal consistency. “Reliability” is how well a test measures what it should. For example, a company might give a job satisfaction survey to their employees. High reliability means it measures job satisfaction, while low reliability means it measures something else (or possibly nothing at all). • Cronbach’s alpha tests to see if multiple-question Likert scale surveys are reliable. These questions measure latent variables — hidden or unobservable variables like: a person’s conscientiousness, neurosis or openness. These are very difficult to measure in real life. Cronbach’s alpha will tell you if the test you have designed is accurately measuring the variable of interest.
  • 21. Interpretation • Reliability above 0.70 is acceptable level to indicate that the scale used to collect data provides consistent results and thus is reliable for further analysis
  • 22. Correlation • Correlation analysis is a method of statistical evaluation used to study the strength of a relationship between two, numerically measured, continuous variables (e.g. height and weight). • Pearson’s product-moment coefficient is the measurement of correlation and ranges (depending on the correlation) between +1 and -1. +1 indicates the strongest positive correlation possible, and -1 indicates the strongest negative correlation possible. • Put all the total values in SPSS
  • 27. • As correlation estimates between each variable is positive and significant (p value <0.001), this indicates all the variables are related to each other. • This gives basis for further regression analysis to understand the causal relationship between variables.
  • 28. Regression analysis • Regression analysis is used to model the relationship between a response variable and one or more predictor variables. • Eg: IV DV
  • 29. Steps 1. Standardize the variable data In statistics, standardized coefficients or beta coefficients are the estimates resulting from a regression analysis that have been standardized so that the variances of dependent and independent variables are 1.
  • 33. CONTD… 2. Put the data in regression model IV: Independent variable = TC M: Mediator = SE DV: Dependent variable = CAA
  • 34. Mediation analysis • First enter IV in independent variable and mediator as dependent variable • Then, put DV as dependent variable and IV as independent variable. • Then click on ‘next’ • And put mediator as independent variable
  • 43. R square estimate • R-squared is a statistical measure of how close the data are to the fitted regression line. It is also known as the coefficient of determination, or the coefficient of multiple determination for multiple regression. • R-squared is always between 0 and 100%: • 0% indicates that the model explains none of the variability of the response data around its mean. • 100% indicates that the model explains all the variability of the response data around its mean. • In general, the higher the R-squared, the better the model fits your data.
  • 44. P value • The p-value for each term tests the null hypothesis that the coefficient is equal to zero (no effect). A low p-value (< 0.05) indicates that you can reject the null hypothesis. In other words, a predictor that has a low p-value is likely to be a meaningful addition to your model because changes in the predictor's value are related to changes in the response variable. • Conversely, a larger (insignificant) p-value suggests that changes in the predictor are not associated with changes in the response.
  • 45. Beta coefficient A standardized beta coefficient compares the strength of the effect of each individual independent variable to the dependent variable. The higher the absolute value of the beta coefficient, the stronger the effect. For example, a beta of -.9 has a stronger effect than a beta of +.8. Standardized beta coefficients have standard deviations as their units. This means the variables can be easily compared to each other. In other words, standardized beta coefficients are the coefficients that you would get if the variables in the regression were all converted to z-scores before running the analysis.
  • 46. Interpretation • As R square changed from 0.250 to 0.299, this indicate addition of mediator in equation contributes towards relationship between IV and DV. • As P value is below 0.05, this indicate chances of Type I and type II error is less than 5%.
  • 47. Contd.. • Effect of TC on SE = 0.526, p < 0.05 • Effect of SE on CAA = 0.261, P< 0.05 • Effect of TC on CAA = 0.362, P< 0.05 • As all the relationship is significant, this indicate the SE has a significant mediating role between TC and CAA. TC SE CAA Β = 0.362*** Β = 0.526*** Β = 0.261***
  • 48. • Note that a mediational model is a causal model. • For example, the mediator is presumed to cause the outcome and not vice versa. If the presumed causal model is not correct, the results from the mediational analysis are likely of little value. • Mediation is not defined statistically; rather statistics can be used to evaluate a presumed mediational model.
  • 49. Baron and Kenny mediation steps • The above steps of mediation is based on the four step mediation analysis test proposed by Baron and Kenny (1986), Judd and Kenny (1981), and James and Brett (1984). • Thus, to indicate mediation four steps are to be analyzed- Step 1: Show that the causal variable is correlated with the outcome. Use Y as the criterion variable in a regression equation and X as a predictor (estimate and test path c in the above figure). This step establishes that there is an effect that may be mediated. Step 2: Show that the causal variable is correlated with the mediator. Use M as the criterion variable in the regression equation and X as a predictor (estimate and test path a). This step essentially involves treating the mediator as if it were an outcome variable.
  • 50. Step 3: Show that the mediator affects the outcome variable. Use Y as the criterion variable in a regression equation and X and M as predictors (estimate and test path b). It is not sufficient just to correlate the mediator with the outcome because the mediator and the outcome may be correlated because they are both caused by the causal variable X. Thus, the causal variable must be controlled in establishing the effect of the mediator on the outcome. Step 4: To establish that M completely mediates the X-Y relationship, the effect of X on Y controlling for M (path c') should be zero (see discussion below on significance testing). The effects in both Steps 3 and 4 are estimated in the same equation.
  • 51. Final mediation decision • If all four of these steps are met, then the data are consistent with the hypothesis that variable M completely mediates the X-Y relationship, and if the first three steps are met but the Step 4 is not, then partial mediation is indicated. Meeting these steps does not, however, conclusively establish that mediation has occurred because there are other (perhaps less plausible) models that are consistent with the data. Some of these models are considered later in the Specification Error section.

Editor's Notes

  • #44: http://blog.minitab.com/blog/adventures-in-statistics-2/regression-analysis-how-do-i-interpret-r-squared-and-assess-the-goodness-of-fit
  • #45: http://blog.minitab.com/blog/adventures-in-statistics-2/how-to-interpret-regression-analysis-results-p-values-and-coefficients
  • #46: http://www.statisticshowto.com/standardized-beta-coefficient/
  • #48: *** indicates p < 0.05