SlideShare a Scribd company logo
Introduction to Applied Statistics and Applied Statistical Methods Practical guidelines
Prof. Dr. Chang Zhu page 1
Table of Contents
LECTURE 2.......................................................................................................................................................... 2
EXPLORING DATA/ASSUMPTION TESTING........................................................................................................ 2
DESCRIPTIVE STATISTICS................................................................................................................................ 2
NORMAL DISTRIBUTION TEST (the Kolmogorov-Smirnov (K-S) test) ............................................................ 3
ASSIGNMENT..................................................................................................................................................... 5
Introduction to Applied Statistics and Applied Statistical Methods Practical guidelines
Prof. Dr. Chang Zhu page 2
LECTURE 2
EXPLORING DATA/ASSUMPTION TESTING
Before proceeding to certain kind of analysis, it is important that we should explore:
• the characteristics of our data (mean, mode, median, variance, standard deviation, and range),
• the distribution of your data, if they are normally distributed or not by (1) values of skewness and
kurtosis (SPSS also provides histograms to visualize the distribution in the Descriptive Statistics
Command) and (2) tests of normality,
• the homogeneity of variance between groups (if we are to conduct analysis for groups).
DESCRIPTIVE STATISTICS
e.g. we want to know the characteristics and distribution of our data for the variable Intrinsic
_Motivation_learn (the file named sample data 1.sav)
In SPSS, choose Analyse > Descriptive Statistics > Frequencies
Select the variable Intrinsic _Motivation_learn and move it to the Variable(s) box by clicking the
button.
Click on the Statistics button to access the dialog box and select the options of your preference. After
finishing, click Continue.
Click on the Charts button to access the Frequencies: Charts dialog box. Choose Histograms (Show normal
curve on histogram), and click Continue to finish.
On the main dialog box, click OK to run the analysis.
In the Output document, you will see a table of descriptive statistics and a histogram with curve, based on
which you will have an idea of the characteristics (visual) distribution of your data.
Introduction to Applied Statistics and Applied Statistical Methods Practical guidelines
Prof. Dr. Chang Zhu page 3
NORMAL DISTRIBUTION TEST (the Kolmogorov-Smirnov (K-S) test)
To compare our data with a normally distributed data with the same mean and standard deviation, we can
use the Kolmogorov-Smirnov (K-S) test and the Shapiro-Wilk test.
Ex1. we want to know if all the scores for the variable Intrinsic_Motivation_learn are different from a
normally distributed dataset.
In SPSS, choose Analyse > Descriptive Statistics > Explore
Select the variable Intrinsic_Motivation_learn and move it to the Dependent List box by clicking the
button.
Click on the Statistics button to access the dialog box and select the Descriptive button. After finishing, click
Continue.
Click on Plots and select the option Normality plots with tests, which will provide us with the Kolmogorov-
Smirnov test and the Shapiro-Wilk test and the normal Q-Q plots.
On the main dialog box, click OK to run the analysis.
On the main dialog box, click OK to run the analysis.
In the Output document, the most important table we should look at is the table labelled Tests of
Normality.
Tests of Normality
Kolmogorov-Smirnova
Shapiro-Wilk
Statistic df Sig. Statistic df Sig.
1-5 .189 58 .000 .902 58 .000
a. Lilliefors Significance Correction
According to the result, the Kolmogorov-Smirnov test and the Shapiro-Wilk test are highly significant,
indicating that the distribution of scores for the variable Intrinsic_Motivation_learn is significantly different
from a normal distribution. In other words, the distribution is not normal.
Ex2. Let us now look at a data set selected from Field (2009), namely SPSSexam.sav, that includes data on
student performance on SPSS exam. The data set contains four variables: exam (scores), computer
(measure of computer literacy in percent), lecture (percentage of SPSS lectures attended), numeracy (a
measure of numerical ability out of 15), and uni (the university of the participants, either Duncetown or
Sussex).
Introduction to Applied Statistics and Applied Statistical Methods Practical guidelines
Prof. Dr. Chang Zhu page 4
Conduct the Kolmogorov-Smirnov (K-S) test for the variable exam (scores) for each of the two groups
(Duncetown/Sussex university) and report the result by filling the missing information in the following table
and statement.
Tip: to conduct the K-S test separately for each group, we just need to move the variable uni to the Factor
List before proceeding to the next step.
Tests of Normality
University
Kolmogorov-Smirnova
Shapiro-Wilk
Statistic df Sig. Statistic df Sig.
Percentage on SPSS exam Duncetown University .106 50 _____ .972 50 _____
Sussex University .073 50 _____ .984 50 _____
a. Lilliefors Significance Correction
*. This is a lower bound of the true significance.
The percentage on the SPSS exam, D (50) = 0.10, p (</>) ___ .05 and D (50) = 0.07, p (</>) ___ .05 are
significantly (normal/non-normal) ________________ for both the Duncetown and Sussex groups,
respectively.
(Note: the test statistic for the K-S test is denoted by the letter D in papers).
Introduction to Applied Statistics and Applied Statistical Methods Practical guidelines
Prof. Dr. Chang Zhu page 5
ASSIGNMENT
1) Use your own data set, explore your data with descriptive statistics and normal distribution tests
for 2 scale variables. Report the results in APA format.
2) Subscribe yourself to one of the groups in Pointcarre (based on the group formation in Assignment
1). For students who worked individually, you can also join one of the groups for those assignments
that need group discussion.
To do this, access the course Introduction to Applied Statistics and Statistical Methods, click on
Course group and you will find all the possible groups that you can subscribe as a member (just
subscribe to one group).
Post your group’s answer to the given question assigned to your group as indicated below, to the
forum labelled Exploring data, under the topic Questions and Answers/basic concepts in statistics
and distributions
When you post the group’s answer to the forum, remember to mention the question, e.g.
Question: What is ..?
Answer: ….
(shortly state which part of the answer you still have doubt (if any) and need other group’s support?
Extra credit is given if you can contribute to the answers of other group (giving critical feedback or
provide more information on the issue for your peers)
Group 1: How can we deal with outliers in case we detect them?
Group 2: Under what circumstances should we be cautious about using the mean as a measure of
central tendency?
Group 3: What are dependent/independent variables? Are there any other ways to refer to
dependent/independent variables in research papers?
Group 4: What are the cut-off (limit) values for skewness and kurtosis?
Group 5: For data to be normally distributed, what characteristics it should have?
Introduction to Applied Statistics and Applied Statistical Methods Practical guidelines
Prof. Dr. Chang Zhu page 6
Group 6: What is within-subjects design? What are the possible problems associated with this kind
of design?
Group 7: What are Type I and Type II error in statistics?
Group 8: What is an effect size and how is it measured?
Group 10: What is the standard error of the mean (SE)? How is it calculated?
Group 11: What does a p-value generally tell us? How can p <.05 interpreted?
Group 12: What is a z-score? How is it calculated?
Group 13: In the equation to calculate the variance, we divide the sum of squared errors by the
number of participants (N) minus 1. In this case, (N – 1) is referred to as degree of freedom. Can
you explain the concept “degree of freedom”?
Group 14: In addition to the K-S test, what are some other ways to examine the distribution of a
data set?

More Related Content

PPT
Data analysis powerpoint
PDF
Lecture notes on STS 102
PPTX
Basics of data_interpretation
PPTX
Data collection,tabulation,processing and analysis
PPT
Research methodology - Analysis of Data
PPTX
Data collection and presentation
PPTX
Research methodology unit6
PPTX
RESEARCH METHODOLOGY- PROCESSING OF DATA
Data analysis powerpoint
Lecture notes on STS 102
Basics of data_interpretation
Data collection,tabulation,processing and analysis
Research methodology - Analysis of Data
Data collection and presentation
Research methodology unit6
RESEARCH METHODOLOGY- PROCESSING OF DATA

What's hot (20)

PPTX
A seminar on quantitave data analysis
PPT
Data Interpretation
PPTX
Research Methodology-Data Processing
PPT
Quantitative data analysis
PPTX
Chapter 8 Data analysis and interpretation ( 2007 book )
PPTX
Data processing and presentation
PPTX
Data analysis copy
PPTX
Data Analysis, Presentation and Interpretation of Data
DOCX
Quantitative data analysis
PPT
Ch21 22 data analysis and interpretation
PPTX
Statistical Data Analysis | Data Analysis | Statistics Services | Data Collec...
PDF
Basic Concepts of Statistics - Lecture Notes
PPTX
Data collection and interpretation SBL1023
PPTX
data analysis techniques and statistical softwares
PPTX
Topic interpretation of data and its analysis
PPT
Introduction To Statistics
PPTX
Statistical analysis and interpretation
PPT
Quantitative data 2
A seminar on quantitave data analysis
Data Interpretation
Research Methodology-Data Processing
Quantitative data analysis
Chapter 8 Data analysis and interpretation ( 2007 book )
Data processing and presentation
Data analysis copy
Data Analysis, Presentation and Interpretation of Data
Quantitative data analysis
Ch21 22 data analysis and interpretation
Statistical Data Analysis | Data Analysis | Statistics Services | Data Collec...
Basic Concepts of Statistics - Lecture Notes
Data collection and interpretation SBL1023
data analysis techniques and statistical softwares
Topic interpretation of data and its analysis
Introduction To Statistics
Statistical analysis and interpretation
Quantitative data 2
Ad

Similar to Lecture 2 practical_guidelines_assignment (20)

PDF
Applied statistics lecture_2
PPTX
Kolmogorov-Smirnov Test.pptx
PPTX
Testing Assumptions in repeated Measures Design using SPSS
PDF
article.pdf
PPTX
Introduction to Educational statistics and measurement
PPTX
Lec 5 - Normality Testing.pptx
PPTX
Data Normality (1).pptx
PPTX
Normality test on SPSS
PPTX
Presentation 7.pptx
PDF
Stats- S.K. Mangal statistics textook in pdf
PPT
Stat 4 the normal distribution & steps of testing hypothesis
PDF
Statistics A Gentle Introduction 4th Edition Frederick L Coolidge
PPTX
PPTX
3. parametric assumptions
PPTX
PARAMETRIC TESTS.pptx
PPT
Presentation1group b
PDF
Biostatistics CH Lecture Pack
PPTX
Normality evaluation in a data
PDF
Statistics For Research With A Guide To Spss George Argyrous
PDF
UG_B.Sc._Psycology_11933 –PSYCHOLOGICAL STATISTICS.pdf
Applied statistics lecture_2
Kolmogorov-Smirnov Test.pptx
Testing Assumptions in repeated Measures Design using SPSS
article.pdf
Introduction to Educational statistics and measurement
Lec 5 - Normality Testing.pptx
Data Normality (1).pptx
Normality test on SPSS
Presentation 7.pptx
Stats- S.K. Mangal statistics textook in pdf
Stat 4 the normal distribution & steps of testing hypothesis
Statistics A Gentle Introduction 4th Edition Frederick L Coolidge
3. parametric assumptions
PARAMETRIC TESTS.pptx
Presentation1group b
Biostatistics CH Lecture Pack
Normality evaluation in a data
Statistics For Research With A Guide To Spss George Argyrous
UG_B.Sc._Psycology_11933 –PSYCHOLOGICAL STATISTICS.pdf
Ad

More from Daria Bogdanova (20)

PPT
Get started: Learning approaches
PDF
Template outline of_a_systematic_review_research_paper
PDF
Template of a_research_proposal
PDF
Research seminar lecture_apa_writing_and_references_students_full
PDF
Research seminar lecture_10_analysing_qualitative_data
PDF
Research seminar lecture_9_focus_groups
PDF
Research seminar lecture_9_focus_groups
PDF
Research seminar lecture_8_mixed_methods_research
PDF
Research seminar lecture_7_criteria_good_research
PDF
Research seminar lecture_6
PDF
Research seminar lecture_4_research_questions
PDF
Research seminar lecture_3_literature_review
PDF
Research seminar lecture_2_research_proposal__types_of_research_methods_stude...
PDF
Research seminar lecture_1_educational_research_proposal_&_apa
PDF
Lecture 8 guidelines_and_assignments
PDF
Lecture 7 guidelines_and_assignment
PDF
Lecture 6 guidelines_and_assignment
PDF
Lecture 5 practical_guidelines_assignments
PDF
Lecture 3 practical_guidelines_assignment
PDF
Lecture 1 practical_guidelines_assignment
Get started: Learning approaches
Template outline of_a_systematic_review_research_paper
Template of a_research_proposal
Research seminar lecture_apa_writing_and_references_students_full
Research seminar lecture_10_analysing_qualitative_data
Research seminar lecture_9_focus_groups
Research seminar lecture_9_focus_groups
Research seminar lecture_8_mixed_methods_research
Research seminar lecture_7_criteria_good_research
Research seminar lecture_6
Research seminar lecture_4_research_questions
Research seminar lecture_3_literature_review
Research seminar lecture_2_research_proposal__types_of_research_methods_stude...
Research seminar lecture_1_educational_research_proposal_&_apa
Lecture 8 guidelines_and_assignments
Lecture 7 guidelines_and_assignment
Lecture 6 guidelines_and_assignment
Lecture 5 practical_guidelines_assignments
Lecture 3 practical_guidelines_assignment
Lecture 1 practical_guidelines_assignment

Lecture 2 practical_guidelines_assignment

  • 1. Introduction to Applied Statistics and Applied Statistical Methods Practical guidelines Prof. Dr. Chang Zhu page 1 Table of Contents LECTURE 2.......................................................................................................................................................... 2 EXPLORING DATA/ASSUMPTION TESTING........................................................................................................ 2 DESCRIPTIVE STATISTICS................................................................................................................................ 2 NORMAL DISTRIBUTION TEST (the Kolmogorov-Smirnov (K-S) test) ............................................................ 3 ASSIGNMENT..................................................................................................................................................... 5
  • 2. Introduction to Applied Statistics and Applied Statistical Methods Practical guidelines Prof. Dr. Chang Zhu page 2 LECTURE 2 EXPLORING DATA/ASSUMPTION TESTING Before proceeding to certain kind of analysis, it is important that we should explore: • the characteristics of our data (mean, mode, median, variance, standard deviation, and range), • the distribution of your data, if they are normally distributed or not by (1) values of skewness and kurtosis (SPSS also provides histograms to visualize the distribution in the Descriptive Statistics Command) and (2) tests of normality, • the homogeneity of variance between groups (if we are to conduct analysis for groups). DESCRIPTIVE STATISTICS e.g. we want to know the characteristics and distribution of our data for the variable Intrinsic _Motivation_learn (the file named sample data 1.sav) In SPSS, choose Analyse > Descriptive Statistics > Frequencies Select the variable Intrinsic _Motivation_learn and move it to the Variable(s) box by clicking the button. Click on the Statistics button to access the dialog box and select the options of your preference. After finishing, click Continue. Click on the Charts button to access the Frequencies: Charts dialog box. Choose Histograms (Show normal curve on histogram), and click Continue to finish. On the main dialog box, click OK to run the analysis. In the Output document, you will see a table of descriptive statistics and a histogram with curve, based on which you will have an idea of the characteristics (visual) distribution of your data.
  • 3. Introduction to Applied Statistics and Applied Statistical Methods Practical guidelines Prof. Dr. Chang Zhu page 3 NORMAL DISTRIBUTION TEST (the Kolmogorov-Smirnov (K-S) test) To compare our data with a normally distributed data with the same mean and standard deviation, we can use the Kolmogorov-Smirnov (K-S) test and the Shapiro-Wilk test. Ex1. we want to know if all the scores for the variable Intrinsic_Motivation_learn are different from a normally distributed dataset. In SPSS, choose Analyse > Descriptive Statistics > Explore Select the variable Intrinsic_Motivation_learn and move it to the Dependent List box by clicking the button. Click on the Statistics button to access the dialog box and select the Descriptive button. After finishing, click Continue. Click on Plots and select the option Normality plots with tests, which will provide us with the Kolmogorov- Smirnov test and the Shapiro-Wilk test and the normal Q-Q plots. On the main dialog box, click OK to run the analysis. On the main dialog box, click OK to run the analysis. In the Output document, the most important table we should look at is the table labelled Tests of Normality. Tests of Normality Kolmogorov-Smirnova Shapiro-Wilk Statistic df Sig. Statistic df Sig. 1-5 .189 58 .000 .902 58 .000 a. Lilliefors Significance Correction According to the result, the Kolmogorov-Smirnov test and the Shapiro-Wilk test are highly significant, indicating that the distribution of scores for the variable Intrinsic_Motivation_learn is significantly different from a normal distribution. In other words, the distribution is not normal. Ex2. Let us now look at a data set selected from Field (2009), namely SPSSexam.sav, that includes data on student performance on SPSS exam. The data set contains four variables: exam (scores), computer (measure of computer literacy in percent), lecture (percentage of SPSS lectures attended), numeracy (a measure of numerical ability out of 15), and uni (the university of the participants, either Duncetown or Sussex).
  • 4. Introduction to Applied Statistics and Applied Statistical Methods Practical guidelines Prof. Dr. Chang Zhu page 4 Conduct the Kolmogorov-Smirnov (K-S) test for the variable exam (scores) for each of the two groups (Duncetown/Sussex university) and report the result by filling the missing information in the following table and statement. Tip: to conduct the K-S test separately for each group, we just need to move the variable uni to the Factor List before proceeding to the next step. Tests of Normality University Kolmogorov-Smirnova Shapiro-Wilk Statistic df Sig. Statistic df Sig. Percentage on SPSS exam Duncetown University .106 50 _____ .972 50 _____ Sussex University .073 50 _____ .984 50 _____ a. Lilliefors Significance Correction *. This is a lower bound of the true significance. The percentage on the SPSS exam, D (50) = 0.10, p (</>) ___ .05 and D (50) = 0.07, p (</>) ___ .05 are significantly (normal/non-normal) ________________ for both the Duncetown and Sussex groups, respectively. (Note: the test statistic for the K-S test is denoted by the letter D in papers).
  • 5. Introduction to Applied Statistics and Applied Statistical Methods Practical guidelines Prof. Dr. Chang Zhu page 5 ASSIGNMENT 1) Use your own data set, explore your data with descriptive statistics and normal distribution tests for 2 scale variables. Report the results in APA format. 2) Subscribe yourself to one of the groups in Pointcarre (based on the group formation in Assignment 1). For students who worked individually, you can also join one of the groups for those assignments that need group discussion. To do this, access the course Introduction to Applied Statistics and Statistical Methods, click on Course group and you will find all the possible groups that you can subscribe as a member (just subscribe to one group). Post your group’s answer to the given question assigned to your group as indicated below, to the forum labelled Exploring data, under the topic Questions and Answers/basic concepts in statistics and distributions When you post the group’s answer to the forum, remember to mention the question, e.g. Question: What is ..? Answer: …. (shortly state which part of the answer you still have doubt (if any) and need other group’s support? Extra credit is given if you can contribute to the answers of other group (giving critical feedback or provide more information on the issue for your peers) Group 1: How can we deal with outliers in case we detect them? Group 2: Under what circumstances should we be cautious about using the mean as a measure of central tendency? Group 3: What are dependent/independent variables? Are there any other ways to refer to dependent/independent variables in research papers? Group 4: What are the cut-off (limit) values for skewness and kurtosis? Group 5: For data to be normally distributed, what characteristics it should have?
  • 6. Introduction to Applied Statistics and Applied Statistical Methods Practical guidelines Prof. Dr. Chang Zhu page 6 Group 6: What is within-subjects design? What are the possible problems associated with this kind of design? Group 7: What are Type I and Type II error in statistics? Group 8: What is an effect size and how is it measured? Group 10: What is the standard error of the mean (SE)? How is it calculated? Group 11: What does a p-value generally tell us? How can p <.05 interpreted? Group 12: What is a z-score? How is it calculated? Group 13: In the equation to calculate the variance, we divide the sum of squared errors by the number of participants (N) minus 1. In this case, (N – 1) is referred to as degree of freedom. Can you explain the concept “degree of freedom”? Group 14: In addition to the K-S test, what are some other ways to examine the distribution of a data set?