SlideShare a Scribd company logo
Introduction to statistics-3

 Measures of variability or spread
Summary Last Class
• Measures of central Tendency
  – Mean
  – Median
  – Mode
Mode

• The mode can be found with any scale of
  measurement; it is the only measure of
  typicality that can be used with a nominal
  scale.
Median
• The median can be used with ordinal, as well
  as interval/ratio, scales. It can even be used
  with scales that have open-ended categories
  at either end (e.g., 10 or more).
• It is not greatly affected by outliers, and it can
  be a good descriptive statistic for a strongly
  skewed distribution.
Mean
• The mean can only be used with interval or
  ratio scales. It is affected by every score in the
  distribution, and it can be strongly affected by
  outliers.
• It may not be a good descriptive statistic for a
  skewed distribution, but it plays an important
  role in advanced statistics.
Key Terms
•   Variability
•   Measures of Variability
•   Variance
•   Standard Deviation
•   Normal Distribution
Variability
• The difference in data or in a set of scores
• Populations can be described as homogenous
  or heterogeneous based on the level of
  variability in the data set
Measures of Variability
• Provide an estimate of how much scores in a
  distribution vary from an average score. The
  usual average is the mean
• Range
• Variance
• Standard Deviation
Range
• A single number that represents the spread of
  the data

• Upper real limit of Xmax- Lower real limit of
  Xmin
Variance
• Variance is a measure of variability
• Standard deviation is the square root of
  variance
Steps to calculate SD
1. Determine the Mean= x̄
2. Determine the deviations (X-x̄)
3. Square these (X-x̄)2
4. Add the squares ∑ (X-x̄)2
5. Divide by total numbers less one ∑ (X-x̄)2/n-1
6. Square root of result is Standard Deviation

       Standard Deviation=√∑ (X-x̄)2/n-1
Variance and Standard Deviation
            Population                  Sample

Mean =µ                  Mean= x̄




Variance                 Variance




Standard Deviation       Standard Deviation
Worked Example (appspot.com)
Step 1- Calculate mean
Step 2 Calculate Deviation
Step 3 Sum Mean Square Deviation
Step 4 Calculate standard deviation
Introduction to statistics 3
Range

• The range tells you the largest difference that
  you have among your scores. It is strongly
  affected by outliers, and being based on only
  two scores, it can be very unreliable.
Mean Deviation
• The mean deviation, and the two measures
  that follow, can only be used with
  interval/ratio scales.
• It is a good descriptive measure, which is less
  affected by outliers than the standard
  deviation, but it is not used in advanced
  statistics.
Variance
• The variance is not appropriate for descriptive
  purposes, but it plays an important role in
  advanced statistics.
Standard Deviation
• The standard deviation is a good descriptive
  measure of variability, although it can be
  affected strongly by outliers. It plays an
  important role in advanced statistics.
Scaled Scores
• A key element of statistics is making
  comparison between variables and
  populations.
Changing the mean and standard
      deviation of a distribution
• If a constant is added to each score in a
  distribution the mean for the distribution
  changes but the variance and standard
  deviation does not.

• Adding a score changes the sum of all the
  scores but not the spread or shape of a
  distribution
Adding or Subtracting a constant
• When you add or subtract a constant from
  each score in a distribution the mean changes
  by the amount added or subtracted but the
  standard deviation and variance remain the
  same
 x̄ new= x̄ original +/- constant
 s new= s original
Multiplying or dividing by a constant
x̄ new= x̄ original x or / by the constant
 s new= s original x or / by the constant
Z scores
• The z score provides the exact position of a
  score in its distribution.
• This allows us to compare scores from
  different distributions
Z-score distribution


• The z score distribution has a mean of 0 and a
  standard deviation of 1
Z-Score formula
Z score for sample



       Xi X
z
         S
Example Z Score
• For scores above the mean, the z score has a
  positive sign. Example + 1.5z.
• Below the mean, the z score has a minus sign.
  Example - 0.5z.
• Calculate Z score for blood pressure of 140 if
  the sample mean is 110 and the standard
  deviation is 10
•    Z = 140 – 110 / 10 = 3

                                               33
Comparing Scores from Different
            Distributions
• Interpreting a raw score requires additional
  information about the entire distribution. In most
  situations, we need some idea about the mean score
  and an indication of how much the scores vary.
• For example, assume that an individual took two
  tests in reading and mathematics. The reading score
  was 32 and mathematics was 48. Is it correct to say
  that performance in mathematics was better than in
  reading?



                                                    34
Z Scores Help in Comparisons
• Not without additional information. One
  method to interpret the raw score is to
  transform it to a z score.
• The advantage of the z score transformation is
  that it takes into account both the mean value
  and the variability in a set of raw scores.



                                               35
Example 1
                                          Dave in Statistics:
15

     Statistics                Calculus   (50 - 40)/10 = 1
                                          (one SD above the
                                           mean)
10



                                          Dave in Calculus
5




                                          (50 - 60)/10 = -1
                                          (one SD below the
                                           mean)
0




     0      20406080100

     Mean Statistics   G RADE
                            Mean
        = 40            Calculus = 60
Example 2
                                                   An example where the
                                                    means are identical, but
0 5 10 15 20 25 30

                                                    the two sets of scores
                         Statistics                 have different spreads


                                                   Dave’s Stats Z-score
                                                   (50-40)/5 = 2


                                        Calculus   Dave’s Calc Z-score
                                                   (50-40)/20 = .5

                     0      20406080100

                                      G RA DE
Thee Properties of Standard Scores
• 1. The mean of a set of z-scores is always
  zero
Properties of Standard Scores
• Why?

• The mean has been subtracted from each
  score. Therefore, following the definition
  of the mean as a balancing point, the sum
  (and, accordingly, the average) of all the
  deviation scores must be zero.
Three Properties of Standard Scores

• 2. The SD of a set of standardized scores is
  always 1
Three Properties of Standard Scores

• 3. The distribution of a set of standardized
  scores has the same shape as the
  unstandardized scores
  – beware of the “normalization”
    misinterpretation
The shape is the same
                  (but the scaling or metric is different)


           UNSTANDARDIZED                                  STANDARDIZED




                                         0.5
6




                                         0.4
4




                                         0.3
                                         0.2
2




                                         0.1
                                         0.0
0




    0.4   0.6        0.8      1.0              -6     -4         -2       0   2
Two Advantages of Standard Scores
1. We can use standard scores to find centile
  scores: the proportion of people with scores
  less than or equal to a particular score.
  Centile scores are intuitive ways of
  summarizing a person’s location in a larger set
  of scores.
The area under a normal curve
0. 0.1 0.2 0.3 0.4                                              50%




                                     34%    34%

                               14%                 14%

                          2%                               2%
                     -4        -2          0               2          4

                                        SCO RE
Two Advantages of Standard Scores
2. Standard scores provides a way to standardize
  or equate different metrics. We can now
  interpret Dave’s scores in Statistics and
  Calculus on the same metric (the z-score
  metric). (Each score comes from a distribution
  with the same mean [zero] and the same
  standard deviation [1].)
Two Disadvantages of Standard Scores
1. Because a person’s score is expressed relative to
   the group (X - M), the same person can have
   different z-scores when assessed in different
   samples

Example: If Dave had taken his Calculus exam in a
  class in which everyone knew math well his z-
  score would be well below the mean. If the class
  didn’t know math very well, however, Dave
  would be above the mean. Dave’s score depends
  on everyone else’s scores.
Two Disadvantages of Standard Scores
2. If the absolute score is meaningful or of
  psychological interest, it will be obscured by
  transforming it to a relative metric.
Properties of the Mean, Standard
  Deviation, and Standardized Scores

Mean. Adding or subtracting a constant from
the scores changes the mean in the same way.
Multiplying or dividing by a constant also
changes the mean in the same way. The sum of
squared deviations is smaller around the mean
than any other point in the distribution.
Standard Deviation
• Standard deviation. Adding or subtracting a
  constant from the scores does not change the
  standard deviation. However, multiplying or
  dividing by a constant means that the
  standard deviation will be multiplied or
  divided by the same constant. The standard
  deviation is smaller when calculated around
  the mean than any other point in the
  distribution.
Standardized Scores

• Standardized scores. Adding, subtracting,
  multiplying or dividing the scores by a
  constant does not change the standardized
  scores. The mean of a set of z scores is zero,
  and the standard deviation is 1.0.

More Related Content

PPT
Chapter 6. Mining Frequent Patterns, Associations and Correlations Basic Conc...
PPT
02 data
PDF
Bayesian classification
ODP
NAIVE BAYES CLASSIFIER
PDF
Bayesian learning
PPTX
Logistic Regression.pptx
PPT
Chapter 8. Classification Basic Concepts.ppt
PPT
Data preprocessing
Chapter 6. Mining Frequent Patterns, Associations and Correlations Basic Conc...
02 data
Bayesian classification
NAIVE BAYES CLASSIFIER
Bayesian learning
Logistic Regression.pptx
Chapter 8. Classification Basic Concepts.ppt
Data preprocessing

What's hot (20)

PPTX
Machine learning with ADA Boost
PPTX
ML - Multiple Linear Regression
PPTX
Exploratory data analysis with Python
PPT
Data preprocessing
PPTX
Exploratory data analysis
PDF
Outlier Detection
PDF
Data Science - Part XII - Ridge Regression, LASSO, and Elastic Nets
PDF
Singular Value Decompostion (SVD): Worked example 3
PDF
Supervised and Unsupervised Machine Learning
PPTX
Implement principal component analysis (PCA) in python from scratch
DOC
Data Mining: Data Preprocessing
PPTX
Data warehouse architecture
PDF
Understanding Bagging and Boosting
PDF
Data warehousing unit 1
PPTX
PDF
Statistics for data scientists
PPTX
Statistics for data science
PPTX
Exploring Data
PPT
Naive bayes
PPTX
big data and machine learning ppt.pptx
Machine learning with ADA Boost
ML - Multiple Linear Regression
Exploratory data analysis with Python
Data preprocessing
Exploratory data analysis
Outlier Detection
Data Science - Part XII - Ridge Regression, LASSO, and Elastic Nets
Singular Value Decompostion (SVD): Worked example 3
Supervised and Unsupervised Machine Learning
Implement principal component analysis (PCA) in python from scratch
Data Mining: Data Preprocessing
Data warehouse architecture
Understanding Bagging and Boosting
Data warehousing unit 1
Statistics for data scientists
Statistics for data science
Exploring Data
Naive bayes
big data and machine learning ppt.pptx
Ad

Viewers also liked (20)

PPT
Introduction To Statistics
PPTX
Properties of Standard Deviation
PPTX
Statistics
PPTX
Applications of central tendency
PPTX
Chinese new year 2016
PPTX
Introduction to Standardized Work
PPTX
Stdw for managers and supervisors october 2016
PDF
Lean Standard or Standardized Work Training Module
PPT
Taiko drumming
PPT
statistics in pharmaceutical sciences
PPT
Standardized Work
PPT
Happy Lantern Festival
PPT
Present Loy Krathong Festival
PPT
Lantern Festival
PPTX
Taiko Drum Festival
PPTX
Balinese dance festival of indonesia
PDF
What Is Statistics
PPT
Topic 4 chapter 5 part 3 enzymes - limiting factors
PPT
Statistics lesson 1
PPTX
Kabuki of Japan by Group One VIII - Acapulco (1)
Introduction To Statistics
Properties of Standard Deviation
Statistics
Applications of central tendency
Chinese new year 2016
Introduction to Standardized Work
Stdw for managers and supervisors october 2016
Lean Standard or Standardized Work Training Module
Taiko drumming
statistics in pharmaceutical sciences
Standardized Work
Happy Lantern Festival
Present Loy Krathong Festival
Lantern Festival
Taiko Drum Festival
Balinese dance festival of indonesia
What Is Statistics
Topic 4 chapter 5 part 3 enzymes - limiting factors
Statistics lesson 1
Kabuki of Japan by Group One VIII - Acapulco (1)
Ad

Similar to Introduction to statistics 3 (20)

PPTX
Variability, the normal distribution and converted scores
PPT
Day 4 normal curve and standard scores
PPTX
Topic 8a Basic Statistics
PPTX
Chapters 2 4
PPT
Describing quantitative data with numbers
PPTX
Statistics
PPTX
Z scores lecture chapter 2 and 4
PPT
statistics
PPTX
Chapters 2 & 4
PPTX
Chapters 2 & 4
PPT
lecture No, 4 center tendendy and dispersion.ppt
PPT
Psy295 chap05 (1)
PPTX
m2_2_variation_z_scores.pptx
PPT
T7 data analysis
PPT
Univariant Descriptive Stats.skewness(3).ppt
PDF
Unit-I Measures of Dispersion- Biostatistics - Ravinandan A P.pdf
PPT
Standard Scores
PPTX
Measures of Variability
PPTX
measure of dispersion in stats whichncomes in first chapter
Variability, the normal distribution and converted scores
Day 4 normal curve and standard scores
Topic 8a Basic Statistics
Chapters 2 4
Describing quantitative data with numbers
Statistics
Z scores lecture chapter 2 and 4
statistics
Chapters 2 & 4
Chapters 2 & 4
lecture No, 4 center tendendy and dispersion.ppt
Psy295 chap05 (1)
m2_2_variation_z_scores.pptx
T7 data analysis
Univariant Descriptive Stats.skewness(3).ppt
Unit-I Measures of Dispersion- Biostatistics - Ravinandan A P.pdf
Standard Scores
Measures of Variability
measure of dispersion in stats whichncomes in first chapter

Recently uploaded (20)

PDF
Saundersa Comprehensive Review for the NCLEX-RN Examination.pdf
PPTX
Introduction_to_Human_Anatomy_and_Physiology_for_B.Pharm.pptx
PPTX
Final Presentation General Medicine 03-08-2024.pptx
PPTX
Pharma ospi slides which help in ospi learning
PDF
ANTIBIOTICS.pptx.pdf………………… xxxxxxxxxxxxx
PDF
3rd Neelam Sanjeevareddy Memorial Lecture.pdf
PDF
Module 4: Burden of Disease Tutorial Slides S2 2025
PDF
FourierSeries-QuestionsWithAnswers(Part-A).pdf
PDF
Sports Quiz easy sports quiz sports quiz
PPTX
PPT- ENG7_QUARTER1_LESSON1_WEEK1. IMAGERY -DESCRIPTIONS pptx.pptx
PDF
102 student loan defaulters named and shamed – Is someone you know on the list?
PDF
Black Hat USA 2025 - Micro ICS Summit - ICS/OT Threat Landscape
PDF
RMMM.pdf make it easy to upload and study
PPTX
master seminar digital applications in india
PPTX
IMMUNITY IMMUNITY refers to protection against infection, and the immune syst...
PDF
Physiotherapy_for_Respiratory_and_Cardiac_Problems WEBBER.pdf
PPTX
Cell Structure & Organelles in detailed.
PDF
Anesthesia in Laparoscopic Surgery in India
PDF
Abdominal Access Techniques with Prof. Dr. R K Mishra
PPTX
Microbial diseases, their pathogenesis and prophylaxis
Saundersa Comprehensive Review for the NCLEX-RN Examination.pdf
Introduction_to_Human_Anatomy_and_Physiology_for_B.Pharm.pptx
Final Presentation General Medicine 03-08-2024.pptx
Pharma ospi slides which help in ospi learning
ANTIBIOTICS.pptx.pdf………………… xxxxxxxxxxxxx
3rd Neelam Sanjeevareddy Memorial Lecture.pdf
Module 4: Burden of Disease Tutorial Slides S2 2025
FourierSeries-QuestionsWithAnswers(Part-A).pdf
Sports Quiz easy sports quiz sports quiz
PPT- ENG7_QUARTER1_LESSON1_WEEK1. IMAGERY -DESCRIPTIONS pptx.pptx
102 student loan defaulters named and shamed – Is someone you know on the list?
Black Hat USA 2025 - Micro ICS Summit - ICS/OT Threat Landscape
RMMM.pdf make it easy to upload and study
master seminar digital applications in india
IMMUNITY IMMUNITY refers to protection against infection, and the immune syst...
Physiotherapy_for_Respiratory_and_Cardiac_Problems WEBBER.pdf
Cell Structure & Organelles in detailed.
Anesthesia in Laparoscopic Surgery in India
Abdominal Access Techniques with Prof. Dr. R K Mishra
Microbial diseases, their pathogenesis and prophylaxis

Introduction to statistics 3

  • 1. Introduction to statistics-3 Measures of variability or spread
  • 2. Summary Last Class • Measures of central Tendency – Mean – Median – Mode
  • 3. Mode • The mode can be found with any scale of measurement; it is the only measure of typicality that can be used with a nominal scale.
  • 4. Median • The median can be used with ordinal, as well as interval/ratio, scales. It can even be used with scales that have open-ended categories at either end (e.g., 10 or more). • It is not greatly affected by outliers, and it can be a good descriptive statistic for a strongly skewed distribution.
  • 5. Mean • The mean can only be used with interval or ratio scales. It is affected by every score in the distribution, and it can be strongly affected by outliers. • It may not be a good descriptive statistic for a skewed distribution, but it plays an important role in advanced statistics.
  • 6. Key Terms • Variability • Measures of Variability • Variance • Standard Deviation • Normal Distribution
  • 7. Variability • The difference in data or in a set of scores • Populations can be described as homogenous or heterogeneous based on the level of variability in the data set
  • 8. Measures of Variability • Provide an estimate of how much scores in a distribution vary from an average score. The usual average is the mean
  • 9. • Range • Variance • Standard Deviation
  • 10. Range • A single number that represents the spread of the data • Upper real limit of Xmax- Lower real limit of Xmin
  • 11. Variance • Variance is a measure of variability • Standard deviation is the square root of variance
  • 12. Steps to calculate SD 1. Determine the Mean= x̄ 2. Determine the deviations (X-x̄) 3. Square these (X-x̄)2 4. Add the squares ∑ (X-x̄)2
  • 13. 5. Divide by total numbers less one ∑ (X-x̄)2/n-1 6. Square root of result is Standard Deviation Standard Deviation=√∑ (X-x̄)2/n-1
  • 14. Variance and Standard Deviation Population Sample Mean =µ Mean= x̄ Variance Variance Standard Deviation Standard Deviation
  • 17. Step 2 Calculate Deviation
  • 18. Step 3 Sum Mean Square Deviation
  • 19. Step 4 Calculate standard deviation
  • 21. Range • The range tells you the largest difference that you have among your scores. It is strongly affected by outliers, and being based on only two scores, it can be very unreliable.
  • 22. Mean Deviation • The mean deviation, and the two measures that follow, can only be used with interval/ratio scales. • It is a good descriptive measure, which is less affected by outliers than the standard deviation, but it is not used in advanced statistics.
  • 23. Variance • The variance is not appropriate for descriptive purposes, but it plays an important role in advanced statistics.
  • 24. Standard Deviation • The standard deviation is a good descriptive measure of variability, although it can be affected strongly by outliers. It plays an important role in advanced statistics.
  • 25. Scaled Scores • A key element of statistics is making comparison between variables and populations.
  • 26. Changing the mean and standard deviation of a distribution • If a constant is added to each score in a distribution the mean for the distribution changes but the variance and standard deviation does not. • Adding a score changes the sum of all the scores but not the spread or shape of a distribution
  • 27. Adding or Subtracting a constant • When you add or subtract a constant from each score in a distribution the mean changes by the amount added or subtracted but the standard deviation and variance remain the same x̄ new= x̄ original +/- constant s new= s original
  • 28. Multiplying or dividing by a constant x̄ new= x̄ original x or / by the constant s new= s original x or / by the constant
  • 29. Z scores • The z score provides the exact position of a score in its distribution. • This allows us to compare scores from different distributions
  • 30. Z-score distribution • The z score distribution has a mean of 0 and a standard deviation of 1
  • 32. Z score for sample Xi X z S
  • 33. Example Z Score • For scores above the mean, the z score has a positive sign. Example + 1.5z. • Below the mean, the z score has a minus sign. Example - 0.5z. • Calculate Z score for blood pressure of 140 if the sample mean is 110 and the standard deviation is 10 • Z = 140 – 110 / 10 = 3 33
  • 34. Comparing Scores from Different Distributions • Interpreting a raw score requires additional information about the entire distribution. In most situations, we need some idea about the mean score and an indication of how much the scores vary. • For example, assume that an individual took two tests in reading and mathematics. The reading score was 32 and mathematics was 48. Is it correct to say that performance in mathematics was better than in reading? 34
  • 35. Z Scores Help in Comparisons • Not without additional information. One method to interpret the raw score is to transform it to a z score. • The advantage of the z score transformation is that it takes into account both the mean value and the variability in a set of raw scores. 35
  • 36. Example 1 Dave in Statistics: 15 Statistics Calculus (50 - 40)/10 = 1 (one SD above the mean) 10 Dave in Calculus 5 (50 - 60)/10 = -1 (one SD below the mean) 0 0 20406080100 Mean Statistics G RADE Mean = 40 Calculus = 60
  • 37. Example 2 An example where the means are identical, but 0 5 10 15 20 25 30 the two sets of scores Statistics have different spreads Dave’s Stats Z-score (50-40)/5 = 2 Calculus Dave’s Calc Z-score (50-40)/20 = .5 0 20406080100 G RA DE
  • 38. Thee Properties of Standard Scores • 1. The mean of a set of z-scores is always zero
  • 39. Properties of Standard Scores • Why? • The mean has been subtracted from each score. Therefore, following the definition of the mean as a balancing point, the sum (and, accordingly, the average) of all the deviation scores must be zero.
  • 40. Three Properties of Standard Scores • 2. The SD of a set of standardized scores is always 1
  • 41. Three Properties of Standard Scores • 3. The distribution of a set of standardized scores has the same shape as the unstandardized scores – beware of the “normalization” misinterpretation
  • 42. The shape is the same (but the scaling or metric is different) UNSTANDARDIZED STANDARDIZED 0.5 6 0.4 4 0.3 0.2 2 0.1 0.0 0 0.4 0.6 0.8 1.0 -6 -4 -2 0 2
  • 43. Two Advantages of Standard Scores 1. We can use standard scores to find centile scores: the proportion of people with scores less than or equal to a particular score. Centile scores are intuitive ways of summarizing a person’s location in a larger set of scores.
  • 44. The area under a normal curve 0. 0.1 0.2 0.3 0.4 50% 34% 34% 14% 14% 2% 2% -4 -2 0 2 4 SCO RE
  • 45. Two Advantages of Standard Scores 2. Standard scores provides a way to standardize or equate different metrics. We can now interpret Dave’s scores in Statistics and Calculus on the same metric (the z-score metric). (Each score comes from a distribution with the same mean [zero] and the same standard deviation [1].)
  • 46. Two Disadvantages of Standard Scores 1. Because a person’s score is expressed relative to the group (X - M), the same person can have different z-scores when assessed in different samples Example: If Dave had taken his Calculus exam in a class in which everyone knew math well his z- score would be well below the mean. If the class didn’t know math very well, however, Dave would be above the mean. Dave’s score depends on everyone else’s scores.
  • 47. Two Disadvantages of Standard Scores 2. If the absolute score is meaningful or of psychological interest, it will be obscured by transforming it to a relative metric.
  • 48. Properties of the Mean, Standard Deviation, and Standardized Scores Mean. Adding or subtracting a constant from the scores changes the mean in the same way. Multiplying or dividing by a constant also changes the mean in the same way. The sum of squared deviations is smaller around the mean than any other point in the distribution.
  • 49. Standard Deviation • Standard deviation. Adding or subtracting a constant from the scores does not change the standard deviation. However, multiplying or dividing by a constant means that the standard deviation will be multiplied or divided by the same constant. The standard deviation is smaller when calculated around the mean than any other point in the distribution.
  • 50. Standardized Scores • Standardized scores. Adding, subtracting, multiplying or dividing the scores by a constant does not change the standardized scores. The mean of a set of z scores is zero, and the standard deviation is 1.0.