SlideShare a Scribd company logo
1
Measures of Dispersion
Greg C Elvers, Ph.D.
2
Definition
Measures of dispersion are descriptive
statistics that describe how similar a set of
scores are to each other
The more similar the scores are to each other,
the lower the measure of dispersion will be
The less similar the scores are to each other, the
higher the measure of dispersion will be
In general, the more spread out a distribution is,
the larger the measure of dispersion will be
3
Measures of Dispersion
Which of the
distributions of scores
has the larger
dispersion?
0
25
50
75
100
125
1 2 3 4 5 6 7 8 9 10
0
25
50
75
100
125
1 2 3 4 5 6 7 8 9 10
The upper distribution
has more dispersion
because the scores are
more spread out
That is, they are less
similar to each other
4
Measures of Dispersion
There are three main measures of
dispersion:
The range
The semi-interquartile range (SIR)
Variance / standard deviation
5
The Range
The range is defined as the difference
between the largest score in the set of data
and the smallest score in the set of data, XL
- XS
What is the range of the following data:
4 8 1 6 6 2 9 3 6 9
The largest score (XL) is 9; the smallest
score (XS) is 1; the range is XL - XS = 9 - 1
= 8
6
When To Use the Range
The range is used when
you have ordinal data or
you are presenting your results to people with
little or no knowledge of statistics
The range is rarely used in scientific work
as it is fairly insensitive
It depends on only two scores in the set of data,
XL and XS
Two very different sets of data can have the
same range:
1 1 1 1 9 vs 1 3 5 7 9
7
The Semi-Interquartile Range
The semi-interquartile range (or SIR) is
defined as the difference of the first and
third quartiles divided by two
The first quartile is the 25th percentile
The third quartile is the 75th percentile
SIR = (Q3 - Q1) / 2
8
SIR Example
What is the SIR for the
data to the right?
25 % of the scores are
below 5
5 is the first quartile
25 % of the scores are
above 25
25 is the third quartile
SIR = (Q3 - Q1) / 2 = (25
- 5) / 2 = 10
2
4
6
 5 = 25th
%tile
8
10
12
14
20
30
 25 = 75th
%tile
60
9
When To Use the SIR
The SIR is often used with skewed data as it
is insensitive to the extreme scores
10
Variance
Variance is defined as the average of the
square deviations:
 
N
X
2
2  



11
What Does the Variance Formula
Mean?
First, it says to subtract the mean from each
of the scores
This difference is called a deviate or a deviation
score
The deviate tells us how far a given score is
from the typical, or average, score
Thus, the deviate is a measure of dispersion for
a given score
12
What Does the Variance Formula
Mean?
Why can’t we simply take the average of
the deviates? That is, why isn’t variance
defined as:
 
N
X
2  



This is not the
formula for
variance!
13
What Does the Variance Formula
Mean?
One of the definitions of the mean was that
it always made the sum of the scores minus
the mean equal to 0
Thus, the average of the deviates must be 0
since the sum of the deviates must equal 0
To avoid this problem, statisticians square
the deviate score prior to averaging them
Squaring the deviate score makes all the
squared scores positive
14
What Does the Variance Formula
Mean?
Variance is the mean of the squared
deviation scores
The larger the variance is, the more the
scores deviate, on average, away from the
mean
The smaller the variance is, the less the
scores deviate, on average, from the mean
15
Standard Deviation
When the deviate scores are squared in variance,
their unit of measure is squared as well
E.g. If people’s weights are measured in pounds,
then the variance of the weights would be expressed
in pounds2 (or squared pounds)
Since squared units of measure are often
awkward to deal with, the square root of variance
is often used instead
The standard deviation is the square root of variance
16
Standard Deviation
Standard deviation = variance
Variance = standard deviation2
17
Computational Formula
When calculating variance, it is often easier to use
a computational formula which is algebraically
equivalent to the definitional formula:
 
 
N
N
N X
X
X
2
2
2
2  


 



2 is the population variance, X is a score,  is the
population mean, and N is the number of scores
18
Computational Formula Example
X X2
X- (X-)2
9 81 2 4
8 64 1 1
6 36 -1 1
5 25 -2 4
8 64 1 1
6 36 -1 1
 = 42  = 306  = 0  = 12
19
Computational Formula Example
 
2
6
12
6
294
306
6
6
306
N
N
42
X
X
2
2
2
2











 
2
6
12
N
X
2
2



 


20
Variance of a Sample
Because the sample mean is not a perfect estimate
of the population mean, the formula for the
variance of a sample is slightly different from the
formula for the variance of a population:
 
1
N
X
X
s
2
2


 
s2 is the sample variance, X is a score, X is the
sample mean, and N is the number of scores
21
Measure of Skew
Skew is a measure of symmetry in the
distribution of scores
Positive
Skew
Negative Skew
Normal
(skew = 0)
22
Measure of Skew
The following formula can be used to
determine skew:
 
 
N
N
X
X
X
X
s 2
3
3
 
 

23
Measure of Skew
If s3 < 0, then the distribution has a negative
skew
If s3 > 0 then the distribution has a positive
skew
If s3 = 0 then the distribution is symmetrical
The more different s3 is from 0, the greater
the skew in the distribution
24
Kurtosis
(Not Related to Halitosis)
Kurtosis measures whether the scores are
spread out more or less than they would be
in a normal (Gaussian) distribution
Mesokurtic
(s4 = 3)
Leptokurtic (s4
> 3)
Platykurtic (s4
< 3)
25
Kurtosis
When the distribution is normally
distributed, its kurtosis equals 3 and it is
said to be mesokurtic
When the distribution is less spread out than
normal, its kurtosis is greater than 3 and it is
said to be leptokurtic
When the distribution is more spread out
than normal, its kurtosis is less than 3 and it
is said to be platykurtic
26
Measure of Kurtosis
The measure of kurtosis is given by:
 
N
N
X
X
X
X
s
4
2
4

 
















27
s2, s3, & s4
Collectively, the variance (s2), skew (s3),
and kurtosis (s4) describe the shape of the
distribution

More Related Content

PPT
UNIT III -Measures of Central Tendency 2.ppt
PPTX
Descriptive statistics
PDF
Measures of dispersion
PPT
2. measures of dis[persion
PPTX
Sec 1.3 collecting sample data
PPTX
Basics of Hypothesis Testing
PPTX
Frequency Distributions for Organizing and Summarizing
PPTX
Presentationofdata 120111034007-phpapp02
UNIT III -Measures of Central Tendency 2.ppt
Descriptive statistics
Measures of dispersion
2. measures of dis[persion
Sec 1.3 collecting sample data
Basics of Hypothesis Testing
Frequency Distributions for Organizing and Summarizing
Presentationofdata 120111034007-phpapp02

What's hot (20)

PPT
Central tendency
PPT
Descriptive statistics
PPT
Measure of dispersion by Neeraj Bhandari ( Surkhet.Nepal )
PPTX
Measure of Central Tendency
PPTX
Range, quartiles, and interquartile range
PDF
Practice Test 1
PPTX
Inter quartile range
PPTX
3.2 Measures of variation
PPTX
Regression analysis: Simple Linear Regression Multiple Linear Regression
PPTX
6.5 central limit
PPT
Data analysis
PPTX
Measures of dispersion
PDF
Statistics for Data Analytics
PPTX
Measures of Variation
PPT
Standard Deviation
PPTX
Basic Statistics in 1 hour.pptx
PPTX
COVARIANCE IN PROBABILITY
PPT
Box and whiskers power point
PPTX
Introduction to Linear Discriminant Analysis
PPT
Linear regression
Central tendency
Descriptive statistics
Measure of dispersion by Neeraj Bhandari ( Surkhet.Nepal )
Measure of Central Tendency
Range, quartiles, and interquartile range
Practice Test 1
Inter quartile range
3.2 Measures of variation
Regression analysis: Simple Linear Regression Multiple Linear Regression
6.5 central limit
Data analysis
Measures of dispersion
Statistics for Data Analytics
Measures of Variation
Standard Deviation
Basic Statistics in 1 hour.pptx
COVARIANCE IN PROBABILITY
Box and whiskers power point
Introduction to Linear Discriminant Analysis
Linear regression
Ad

Similar to Unit iii measures of dispersion (2) (20)

PPT
UNIT III -Measures of Dispersion (2).ppt
PPT
Dispersion
PPT
dispersion ppt.ppt
PPTX
Measures of dispersion
PPTX
Measures of Dispersion.pptx
PDF
Lesson 5.pdf ....probability and statistics
PPT
Dispersion according to geography statistic.ppt
PPT
250380111-Measures-of-Dispersion-ppt.ppt
PPT
Measure of Dispersion - Grade 8 Statistics.ppt
PPTX
Lecture. Introduction to Statistics (Measures of Dispersion).pptx
PPTX
UNIT IV probability and standard distribution
PPTX
variance
PPT
ch-4-measures-of-variability-11 2.ppt for nursing
PPT
measures-of-variability-11.ppt
PPT
Measures of dispersion
PDF
Measures of dispersion discuss 2.2
PPT
Measures of dispersion
PPTX
State presentation2
DOCX
CJ 301 – Measures of DispersionVariability Think back to the .docx
PPTX
ders 3 Unit root test.pptx
UNIT III -Measures of Dispersion (2).ppt
Dispersion
dispersion ppt.ppt
Measures of dispersion
Measures of Dispersion.pptx
Lesson 5.pdf ....probability and statistics
Dispersion according to geography statistic.ppt
250380111-Measures-of-Dispersion-ppt.ppt
Measure of Dispersion - Grade 8 Statistics.ppt
Lecture. Introduction to Statistics (Measures of Dispersion).pptx
UNIT IV probability and standard distribution
variance
ch-4-measures-of-variability-11 2.ppt for nursing
measures-of-variability-11.ppt
Measures of dispersion
Measures of dispersion discuss 2.2
Measures of dispersion
State presentation2
CJ 301 – Measures of DispersionVariability Think back to the .docx
ders 3 Unit root test.pptx
Ad

Recently uploaded (20)

PPTX
BOWEL ELIMINATION FACTORS AFFECTING AND TYPES
PPTX
Institutional Correction lecture only . . .
PDF
Saundersa Comprehensive Review for the NCLEX-RN Examination.pdf
PPTX
Renaissance Architecture: A Journey from Faith to Humanism
PDF
TR - Agricultural Crops Production NC III.pdf
PPTX
Week 4 Term 3 Study Techniques revisited.pptx
PDF
Mark Klimek Lecture Notes_240423 revision books _173037.pdf
PDF
Supply Chain Operations Speaking Notes -ICLT Program
PDF
O5-L3 Freight Transport Ops (International) V1.pdf
PDF
RMMM.pdf make it easy to upload and study
PPTX
IMMUNITY IMMUNITY refers to protection against infection, and the immune syst...
PDF
Origin of periodic table-Mendeleev’s Periodic-Modern Periodic table
PDF
Complications of Minimal Access Surgery at WLH
PDF
Basic Mud Logging Guide for educational purpose
PPTX
Introduction to Child Health Nursing – Unit I | Child Health Nursing I | B.Sc...
PPTX
PPT- ENG7_QUARTER1_LESSON1_WEEK1. IMAGERY -DESCRIPTIONS pptx.pptx
PPTX
master seminar digital applications in india
PDF
grade 11-chemistry_fetena_net_5883.pdf teacher guide for all student
PPTX
Final Presentation General Medicine 03-08-2024.pptx
PPTX
Cell Types and Its function , kingdom of life
BOWEL ELIMINATION FACTORS AFFECTING AND TYPES
Institutional Correction lecture only . . .
Saundersa Comprehensive Review for the NCLEX-RN Examination.pdf
Renaissance Architecture: A Journey from Faith to Humanism
TR - Agricultural Crops Production NC III.pdf
Week 4 Term 3 Study Techniques revisited.pptx
Mark Klimek Lecture Notes_240423 revision books _173037.pdf
Supply Chain Operations Speaking Notes -ICLT Program
O5-L3 Freight Transport Ops (International) V1.pdf
RMMM.pdf make it easy to upload and study
IMMUNITY IMMUNITY refers to protection against infection, and the immune syst...
Origin of periodic table-Mendeleev’s Periodic-Modern Periodic table
Complications of Minimal Access Surgery at WLH
Basic Mud Logging Guide for educational purpose
Introduction to Child Health Nursing – Unit I | Child Health Nursing I | B.Sc...
PPT- ENG7_QUARTER1_LESSON1_WEEK1. IMAGERY -DESCRIPTIONS pptx.pptx
master seminar digital applications in india
grade 11-chemistry_fetena_net_5883.pdf teacher guide for all student
Final Presentation General Medicine 03-08-2024.pptx
Cell Types and Its function , kingdom of life

Unit iii measures of dispersion (2)

  • 2. 2 Definition Measures of dispersion are descriptive statistics that describe how similar a set of scores are to each other The more similar the scores are to each other, the lower the measure of dispersion will be The less similar the scores are to each other, the higher the measure of dispersion will be In general, the more spread out a distribution is, the larger the measure of dispersion will be
  • 3. 3 Measures of Dispersion Which of the distributions of scores has the larger dispersion? 0 25 50 75 100 125 1 2 3 4 5 6 7 8 9 10 0 25 50 75 100 125 1 2 3 4 5 6 7 8 9 10 The upper distribution has more dispersion because the scores are more spread out That is, they are less similar to each other
  • 4. 4 Measures of Dispersion There are three main measures of dispersion: The range The semi-interquartile range (SIR) Variance / standard deviation
  • 5. 5 The Range The range is defined as the difference between the largest score in the set of data and the smallest score in the set of data, XL - XS What is the range of the following data: 4 8 1 6 6 2 9 3 6 9 The largest score (XL) is 9; the smallest score (XS) is 1; the range is XL - XS = 9 - 1 = 8
  • 6. 6 When To Use the Range The range is used when you have ordinal data or you are presenting your results to people with little or no knowledge of statistics The range is rarely used in scientific work as it is fairly insensitive It depends on only two scores in the set of data, XL and XS Two very different sets of data can have the same range: 1 1 1 1 9 vs 1 3 5 7 9
  • 7. 7 The Semi-Interquartile Range The semi-interquartile range (or SIR) is defined as the difference of the first and third quartiles divided by two The first quartile is the 25th percentile The third quartile is the 75th percentile SIR = (Q3 - Q1) / 2
  • 8. 8 SIR Example What is the SIR for the data to the right? 25 % of the scores are below 5 5 is the first quartile 25 % of the scores are above 25 25 is the third quartile SIR = (Q3 - Q1) / 2 = (25 - 5) / 2 = 10 2 4 6  5 = 25th %tile 8 10 12 14 20 30  25 = 75th %tile 60
  • 9. 9 When To Use the SIR The SIR is often used with skewed data as it is insensitive to the extreme scores
  • 10. 10 Variance Variance is defined as the average of the square deviations:   N X 2 2     
  • 11. 11 What Does the Variance Formula Mean? First, it says to subtract the mean from each of the scores This difference is called a deviate or a deviation score The deviate tells us how far a given score is from the typical, or average, score Thus, the deviate is a measure of dispersion for a given score
  • 12. 12 What Does the Variance Formula Mean? Why can’t we simply take the average of the deviates? That is, why isn’t variance defined as:   N X 2      This is not the formula for variance!
  • 13. 13 What Does the Variance Formula Mean? One of the definitions of the mean was that it always made the sum of the scores minus the mean equal to 0 Thus, the average of the deviates must be 0 since the sum of the deviates must equal 0 To avoid this problem, statisticians square the deviate score prior to averaging them Squaring the deviate score makes all the squared scores positive
  • 14. 14 What Does the Variance Formula Mean? Variance is the mean of the squared deviation scores The larger the variance is, the more the scores deviate, on average, away from the mean The smaller the variance is, the less the scores deviate, on average, from the mean
  • 15. 15 Standard Deviation When the deviate scores are squared in variance, their unit of measure is squared as well E.g. If people’s weights are measured in pounds, then the variance of the weights would be expressed in pounds2 (or squared pounds) Since squared units of measure are often awkward to deal with, the square root of variance is often used instead The standard deviation is the square root of variance
  • 16. 16 Standard Deviation Standard deviation = variance Variance = standard deviation2
  • 17. 17 Computational Formula When calculating variance, it is often easier to use a computational formula which is algebraically equivalent to the definitional formula:     N N N X X X 2 2 2 2          2 is the population variance, X is a score,  is the population mean, and N is the number of scores
  • 18. 18 Computational Formula Example X X2 X- (X-)2 9 81 2 4 8 64 1 1 6 36 -1 1 5 25 -2 4 8 64 1 1 6 36 -1 1  = 42  = 306  = 0  = 12
  • 19. 19 Computational Formula Example   2 6 12 6 294 306 6 6 306 N N 42 X X 2 2 2 2              2 6 12 N X 2 2       
  • 20. 20 Variance of a Sample Because the sample mean is not a perfect estimate of the population mean, the formula for the variance of a sample is slightly different from the formula for the variance of a population:   1 N X X s 2 2     s2 is the sample variance, X is a score, X is the sample mean, and N is the number of scores
  • 21. 21 Measure of Skew Skew is a measure of symmetry in the distribution of scores Positive Skew Negative Skew Normal (skew = 0)
  • 22. 22 Measure of Skew The following formula can be used to determine skew:     N N X X X X s 2 3 3     
  • 23. 23 Measure of Skew If s3 < 0, then the distribution has a negative skew If s3 > 0 then the distribution has a positive skew If s3 = 0 then the distribution is symmetrical The more different s3 is from 0, the greater the skew in the distribution
  • 24. 24 Kurtosis (Not Related to Halitosis) Kurtosis measures whether the scores are spread out more or less than they would be in a normal (Gaussian) distribution Mesokurtic (s4 = 3) Leptokurtic (s4 > 3) Platykurtic (s4 < 3)
  • 25. 25 Kurtosis When the distribution is normally distributed, its kurtosis equals 3 and it is said to be mesokurtic When the distribution is less spread out than normal, its kurtosis is greater than 3 and it is said to be leptokurtic When the distribution is more spread out than normal, its kurtosis is less than 3 and it is said to be platykurtic
  • 26. 26 Measure of Kurtosis The measure of kurtosis is given by:   N N X X X X s 4 2 4                   
  • 27. 27 s2, s3, & s4 Collectively, the variance (s2), skew (s3), and kurtosis (s4) describe the shape of the distribution