SlideShare a Scribd company logo
Level of Measurement:
        Nominal, Ordinal, Ratio, Interval
        Frequency tables to distributions
 Central Tendency: Mode, Median, Mean
Dispersion: Variance, Standard Deviation
   Nominal – categorical data ; numbers represent
    labels or categories of data but have no
    quantitative value e.g. demographic data
   Ordinal – ranked/ordered data ; numbers rank or
    order people or objects based on the attribute
    being measured
   Interval – points on a scale are an equal distance
    apart ; scale does not contain an absolute zero
    point
   Ratio – points on a scale are an equal distance
    apart and there is an absolute zero point
Basic stat review
   Simple depiction of all the data
   Graphic — easy to understand
   Problems
     Not always precisely measured
     Not summarized in one number or datum
Observation       Frequency
65            1
70            2
75            3
80            4
85            3
90            2
95            1
4

            3
Frequency
            2

            1

                65   70   75      80        85   90   95

                               Test Score
Two key characteristics of a frequency distribution
  are especially important when summarizing
  data or when making a prediction from one set
  of results to another:
 Central Tendency
     What is in the “Middle”?
     What is most common?
     What would we use to predict?
   Dispersion
     How Spread out is the distribution?
     What Shape is it?
Three measures of central tendency are commonly
  used in statistical analysis - the mode, the median,
  and the mean
Each measure is designed to represent a typical
  score
The choice of which measure to use depends on:
 the shape of the distribution (whether normal or

  skewed), and
 the variable’s “level of measurement” (data are

  nominal, ordinal or interval).
   Nominal variables                Mode

   Ordinal variables                Median

   Ratio/Interval level variables       Mean
   Most Common Outcome




          Male   Female
   Middle-most Value
   50% of observations are above the Median, 50%
    are below it
   The difference in magnitude between the
    observations does not matter
   Therefore, it is not sensitive to outliers
• first you rank order the values of X from low to
    high:  85, 94, 94, 96, 96, 96, 96, 97, 97, 98
•   then count number of observations = 10.
• divide by 2 to get the middle score  the 5
   score
  here 96 is the middle score
    Most common measure of central tendency
    Best for making predictions
    Applicable under two conditions:
1.   scores are measured at the interval level, and
2.   distribution is more or less normal [symmetrical].
    Symbolized as:
     
                         X
         for the mean of a sample
        μ for the mean of a population
   X = (Σ X) / N
   If X = {3, 5, 10, 4, 3}
    X = (3 + 5 + 10 + 4 + 3) / 5
      = 25 / 5
      =      5
IF THE DISTRIBUTION IS
 NORMAL

Mean is the best measure of central
 tendency
  Most scores “bunched up” in
   middle
  Extreme scores less frequent 
   don’t move mean around.
 But, central tendency doesn’t tell us
   everything !
Dispersion/Deviation/Spread tells us a lot about
  how a variable is distributed.
We are most interested in Standard Deviations (σ)
  and Variance (σ2)
     Consider these means for weekly candy bar
      consumption.

    X = {7, 8, 6, 7, 7, 6, 8, 7}   X = {12, 2, 0, 14, 10, 9, 5, 4}
    X = (7+8+6+7+7+6+8+7)/8        X = (12+2+0+14+10+9+5+4)/8
    X=7                            X=7


             What is the difference?
Once you determine that the variable of interest is
   normally distributed, ideally by producing a
histogram of the scores, the next question to be
asked is its dispersion: how spread out are the
   scores around the mean.
Dispersion is a key concept in statistical thinking.
The basic question being asked is how much do
   the scores deviate around the Mean? The more
   “bunched up” around the mean the better the
   scores are.
Basic stat review
. If every X were very close to the Mean, the
    mean would be a very good predictor.
If the distribution is very sharply peaked then
    the mean is a good measure of central
    tendency and if you were to use the mean to
    make predictions you would be right or
    close much of the time.
The key concept for describing normal distributions
and making predictions from them is called
deviation from the mean.
We could just calculate the average distance between
   each observation and the mean.
  We must take the absolute value of the distance,
   otherwise they would just cancel out to zero!
Formula:
           | X − Xi |
          ∑ n
Data: X = {6, 10, 5, 4, 9, 8}        X = 42 / 6 = 7

X – Xi        Abs. Dev.
                                1.      Compute X (Average)
7–6           1                 2.      Compute X – X and take
7 – 10        3                         the Absolute Value to get
                                        Absolute Deviations
7–5           2
                                3.      Sum the Absolute
7–4           3                         Deviations
7–9           2                 4.      Divide the sum of the
                                        absolute deviations by N
7–8           1
   Total:         12                     12 / 6 = 2
   Instead of taking the absolute value, we square
    the deviations from the mean. This yields a
    positive value.
 This will result in measures we call the

    Variance and the Standard Deviation
Sample-                   Population-
s: Standard Deviation σ: Standard Deviation
s2: Variance              σ2: Variance
Formulae:

    Variance:                Standard Deviation:



s =
2     ∑(X − Xi )    2
                        s=
                               ∑(X − X )       i
                                                   2


                N                       N
Data: X = {6, 10, 5, 4, 9, 8};      N=6
                                                 Mean:
     X           X−X             (X − X )    2

                                                 X=
                                                    ∑X       =
                                                               42
                                                                  =7
    6               -1              1                   N      6
   10                3              9            Variance:
     5              -2              4
                                                 s =
                                                  2    ∑ ( X − X )2
                                                                      =
                                                                        28
                                                                           = 4.67
     4              -3              9                       N           6
     9               2              4            Standard Deviation:
     8               1              1            s = s 2 = 4.67 = 2.16
Total: 42                        Total: 28

More Related Content

PPTX
Measures of-variation
PPT
Standard Scores
PPT
Z scores
PPT
Measures of Variation
PPT
Normal Distribution
PPT
Measures of Variation
PPT
Measure of Dispersion
Measures of-variation
Standard Scores
Z scores
Measures of Variation
Normal Distribution
Measures of Variation
Measure of Dispersion

What's hot (17)

PPTX
Standard deviation
PPTX
Normal distribution
PDF
Standard deviation
PPT
PPT
Statistics 3, 4
PPTX
Standard Scores
PDF
Standard Score And The Normal Curve
PPT
Standard deviation (3)
PPTX
Understanding Statistics 2#3 The Standard Normal Distribution
PPSX
Measures of variation and dispersion report
PPTX
Standard scores and normal distribution
PPT
The Normal Distribution and Other Continuous Distributions
PDF
Central Tendency
PPTX
Normal Distribution – Introduction and Properties
PPTX
2. standard scores
PPTX
Standard deviation
Standard deviation
Normal distribution
Standard deviation
Statistics 3, 4
Standard Scores
Standard Score And The Normal Curve
Standard deviation (3)
Understanding Statistics 2#3 The Standard Normal Distribution
Measures of variation and dispersion report
Standard scores and normal distribution
The Normal Distribution and Other Continuous Distributions
Central Tendency
Normal Distribution – Introduction and Properties
2. standard scores
Standard deviation
Ad

Viewers also liked (20)

PPTX
Finalpowerpoint
PPT
Chapter1
PDF
Driegeleding
DOCX
การจัดอาหารให้ลูกก่อนวัยเรียน
PPT
使用GoogleAppEngine建立个人信息中心
PDF
Langdurig problematische gezinssituaties
PPTX
Riedy finalpres
PDF
20141229 majalah detik_161
PPTX
The super scientist
PPTX
Comm 201 Presentation
DOCX
การจัดอาหารให้ลูกก่อนวัยเรียน
PPTX
Tv news archive cni
PPT
ท้องถิ่นไทย
PPT
Smartech
PDF
淘宝移动端Web开发最佳实践
PPTX
My favorite tv show is
PPTX
Obesity powerpoint
PPTX
UCLA OR12
PPT
UN VIAJE EN EL TIEMPO
PDF
Viergeleding
Finalpowerpoint
Chapter1
Driegeleding
การจัดอาหารให้ลูกก่อนวัยเรียน
使用GoogleAppEngine建立个人信息中心
Langdurig problematische gezinssituaties
Riedy finalpres
20141229 majalah detik_161
The super scientist
Comm 201 Presentation
การจัดอาหารให้ลูกก่อนวัยเรียน
Tv news archive cni
ท้องถิ่นไทย
Smartech
淘宝移动端Web开发最佳实践
My favorite tv show is
Obesity powerpoint
UCLA OR12
UN VIAJE EN EL TIEMPO
Viergeleding
Ad

Similar to Basic stat review (20)

PPT
Statistical methods
PDF
Ses 11,12,13,14 dispersion
PPTX
G7-quantitative
PPT
Basic statistics
PPTX
Lesson3 lpart one - Measures mean [Autosaved].pptx
PPTX
Lesson2 lecture two in Measures mean.pptx
PDF
Lesson 5.pdf ....probability and statistics
PPT
Standard Deviation.ppt
PDF
Lesson2 - lecture two Measures mean.pdf
PPTX
Introduction to statistics 3
PPTX
Standard Deviation and Variance
PPTX
Measures of Dispersion.pptx
PPTX
descriptive statistics- 1.pptx
PDF
Empirics of standard deviation
PPTX
Topic 8a Basic Statistics
PDF
Production involves transformation of inputs such as capital, equipment, labo...
PPT
Statistics in Research
PPT
Statistics in Research
PPT
Measures of dispersion
Statistical methods
Ses 11,12,13,14 dispersion
G7-quantitative
Basic statistics
Lesson3 lpart one - Measures mean [Autosaved].pptx
Lesson2 lecture two in Measures mean.pptx
Lesson 5.pdf ....probability and statistics
Standard Deviation.ppt
Lesson2 - lecture two Measures mean.pdf
Introduction to statistics 3
Standard Deviation and Variance
Measures of Dispersion.pptx
descriptive statistics- 1.pptx
Empirics of standard deviation
Topic 8a Basic Statistics
Production involves transformation of inputs such as capital, equipment, labo...
Statistics in Research
Statistics in Research
Measures of dispersion

Recently uploaded (20)

PDF
Empowerment Technology for Senior High School Guide
DOC
Soft-furnishing-By-Architect-A.F.M.Mohiuddin-Akhand.doc
PDF
My India Quiz Book_20210205121199924.pdf
PDF
Hazard Identification & Risk Assessment .pdf
PDF
HVAC Specification 2024 according to central public works department
PDF
MBA _Common_ 2nd year Syllabus _2021-22_.pdf
PPTX
Introduction to pro and eukaryotes and differences.pptx
PPTX
Unit 4 Computer Architecture Multicore Processor.pptx
PPTX
Chinmaya Tiranga Azadi Quiz (Class 7-8 )
PPTX
Onco Emergencies - Spinal cord compression Superior vena cava syndrome Febr...
PDF
Vision Prelims GS PYQ Analysis 2011-2022 www.upscpdf.com.pdf
PDF
1.3 FINAL REVISED K-10 PE and Health CG 2023 Grades 4-10 (1).pdf
PDF
Black Hat USA 2025 - Micro ICS Summit - ICS/OT Threat Landscape
PDF
BP 704 T. NOVEL DRUG DELIVERY SYSTEMS (UNIT 1)
PPTX
202450812 BayCHI UCSC-SV 20250812 v17.pptx
PDF
OBE - B.A.(HON'S) IN INTERIOR ARCHITECTURE -Ar.MOHIUDDIN.pdf
PDF
Chinmaya Tiranga quiz Grand Finale.pdf
PPTX
B.Sc. DS Unit 2 Software Engineering.pptx
PPTX
TNA_Presentation-1-Final(SAVE)) (1).pptx
PDF
Τίμαιος είναι φιλοσοφικός διάλογος του Πλάτωνα
Empowerment Technology for Senior High School Guide
Soft-furnishing-By-Architect-A.F.M.Mohiuddin-Akhand.doc
My India Quiz Book_20210205121199924.pdf
Hazard Identification & Risk Assessment .pdf
HVAC Specification 2024 according to central public works department
MBA _Common_ 2nd year Syllabus _2021-22_.pdf
Introduction to pro and eukaryotes and differences.pptx
Unit 4 Computer Architecture Multicore Processor.pptx
Chinmaya Tiranga Azadi Quiz (Class 7-8 )
Onco Emergencies - Spinal cord compression Superior vena cava syndrome Febr...
Vision Prelims GS PYQ Analysis 2011-2022 www.upscpdf.com.pdf
1.3 FINAL REVISED K-10 PE and Health CG 2023 Grades 4-10 (1).pdf
Black Hat USA 2025 - Micro ICS Summit - ICS/OT Threat Landscape
BP 704 T. NOVEL DRUG DELIVERY SYSTEMS (UNIT 1)
202450812 BayCHI UCSC-SV 20250812 v17.pptx
OBE - B.A.(HON'S) IN INTERIOR ARCHITECTURE -Ar.MOHIUDDIN.pdf
Chinmaya Tiranga quiz Grand Finale.pdf
B.Sc. DS Unit 2 Software Engineering.pptx
TNA_Presentation-1-Final(SAVE)) (1).pptx
Τίμαιος είναι φιλοσοφικός διάλογος του Πλάτωνα

Basic stat review

  • 1. Level of Measurement: Nominal, Ordinal, Ratio, Interval Frequency tables to distributions Central Tendency: Mode, Median, Mean Dispersion: Variance, Standard Deviation
  • 2. Nominal – categorical data ; numbers represent labels or categories of data but have no quantitative value e.g. demographic data  Ordinal – ranked/ordered data ; numbers rank or order people or objects based on the attribute being measured  Interval – points on a scale are an equal distance apart ; scale does not contain an absolute zero point  Ratio – points on a scale are an equal distance apart and there is an absolute zero point
  • 4. Simple depiction of all the data  Graphic — easy to understand  Problems  Not always precisely measured  Not summarized in one number or datum
  • 5. Observation Frequency 65 1 70 2 75 3 80 4 85 3 90 2 95 1
  • 6. 4 3 Frequency 2 1 65 70 75 80 85 90 95 Test Score
  • 7. Two key characteristics of a frequency distribution are especially important when summarizing data or when making a prediction from one set of results to another:  Central Tendency  What is in the “Middle”?  What is most common?  What would we use to predict?  Dispersion  How Spread out is the distribution?  What Shape is it?
  • 8. Three measures of central tendency are commonly used in statistical analysis - the mode, the median, and the mean Each measure is designed to represent a typical score The choice of which measure to use depends on:  the shape of the distribution (whether normal or skewed), and  the variable’s “level of measurement” (data are nominal, ordinal or interval).
  • 9. Nominal variables Mode  Ordinal variables Median  Ratio/Interval level variables Mean
  • 10. Most Common Outcome Male Female
  • 11. Middle-most Value  50% of observations are above the Median, 50% are below it  The difference in magnitude between the observations does not matter  Therefore, it is not sensitive to outliers
  • 12. • first you rank order the values of X from low to high:  85, 94, 94, 96, 96, 96, 96, 97, 97, 98 • then count number of observations = 10. • divide by 2 to get the middle score  the 5 score here 96 is the middle score
  • 13. Most common measure of central tendency  Best for making predictions  Applicable under two conditions: 1. scores are measured at the interval level, and 2. distribution is more or less normal [symmetrical].  Symbolized as:  X for the mean of a sample  μ for the mean of a population
  • 14. X = (Σ X) / N  If X = {3, 5, 10, 4, 3} X = (3 + 5 + 10 + 4 + 3) / 5 = 25 / 5 = 5
  • 15. IF THE DISTRIBUTION IS NORMAL Mean is the best measure of central tendency  Most scores “bunched up” in middle  Extreme scores less frequent  don’t move mean around. But, central tendency doesn’t tell us everything !
  • 16. Dispersion/Deviation/Spread tells us a lot about how a variable is distributed. We are most interested in Standard Deviations (σ) and Variance (σ2)
  • 17. Consider these means for weekly candy bar consumption. X = {7, 8, 6, 7, 7, 6, 8, 7} X = {12, 2, 0, 14, 10, 9, 5, 4} X = (7+8+6+7+7+6+8+7)/8 X = (12+2+0+14+10+9+5+4)/8 X=7 X=7 What is the difference?
  • 18. Once you determine that the variable of interest is normally distributed, ideally by producing a histogram of the scores, the next question to be asked is its dispersion: how spread out are the scores around the mean. Dispersion is a key concept in statistical thinking. The basic question being asked is how much do the scores deviate around the Mean? The more “bunched up” around the mean the better the scores are.
  • 20. . If every X were very close to the Mean, the mean would be a very good predictor. If the distribution is very sharply peaked then the mean is a good measure of central tendency and if you were to use the mean to make predictions you would be right or close much of the time.
  • 21. The key concept for describing normal distributions and making predictions from them is called deviation from the mean. We could just calculate the average distance between each observation and the mean.  We must take the absolute value of the distance, otherwise they would just cancel out to zero! Formula: | X − Xi | ∑ n
  • 22. Data: X = {6, 10, 5, 4, 9, 8} X = 42 / 6 = 7 X – Xi Abs. Dev. 1. Compute X (Average) 7–6 1 2. Compute X – X and take 7 – 10 3 the Absolute Value to get Absolute Deviations 7–5 2 3. Sum the Absolute 7–4 3 Deviations 7–9 2 4. Divide the sum of the absolute deviations by N 7–8 1 Total: 12 12 / 6 = 2
  • 23. Instead of taking the absolute value, we square the deviations from the mean. This yields a positive value.  This will result in measures we call the Variance and the Standard Deviation Sample- Population- s: Standard Deviation σ: Standard Deviation s2: Variance σ2: Variance
  • 24. Formulae: Variance: Standard Deviation: s = 2 ∑(X − Xi ) 2 s= ∑(X − X ) i 2 N N
  • 25. Data: X = {6, 10, 5, 4, 9, 8}; N=6 Mean: X X−X (X − X ) 2 X= ∑X = 42 =7 6 -1 1 N 6 10 3 9 Variance: 5 -2 4 s = 2 ∑ ( X − X )2 = 28 = 4.67 4 -3 9 N 6 9 2 4 Standard Deviation: 8 1 1 s = s 2 = 4.67 = 2.16 Total: 42 Total: 28