SlideShare a Scribd company logo
INTRODUCTION TO STATISTICS AND  STATISTICAL INFERENCE Teaching Basic Statistics
Session 1. TEACHING BASIC STATISTICS
Realities about Statistics  “ There are three kinds of lies: lies, damned lies, and statistics”  – Mark Twaine One can not go about without statistics. “ Statistics are like bikinis. What they reveal is suggestive, but what they conceal is vital.”   –  Aaron Levenstein   Session 1. TEACHING BASIC STATISTICS
Definition of Statistics plural sense:  numerical facts, e.g. CPI, peso-dollar exchange rate  singular sense:  scientific discipline consisting of theory and methods for processing numerical information that one can use when making decisions in the face of uncertainty.  Session 1. TEACHING BASIC STATISTICS
History of Statistics The term  statistics  came from the Latin phrase  “ratio status ”  which means study of practical politics or the statesman’s art. In the middle of 18 th  century, the term  statistik   (a term due to Achenwall) was used, a German term defined as “the political science of several countries”  From  statistik  it became  statistics  defined as a statement in figures and facts of the present condition of a state.  Session 1. TEACHING BASIC STATISTICS
Application of Statistics Diverse applications “ During the 20th Century statistical thinking and methodology have become the scientific framework for literally dozens of fields including education, agriculture, economics, biology, and medicine, and with increasing influence recently on the hard sciences such as astronomy, geology, and physics. In other words, we have grown from a small obscure field into a big obscure field.”   – Brad Efron Session 1. TEACHING BASIC STATISTICS
Application of Statistics Comparing the effects of five kinds of fertilizers on the yield of a particular variety of corn Determining the income distribution of Ateneo students under CHED Comparing the effectiveness of two diet programs Prediction of daily temperatures Evaluation of student performance Session 1. TEACHING BASIC STATISTICS
Two Aims of Statistics Statistics aims to uncover structure in data, to explain variation… Descriptive Inferential   Session 1. TEACHING BASIC STATISTICS
Areas of Statistics Descriptive statistics methods concerned w/ collecting, describing, and analyzing a set of data  without drawing conclusions  (or inferences) about a large group Inferential statistics methods concerned with the analysis of a subset of data leading to predictions or inferences about the entire set of data Session 1. TEACHING BASIC STATISTICS
Examples of Descriptive Statistics Presenting the Philippine population by constructing a graph indicating the total number of Filipinos counted during the last census by age group and sex The Department of Social Welfare and Development (DSWD) cited statistics showing an  increase in the number of child abuse cases during the past five years. Session 1. TEACHING BASIC STATISTICS
Examples of Inferential Statistics A new milk formulation designed to improve the psychomotor development of infants was tested on randomly selected infants.  Based on the results, it was concluded that the new milk formulation is effective in improving the psychomotor development of infants . Session 1. TEACHING BASIC STATISTICS
Inferential Statistics Session 1. TEACHING BASIC STATISTICS Larger Set ( N  units/observations) Smaller Set ( n  units/observations ) Inferences and Generalizations
Key Definitions A  variable  is a characteristic observed or measured on every unit of the universe. A  population  is the set of all possible values of the variable. Session 1. TEACHING BASIC STATISTICS
Key Definitions Parameters  are numerical measures that describe the population or universe of interest. Usually donated by Greek letters;    (mu),    (sigma),    (rho),    (lambda),    (tau),    (theta),    (alpha) and    (beta).  Statistics   are numerical measures of a sample Session 1. TEACHING BASIC STATISTICS
Types of Variables Qualitative variable non-numerical values Quantitative variable numerical values Discrete countable  Continuous measurable Session 1. TEACHING BASIC STATISTICS
Levels of Measurement Nominal Numbers or symbols used to classify Ordinal scale Accounts for order; no indication of distance between positions Interval scale Equal intervals; no absolute zero Ratio scale Has absolute zero Session 1. TEACHING BASIC STATISTICS
Session 1. TEACHING BASIC STATISTICS NOMINAL SCALE a nominal scale consists of a set of categories that have different names measurements on a nominal scale label and categorize observations, but do not make any quantitative distinctions between observations.Variables measured at the nominal scale: Gender (1= male, 0=female) ZIP code (7000=Philippines, …) Plate numbers of vehicles (JK3429, MC001, …) Course (Biology, Mathematics, History, …) Race (Asian, American, …) Eye color (Brown, Blue, …)
Session 1. TEACHING BASIC STATISTICS ORDINAL SCALE consists of a set of categories that are organized in an ordered sequence measurements on an ordinal scale rank observations in terms of size variables that can be measured at the ordinal scale: Ranks in a race (first, second, third, …) Sizes of shirts (small, medium, large, …) Order of birth (first child, second child , third child , …) Socio-economic status (lower, middle, upper, …) Difficulty level of a test (easy, average, difficult, …) Degree of agreement (SD, D, A, SA)
Session 1. TEACHING BASIC STATISTICS INTERVAL SCALE  consists of ordered categories that are all intervals of exactly the same size equal differences between numbers on the scale reflect equal differences in magnitude, however, ratios of magnitudes are not meaningful. Variables measured at the interval scale: Temperature (in  o F or  o C) IQ  SAT scores
Session 1. TEACHING BASIC STATISTICS RATIO SCALE  is an interval scale with additional feature of an absolute zero point Ratios of numbers do reflect ratios of magnitude Variables measured at the ratio scale: Age (16, 20, 28, …) Height (165cm, 154cm, 144cm, …) Reaction time (20sec, 43sec, 37sec, …)  Number of siblings (2, 5, 8, …) Hours spent on studying for an exam (0, 2, 3, …)
Methods of Presenting Data Textual Tabular Graphical Session 1. TEACHING BASIC STATISTICS
Session 1. TEACHING BASIC STATISTICS Mean Median Mode Summary Measures Variation Variance Standard Deviation Coefficient of Variation Range Location Maximum Minimum Central  Tendency Percentile Quartile Decile Interquartile Range Skewness Kurtosis
Measures of Location A  Measure of Location  summarizes a data set by giving a “typical value” within the range of the data values that describes its location relative to entire data set. Some Common Measures:    Minimum, Maximum    Central Tendency    Percentiles, Deciles, Quartiles Session 1. TEACHING BASIC STATISTICS
Maximum and Minimum Minimum   is the smallest value in the data set, denoted as  MIN . Maximum  is the largest value in the data set, denoted as  MAX . Session 1. TEACHING BASIC STATISTICS
Measure of Central Tendency A single value that is used to identify the “center” of the data  it is thought of as a typical value of the distribution precise yet simple most representative value of the data Session 1. TEACHING BASIC STATISTICS
Mean  Most common measure of the center  Also known as arithmetic average Session 1. TEACHING BASIC STATISTICS Sample Mean Population Mean
Properties of the Mean may not be an actual observation in the data set can be applied in at least interval level easy to compute every observation contributes to the value of the mean  Session 1. TEACHING BASIC STATISTICS
Properties of the Mean  subgroup means can be combined to come up with a group mean easily affected by extreme values Session 1. TEACHING BASIC STATISTICS 0  1  2  3  4  5  6  7  8  9  10 0  1  2  3  4  5  6  7  8  9  10  12  14  Mean = 6 Mean = 5
Median Divides the observations into two equal parts If n is odd, the median is the middle number. If n is even, the median is the average of the 2 middle numbers. Sample median denoted as while population median is denoted as Session 1. TEACHING BASIC STATISTICS
Properties of a Median may not be an actual observation in the data set can be applied in at least ordinal level a positional measure; not affected by extreme values Session 1. TEACHING BASIC STATISTICS 0  1  2  3  4  5  6  7  8  9  10 0  1  2  3  4  5  6  7  8  9  10  12  14  Median = 5
Mode  occurs most frequently  nominal average computation of the mode for ungrouped or raw data Session 1. TEACHING BASIC STATISTICS 0  1  2  3  4  5  6  7  8  9  10  11  12  13  14  Mode = 9 0  1  2  3  4  5  6 No Mode
Properties of a Mode can be used for qualitative as well as quantitative data  may not be unique not affected by extreme values may not exist Session 1. TEACHING BASIC STATISTICS
Mean, Median & Mode Use the mean when: sampling stability is desired other measures are to be computed Session 1. TEACHING BASIC STATISTICS
Mean, Median & Mode Use the median when: the exact midpoint of the distribution is desired there are extreme observations Session 1. TEACHING BASIC STATISTICS
Mean, Median & Mode Use the mode when: when the "typical" value is desired when the dataset is measured on a nominal scale Session 1. TEACHING BASIC STATISTICS
Percentiles Numerical measures that give the relative position of a data  value  relative to the entire data set. Divide an array ( raw data arranged in increasing or decreasing order of magnitude ) into 100 equal parts. The  j th  percentile, denoted as  P j ,  is the data value in the the data set that separates the bottom  j % of the data from the top (100- j )%.  Session 1. TEACHING BASIC STATISTICS
EXAMPLE Suppose LJ was told that relative to the other scores on a certain test, his score was the 95 th  percentile.    This means that 95% of those who took the test had scores less than or equal to LJ’s score, while 5% had scores higher than LJ’s. Session 1. TEACHING BASIC STATISTICS
Deciles Divide an array into ten equal parts, each part having ten percent of the distribution of the data values, denoted by  D j . The 1 st  decile is the 10 th  percentile; the 2 nd  decile is the 20 th  percentile….. Session 1. TEACHING BASIC STATISTICS
Quartiles Divide an array into four equal parts, each part having 25% of the distribution of the data values, denoted by  Q j . The 1 st  quartile is the 25 th  percentile; the 2 nd  quartile is the 50 th  percentile, also the median and the 3 rd  quartile is the 75 th  percentile. Session 1. TEACHING BASIC STATISTICS
Measures of Variation A measure of variation is a single value that is used to describe the spread of the distribution A measure of central tendency alone does not uniquely describe a distribution Session 1. TEACHING BASIC STATISTICS
Session 1. TEACHING BASIC STATISTICS Mean = 15.5 s  =  3.338   11  12  13  14  15  16  17  18  19  20  21 11  12  13  14  15  16  17  18  19  20  21 Data B Data A Mean = 15.5 s =  .9258 11  12  13  14  15  16  17  18  19  20  21 Mean = 15.5 s =  4.57 Data C A look at dispersion…
Two Types of Measures of Dispersion Absolute Measures of Dispersion:    Range    Inter-quartile Range     Variance    Standard Deviation Session 1. TEACHING BASIC STATISTICS Relative Measure of Dispersion:    Coefficient of Variation
Range (R) Session 1. TEACHING BASIC STATISTICS The difference between the maximum and minimum value in a data set, i.e. R = MAX – MIN Example:  Pulse rates of 15 male residents of a certain village  54   58  58  60  62  65  66  71  74  75  77  78  80  82  85   R  = 85 - 54 = 31
Some Properties of the Range   The larger the value of the range, the more dispersed the observations are. It is quick and easy to understand. A rough measure of dispersion. Session 1. TEACHING BASIC STATISTICS
Inter-Quartile Range (IQR) Session 1. TEACHING BASIC STATISTICS The difference between the third quartile and first quartile, i.e. IQR = Q 3  – Q 1   Example:  Pulse rates of 15 residents of a certain village 54  58  58  60   62  65  66  71  74  75  77  78   80  82  85   IQR  = 78 - 60 = 18
Some Properties of IQR Reduces the influence of extreme values. Not as easy to calculate as the Range. Session 1. TEACHING BASIC STATISTICS
Variance important measure of variation shows variation about the mean Population variance Sample variance Session 1. TEACHING BASIC STATISTICS
Standard Deviation (SD) most important measure of variation square root of Variance has the same units as the original data Population SD Sample SD Session 1. TEACHING BASIC STATISTICS
Session 1. TEACHING BASIC STATISTICS Data:   10  12  14  15  17  18  18  24 n = 8  Mean =16 Computation of Standard Deviation
Session 1. TEACHING BASIC STATISTICS Remarks on Standard Deviation If there is a large amount of variation, then on average, the data values will be far from the mean.  Hence, the SD will be large. If there is only a small amount of variation, then on average, the data values will be close to the mean.  Hence, the SD will be small.
Comparing Standard Deviation Session 1. TEACHING BASIC STATISTICS Mean = 15.5 s  =  3.338   11  12  13  14  15  16  17  18  19  20  21 11  12  13  14  15  16  17  18  19  20  21 Data B Data A Mean = 15.5 s =  .9258 11  12  13  14  15  16  17  18  19  20  21 Mean = 15.5 s =  4.57 Data C
Comparing Standard Deviation Session 1. TEACHING BASIC STATISTICS Example:  Team A - Heights of five marathon players in inches  65”  65 “ 65 “ 65 “ 65 “ 65 “ Mean = 65 S   =  0
Comparing Standard Deviation Session 1. TEACHING BASIC STATISTICS Example:  Team B - Heights of five marathon players in inches 62 “ 67 “ 66 “ 70 “ 60 “ Mean = 65” s  =  4.0”
Properties of Standard Deviation It is the most widely used measure of dispersion. (Chebychev’s Inequality)  It is based on all the items and is rigidly defined. It is used to test the reliability of measures calculated from samples. The standard deviation is sensitive to the presence of extreme values.  It is not easy to calculate by hand (unlike the range). Session 1. TEACHING BASIC STATISTICS
Coefficient of Variation (CV) measure of relative variation usually expressed in percent shows variation relative to mean used to   compare 2 or more groups Formula : Session 1. TEACHING BASIC STATISTICS
Comparing CVs Stock A:  Average Price = P50   SD = P5   CV = 10% Stock B:  Average Price = P100   SD = P5   CV = 5% Session 1. TEACHING BASIC STATISTICS
Measure of Skewness Describes the degree of departures of the distribution of the data from symmetry. The degree of skewness is measured by the coefficient of skewness, denoted as  SK  and computed as, Session 1. TEACHING BASIC STATISTICS
What is Symmetry? A distribution is said to be symmetric about the mean, if the distribution to the left of mean is the “mirror image” of the distribution to the right of the mean. Likewise, a symmetric distribution has SK=0 since its mean is equal to its median and its mode. Session 1. TEACHING BASIC STATISTICS
positively skewed Measure of Skewness negatively skewed Session 1. TEACHING BASIC STATISTICS
Measure of Kurtosis Describes the extent of peakedness or flatness of the distribution of the data. Measured by coefficient of kurtosis ( K ) computed as, Session 1. TEACHING BASIC STATISTICS
Measure of Kurtosis Session 1. TEACHING BASIC STATISTICS K  = 0 mesokurtic K  > 0 leptokurtic K  < 0 platykurtic

More Related Content

PPTX
What is Statistics
PPTX
Statistics
PPTX
Descriptive statistics
PPTX
Statistical table
PPTX
Introduction to Statistics
PPT
Introduction to Statistics and Statistical Inference
PPTX
Statistics
PPTX
statistic
What is Statistics
Statistics
Descriptive statistics
Statistical table
Introduction to Statistics
Introduction to Statistics and Statistical Inference
Statistics
statistic

What's hot (20)

PDF
Introduction to Statistics
PPTX
Measures of central tendency
PPTX
Introduction to Statistics (Part -I)
PPTX
What is statistics
PPTX
Descriptive Statistics
PPTX
PPTX
Measures of central tendency ppt
PPT
Skewness.ppt
PDF
Statistics: Chapter One
PPSX
Types of Statistics
PPTX
Central limit theorem
PPTX
Unit 1 - Statistics (Part 1).pptx
PPTX
Chapter 2: Frequency Distribution and Graphs
PPT
The sampling distribution
PPTX
1.2 types of data
PDF
Frequency distribution
PPTX
Intro to probability
PPTX
Basics stat ppt-types of data
PPTX
Use of statistics in real life
PPT
Introduction to Probability and Probability Distributions
Introduction to Statistics
Measures of central tendency
Introduction to Statistics (Part -I)
What is statistics
Descriptive Statistics
Measures of central tendency ppt
Skewness.ppt
Statistics: Chapter One
Types of Statistics
Central limit theorem
Unit 1 - Statistics (Part 1).pptx
Chapter 2: Frequency Distribution and Graphs
The sampling distribution
1.2 types of data
Frequency distribution
Intro to probability
Basics stat ppt-types of data
Use of statistics in real life
Introduction to Probability and Probability Distributions
Ad

Viewers also liked (20)

PPSX
Introduction to statistics...ppt rahul
PPT
Introduction To Statistics
PPTX
Introduction to statistics
PPT
Basic Statistical Concepts and Methods
PPT
Statistical ppt
PPT
Introduction to Elementary statistics
PPTX
Statistics Math project class 10th
PPT
Statistics
PDF
STATISTICS AND PROBABILITY (TEACHING GUIDE)
PPTX
Introduction to statistics
PPTX
Ses 1 basic fundamentals of mathematics and statistics
PPTX
Introduction to Statistics
PPT
Statistics lesson 1
PPTX
Kurtosis
PPTX
Permutation and combination - Math Statistic
PPT
Probability and statistics(exercise answers)
PDF
What Is Statistics
PDF
Qm 4 20100905
PPTX
An Overview of Basic Statistics
PPT
Gravetter & Wallnau 7e Ch 08 PPt Handout
Introduction to statistics...ppt rahul
Introduction To Statistics
Introduction to statistics
Basic Statistical Concepts and Methods
Statistical ppt
Introduction to Elementary statistics
Statistics Math project class 10th
Statistics
STATISTICS AND PROBABILITY (TEACHING GUIDE)
Introduction to statistics
Ses 1 basic fundamentals of mathematics and statistics
Introduction to Statistics
Statistics lesson 1
Kurtosis
Permutation and combination - Math Statistic
Probability and statistics(exercise answers)
What Is Statistics
Qm 4 20100905
An Overview of Basic Statistics
Gravetter & Wallnau 7e Ch 08 PPt Handout
Ad

Similar to Math 102- Statistics (20)

PPT
1_INTRODUCTION TO STATISTICS AND STATISTICAL INFERENCE.ppt
PPT
INTRODUCTION TO STATISTICS.ppt
PPT
Descriptive Statistics for Graduate School
PPTX
STATISTICS.pptx for the scholars and students
PDF
Module 1 introduction to statistics
PPTX
Major types of statistics terms that you should know
PPTX
Chapter 1 of the book Basic Statistics as described by teacher
PPTX
Statistics-Chapter-1.pptxheheheueuehehehehehe
PPT
Introduction to statistics
PPT
Statistics and probability lecture 01
PPTX
introduction to statistics
PPTX
Stats LECTURE 1.pptx
PPTX
Meaning and Importance of Statistics
PPTX
Complete Biostatistics (Descriptive and Inferential analysis)
PDF
Statistics of engineer’s with basic concepts in statistics
PPTX
Basic statistics
PPTX
050325Online SPSS.pptx spss social science
PPTX
Data Management_new.pptx
PPTX
Introduction to Statistics - Basic concepts
1_INTRODUCTION TO STATISTICS AND STATISTICAL INFERENCE.ppt
INTRODUCTION TO STATISTICS.ppt
Descriptive Statistics for Graduate School
STATISTICS.pptx for the scholars and students
Module 1 introduction to statistics
Major types of statistics terms that you should know
Chapter 1 of the book Basic Statistics as described by teacher
Statistics-Chapter-1.pptxheheheueuehehehehehe
Introduction to statistics
Statistics and probability lecture 01
introduction to statistics
Stats LECTURE 1.pptx
Meaning and Importance of Statistics
Complete Biostatistics (Descriptive and Inferential analysis)
Statistics of engineer’s with basic concepts in statistics
Basic statistics
050325Online SPSS.pptx spss social science
Data Management_new.pptx
Introduction to Statistics - Basic concepts

Recently uploaded (20)

DOC
Soft-furnishing-By-Architect-A.F.M.Mohiuddin-Akhand.doc
PDF
SOIL: Factor, Horizon, Process, Classification, Degradation, Conservation
PDF
1_English_Language_Set_2.pdf probationary
PDF
What if we spent less time fighting change, and more time building what’s rig...
PPTX
Final Presentation General Medicine 03-08-2024.pptx
PPTX
Introduction-to-Literarature-and-Literary-Studies-week-Prelim-coverage.pptx
PDF
ChatGPT for Dummies - Pam Baker Ccesa007.pdf
PDF
احياء السادس العلمي - الفصل الثالث (التكاثر) منهج متميزين/كلية بغداد/موهوبين
PDF
IGGE1 Understanding the Self1234567891011
PDF
Empowerment Technology for Senior High School Guide
PDF
Black Hat USA 2025 - Micro ICS Summit - ICS/OT Threat Landscape
PPTX
Lesson notes of climatology university.
PDF
RTP_AR_KS1_Tutor's Guide_English [FOR REPRODUCTION].pdf
PPTX
Final Presentation General Medicine 03-08-2024.pptx
PDF
Supply Chain Operations Speaking Notes -ICLT Program
PDF
Paper A Mock Exam 9_ Attempt review.pdf.
PDF
A systematic review of self-coping strategies used by university students to ...
PPTX
Orientation - ARALprogram of Deped to the Parents.pptx
PPTX
UV-Visible spectroscopy..pptx UV-Visible Spectroscopy – Electronic Transition...
PDF
Indian roads congress 037 - 2012 Flexible pavement
Soft-furnishing-By-Architect-A.F.M.Mohiuddin-Akhand.doc
SOIL: Factor, Horizon, Process, Classification, Degradation, Conservation
1_English_Language_Set_2.pdf probationary
What if we spent less time fighting change, and more time building what’s rig...
Final Presentation General Medicine 03-08-2024.pptx
Introduction-to-Literarature-and-Literary-Studies-week-Prelim-coverage.pptx
ChatGPT for Dummies - Pam Baker Ccesa007.pdf
احياء السادس العلمي - الفصل الثالث (التكاثر) منهج متميزين/كلية بغداد/موهوبين
IGGE1 Understanding the Self1234567891011
Empowerment Technology for Senior High School Guide
Black Hat USA 2025 - Micro ICS Summit - ICS/OT Threat Landscape
Lesson notes of climatology university.
RTP_AR_KS1_Tutor's Guide_English [FOR REPRODUCTION].pdf
Final Presentation General Medicine 03-08-2024.pptx
Supply Chain Operations Speaking Notes -ICLT Program
Paper A Mock Exam 9_ Attempt review.pdf.
A systematic review of self-coping strategies used by university students to ...
Orientation - ARALprogram of Deped to the Parents.pptx
UV-Visible spectroscopy..pptx UV-Visible Spectroscopy – Electronic Transition...
Indian roads congress 037 - 2012 Flexible pavement

Math 102- Statistics

  • 1. INTRODUCTION TO STATISTICS AND STATISTICAL INFERENCE Teaching Basic Statistics
  • 2. Session 1. TEACHING BASIC STATISTICS
  • 3. Realities about Statistics “ There are three kinds of lies: lies, damned lies, and statistics” – Mark Twaine One can not go about without statistics. “ Statistics are like bikinis. What they reveal is suggestive, but what they conceal is vital.” – Aaron Levenstein   Session 1. TEACHING BASIC STATISTICS
  • 4. Definition of Statistics plural sense: numerical facts, e.g. CPI, peso-dollar exchange rate singular sense: scientific discipline consisting of theory and methods for processing numerical information that one can use when making decisions in the face of uncertainty. Session 1. TEACHING BASIC STATISTICS
  • 5. History of Statistics The term statistics came from the Latin phrase “ratio status ” which means study of practical politics or the statesman’s art. In the middle of 18 th century, the term statistik (a term due to Achenwall) was used, a German term defined as “the political science of several countries” From statistik it became statistics defined as a statement in figures and facts of the present condition of a state. Session 1. TEACHING BASIC STATISTICS
  • 6. Application of Statistics Diverse applications “ During the 20th Century statistical thinking and methodology have become the scientific framework for literally dozens of fields including education, agriculture, economics, biology, and medicine, and with increasing influence recently on the hard sciences such as astronomy, geology, and physics. In other words, we have grown from a small obscure field into a big obscure field.” – Brad Efron Session 1. TEACHING BASIC STATISTICS
  • 7. Application of Statistics Comparing the effects of five kinds of fertilizers on the yield of a particular variety of corn Determining the income distribution of Ateneo students under CHED Comparing the effectiveness of two diet programs Prediction of daily temperatures Evaluation of student performance Session 1. TEACHING BASIC STATISTICS
  • 8. Two Aims of Statistics Statistics aims to uncover structure in data, to explain variation… Descriptive Inferential Session 1. TEACHING BASIC STATISTICS
  • 9. Areas of Statistics Descriptive statistics methods concerned w/ collecting, describing, and analyzing a set of data without drawing conclusions (or inferences) about a large group Inferential statistics methods concerned with the analysis of a subset of data leading to predictions or inferences about the entire set of data Session 1. TEACHING BASIC STATISTICS
  • 10. Examples of Descriptive Statistics Presenting the Philippine population by constructing a graph indicating the total number of Filipinos counted during the last census by age group and sex The Department of Social Welfare and Development (DSWD) cited statistics showing an increase in the number of child abuse cases during the past five years. Session 1. TEACHING BASIC STATISTICS
  • 11. Examples of Inferential Statistics A new milk formulation designed to improve the psychomotor development of infants was tested on randomly selected infants. Based on the results, it was concluded that the new milk formulation is effective in improving the psychomotor development of infants . Session 1. TEACHING BASIC STATISTICS
  • 12. Inferential Statistics Session 1. TEACHING BASIC STATISTICS Larger Set ( N units/observations) Smaller Set ( n units/observations ) Inferences and Generalizations
  • 13. Key Definitions A variable is a characteristic observed or measured on every unit of the universe. A population is the set of all possible values of the variable. Session 1. TEACHING BASIC STATISTICS
  • 14. Key Definitions Parameters are numerical measures that describe the population or universe of interest. Usually donated by Greek letters;  (mu),  (sigma),  (rho),  (lambda),  (tau),  (theta),  (alpha) and  (beta). Statistics are numerical measures of a sample Session 1. TEACHING BASIC STATISTICS
  • 15. Types of Variables Qualitative variable non-numerical values Quantitative variable numerical values Discrete countable Continuous measurable Session 1. TEACHING BASIC STATISTICS
  • 16. Levels of Measurement Nominal Numbers or symbols used to classify Ordinal scale Accounts for order; no indication of distance between positions Interval scale Equal intervals; no absolute zero Ratio scale Has absolute zero Session 1. TEACHING BASIC STATISTICS
  • 17. Session 1. TEACHING BASIC STATISTICS NOMINAL SCALE a nominal scale consists of a set of categories that have different names measurements on a nominal scale label and categorize observations, but do not make any quantitative distinctions between observations.Variables measured at the nominal scale: Gender (1= male, 0=female) ZIP code (7000=Philippines, …) Plate numbers of vehicles (JK3429, MC001, …) Course (Biology, Mathematics, History, …) Race (Asian, American, …) Eye color (Brown, Blue, …)
  • 18. Session 1. TEACHING BASIC STATISTICS ORDINAL SCALE consists of a set of categories that are organized in an ordered sequence measurements on an ordinal scale rank observations in terms of size variables that can be measured at the ordinal scale: Ranks in a race (first, second, third, …) Sizes of shirts (small, medium, large, …) Order of birth (first child, second child , third child , …) Socio-economic status (lower, middle, upper, …) Difficulty level of a test (easy, average, difficult, …) Degree of agreement (SD, D, A, SA)
  • 19. Session 1. TEACHING BASIC STATISTICS INTERVAL SCALE consists of ordered categories that are all intervals of exactly the same size equal differences between numbers on the scale reflect equal differences in magnitude, however, ratios of magnitudes are not meaningful. Variables measured at the interval scale: Temperature (in o F or o C) IQ SAT scores
  • 20. Session 1. TEACHING BASIC STATISTICS RATIO SCALE is an interval scale with additional feature of an absolute zero point Ratios of numbers do reflect ratios of magnitude Variables measured at the ratio scale: Age (16, 20, 28, …) Height (165cm, 154cm, 144cm, …) Reaction time (20sec, 43sec, 37sec, …) Number of siblings (2, 5, 8, …) Hours spent on studying for an exam (0, 2, 3, …)
  • 21. Methods of Presenting Data Textual Tabular Graphical Session 1. TEACHING BASIC STATISTICS
  • 22. Session 1. TEACHING BASIC STATISTICS Mean Median Mode Summary Measures Variation Variance Standard Deviation Coefficient of Variation Range Location Maximum Minimum Central Tendency Percentile Quartile Decile Interquartile Range Skewness Kurtosis
  • 23. Measures of Location A Measure of Location summarizes a data set by giving a “typical value” within the range of the data values that describes its location relative to entire data set. Some Common Measures:  Minimum, Maximum  Central Tendency  Percentiles, Deciles, Quartiles Session 1. TEACHING BASIC STATISTICS
  • 24. Maximum and Minimum Minimum is the smallest value in the data set, denoted as MIN . Maximum is the largest value in the data set, denoted as MAX . Session 1. TEACHING BASIC STATISTICS
  • 25. Measure of Central Tendency A single value that is used to identify the “center” of the data it is thought of as a typical value of the distribution precise yet simple most representative value of the data Session 1. TEACHING BASIC STATISTICS
  • 26. Mean Most common measure of the center Also known as arithmetic average Session 1. TEACHING BASIC STATISTICS Sample Mean Population Mean
  • 27. Properties of the Mean may not be an actual observation in the data set can be applied in at least interval level easy to compute every observation contributes to the value of the mean Session 1. TEACHING BASIC STATISTICS
  • 28. Properties of the Mean subgroup means can be combined to come up with a group mean easily affected by extreme values Session 1. TEACHING BASIC STATISTICS 0 1 2 3 4 5 6 7 8 9 10 0 1 2 3 4 5 6 7 8 9 10 12 14 Mean = 6 Mean = 5
  • 29. Median Divides the observations into two equal parts If n is odd, the median is the middle number. If n is even, the median is the average of the 2 middle numbers. Sample median denoted as while population median is denoted as Session 1. TEACHING BASIC STATISTICS
  • 30. Properties of a Median may not be an actual observation in the data set can be applied in at least ordinal level a positional measure; not affected by extreme values Session 1. TEACHING BASIC STATISTICS 0 1 2 3 4 5 6 7 8 9 10 0 1 2 3 4 5 6 7 8 9 10 12 14 Median = 5
  • 31. Mode occurs most frequently nominal average computation of the mode for ungrouped or raw data Session 1. TEACHING BASIC STATISTICS 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 Mode = 9 0 1 2 3 4 5 6 No Mode
  • 32. Properties of a Mode can be used for qualitative as well as quantitative data may not be unique not affected by extreme values may not exist Session 1. TEACHING BASIC STATISTICS
  • 33. Mean, Median & Mode Use the mean when: sampling stability is desired other measures are to be computed Session 1. TEACHING BASIC STATISTICS
  • 34. Mean, Median & Mode Use the median when: the exact midpoint of the distribution is desired there are extreme observations Session 1. TEACHING BASIC STATISTICS
  • 35. Mean, Median & Mode Use the mode when: when the &quot;typical&quot; value is desired when the dataset is measured on a nominal scale Session 1. TEACHING BASIC STATISTICS
  • 36. Percentiles Numerical measures that give the relative position of a data value relative to the entire data set. Divide an array ( raw data arranged in increasing or decreasing order of magnitude ) into 100 equal parts. The j th percentile, denoted as P j , is the data value in the the data set that separates the bottom j % of the data from the top (100- j )%. Session 1. TEACHING BASIC STATISTICS
  • 37. EXAMPLE Suppose LJ was told that relative to the other scores on a certain test, his score was the 95 th percentile.  This means that 95% of those who took the test had scores less than or equal to LJ’s score, while 5% had scores higher than LJ’s. Session 1. TEACHING BASIC STATISTICS
  • 38. Deciles Divide an array into ten equal parts, each part having ten percent of the distribution of the data values, denoted by D j . The 1 st decile is the 10 th percentile; the 2 nd decile is the 20 th percentile….. Session 1. TEACHING BASIC STATISTICS
  • 39. Quartiles Divide an array into four equal parts, each part having 25% of the distribution of the data values, denoted by Q j . The 1 st quartile is the 25 th percentile; the 2 nd quartile is the 50 th percentile, also the median and the 3 rd quartile is the 75 th percentile. Session 1. TEACHING BASIC STATISTICS
  • 40. Measures of Variation A measure of variation is a single value that is used to describe the spread of the distribution A measure of central tendency alone does not uniquely describe a distribution Session 1. TEACHING BASIC STATISTICS
  • 41. Session 1. TEACHING BASIC STATISTICS Mean = 15.5 s = 3.338 11 12 13 14 15 16 17 18 19 20 21 11 12 13 14 15 16 17 18 19 20 21 Data B Data A Mean = 15.5 s = .9258 11 12 13 14 15 16 17 18 19 20 21 Mean = 15.5 s = 4.57 Data C A look at dispersion…
  • 42. Two Types of Measures of Dispersion Absolute Measures of Dispersion:  Range  Inter-quartile Range  Variance  Standard Deviation Session 1. TEACHING BASIC STATISTICS Relative Measure of Dispersion:  Coefficient of Variation
  • 43. Range (R) Session 1. TEACHING BASIC STATISTICS The difference between the maximum and minimum value in a data set, i.e. R = MAX – MIN Example: Pulse rates of 15 male residents of a certain village 54 58 58 60 62 65 66 71 74 75 77 78 80 82 85 R = 85 - 54 = 31
  • 44. Some Properties of the Range The larger the value of the range, the more dispersed the observations are. It is quick and easy to understand. A rough measure of dispersion. Session 1. TEACHING BASIC STATISTICS
  • 45. Inter-Quartile Range (IQR) Session 1. TEACHING BASIC STATISTICS The difference between the third quartile and first quartile, i.e. IQR = Q 3 – Q 1 Example: Pulse rates of 15 residents of a certain village 54 58 58 60 62 65 66 71 74 75 77 78 80 82 85 IQR = 78 - 60 = 18
  • 46. Some Properties of IQR Reduces the influence of extreme values. Not as easy to calculate as the Range. Session 1. TEACHING BASIC STATISTICS
  • 47. Variance important measure of variation shows variation about the mean Population variance Sample variance Session 1. TEACHING BASIC STATISTICS
  • 48. Standard Deviation (SD) most important measure of variation square root of Variance has the same units as the original data Population SD Sample SD Session 1. TEACHING BASIC STATISTICS
  • 49. Session 1. TEACHING BASIC STATISTICS Data: 10 12 14 15 17 18 18 24 n = 8 Mean =16 Computation of Standard Deviation
  • 50. Session 1. TEACHING BASIC STATISTICS Remarks on Standard Deviation If there is a large amount of variation, then on average, the data values will be far from the mean. Hence, the SD will be large. If there is only a small amount of variation, then on average, the data values will be close to the mean. Hence, the SD will be small.
  • 51. Comparing Standard Deviation Session 1. TEACHING BASIC STATISTICS Mean = 15.5 s = 3.338 11 12 13 14 15 16 17 18 19 20 21 11 12 13 14 15 16 17 18 19 20 21 Data B Data A Mean = 15.5 s = .9258 11 12 13 14 15 16 17 18 19 20 21 Mean = 15.5 s = 4.57 Data C
  • 52. Comparing Standard Deviation Session 1. TEACHING BASIC STATISTICS Example: Team A - Heights of five marathon players in inches 65” 65 “ 65 “ 65 “ 65 “ 65 “ Mean = 65 S = 0
  • 53. Comparing Standard Deviation Session 1. TEACHING BASIC STATISTICS Example: Team B - Heights of five marathon players in inches 62 “ 67 “ 66 “ 70 “ 60 “ Mean = 65” s = 4.0”
  • 54. Properties of Standard Deviation It is the most widely used measure of dispersion. (Chebychev’s Inequality) It is based on all the items and is rigidly defined. It is used to test the reliability of measures calculated from samples. The standard deviation is sensitive to the presence of extreme values. It is not easy to calculate by hand (unlike the range). Session 1. TEACHING BASIC STATISTICS
  • 55. Coefficient of Variation (CV) measure of relative variation usually expressed in percent shows variation relative to mean used to compare 2 or more groups Formula : Session 1. TEACHING BASIC STATISTICS
  • 56. Comparing CVs Stock A: Average Price = P50 SD = P5 CV = 10% Stock B: Average Price = P100 SD = P5 CV = 5% Session 1. TEACHING BASIC STATISTICS
  • 57. Measure of Skewness Describes the degree of departures of the distribution of the data from symmetry. The degree of skewness is measured by the coefficient of skewness, denoted as SK and computed as, Session 1. TEACHING BASIC STATISTICS
  • 58. What is Symmetry? A distribution is said to be symmetric about the mean, if the distribution to the left of mean is the “mirror image” of the distribution to the right of the mean. Likewise, a symmetric distribution has SK=0 since its mean is equal to its median and its mode. Session 1. TEACHING BASIC STATISTICS
  • 59. positively skewed Measure of Skewness negatively skewed Session 1. TEACHING BASIC STATISTICS
  • 60. Measure of Kurtosis Describes the extent of peakedness or flatness of the distribution of the data. Measured by coefficient of kurtosis ( K ) computed as, Session 1. TEACHING BASIC STATISTICS
  • 61. Measure of Kurtosis Session 1. TEACHING BASIC STATISTICS K = 0 mesokurtic K > 0 leptokurtic K < 0 platykurtic