SlideShare a Scribd company logo
Further Mathematics
Displaying Bivariate Data



              K McMullen 2012
Displaying Bivariate Data
Bivariate Data: data with two variables (two
quantities or qualities that change)

Generally one variable depends on the other
   The dependent variable depends on the
   independent variable
   Eg. Height and Weight
   Eg. Hours studied and test result
   Tend to focus more on dependent and
   independent variables when plotting scatterplots


                                        K McMullen 2012
Displaying Bivariate Data
Back-to-back stem plots: are used to display the
relationship between a numerical variable and a
two-valued categorical variable

They are used to compare data sets using
summary statistics such as measures of centre
and measures of spread

Eg. Comparing Further Maths study scores
(numerical variable) with gender (male or female-
two-valued categorical variable)


                                     K McMullen 2012
Displaying Bivariate Data
Parallel box plots: are used to display the relationship
between a numerical variable and a categorical
variable with two or more categories

They are used to compare sets of data using
summary statistics such as measures of centre and
measures of spread- also think of the 5 number
summary

Remember that parallel box plots must be placed on
the same axis (you can also do this on CAS)

Eg. The results achieved by 4 different further maths
classes
                                          K McMullen 2012
Displaying Bivariate Data
Two-way frequency tables: are used to display
the relationship between two categorical
variables and can be represented graphically as
a segmented bar chart
Remember that it is easier to compare data sets
if you are working with percentages instead of
totals
In a frequency table you should place your
independent variable along the top row and your
dependent variable along the left column (this will
mean that all your columns must add to 100% if
done correctly)
                                      K McMullen 2012
Displaying Bivariate Data
Scatterplots: are used to display the relationship
(correlation) between two numerical variables

The dependent variable is displayed on the vertical
axis

The independent variable is displayed on the
horizontal axis

The relationship between variables on a scatterplot
can be described in terms of:
   Strength (strong, moderate, weak)
   Direction (positive, negative)
   Form (linear, non-linear)
                                          K McMullen 2012
Displaying Bivariate Data
Scatterplots- continued
   Pearson’s product-moment correlation coefficient (r)
   is used to measure the strength of the scatterplot
   The values of r range between -1 (perfect negative)
   to 1 (perfect positive)
   You can approximate the value of r (look at formula
   on p. 101) but you can also calculate it using CAS
   (obviously more reliable)
   To interpret r look and copy the table on page 100 of
   your textbook


                                            K McMullen 2012
Displaying Bivariate Data
Scatterplots- continued

•    The coefficient of determination (r2): this provides information about
     the degree to which one variable can be predicted from another
     variable provided that the variables have a linear correlation

•    The coefficient of determination is calculated by squaring the
     correlation coefficient (r)

•    When commenting using r2 always convert your value into a
     percentage

•    Comments

“The coefficient of determination tells us that rr% of the variation in the
dependent variable is explained by the variation in the independent
variable”



                                                             K McMullen 2012
Displaying Bivariate Data
•   You must remember the difference between
    correlation and causation
•   To interpret your scatterplot you must stick to the
    variables given and don’t make any unnecessary
    assumptions
•   If your scatterplot is negative then: “As IV
    increases the DV decreases)
•   If your scatterplot is positive then: “As IV
    increases the DV increases)


                                            K McMullen 2012
Displaying Bivariate Data
Example: Age and arm span of teenage boys
   Comment: As the age of teenage boys increases
   the length of their arm span also increases
   Assumption: As teenage boys get taller their arm
   span increases

   Obviously they get taller but height is not a
   variable and therefore you should not comment
   on it




                                       K McMullen 2012
Displaying Bivariate Data
Eg. The number of cigarettes smoked and fitness
level
   Comment: As the number of cigarettes increase
   the fitness level of participants decreased
   Assumption: Smoking cigarettes causes fitness
   levels to decrease

   You must remember that there can be other
   factors the can account for low levels of fitness
   such as lack of exercise or weight etc


                                          K McMullen 2012
Displaying Bivariate Data
Eg. People catching public transport and the sales of
designer handbags
   Comment: As the number of people catching public
   transport increase the number of people buying
   designer handbags decreases
   Assumption: A high proportion of people catching
   public transport has caused a decline in the sales of
   designer handbags

   These two variables are clearly unrelated even though
   there can be some correlation. You need to always
   question the validity of stats- what else could have
   caused public transport use to increase and designer
   handbags sales to decrease?

                                            K McMullen 2012
Displaying Bivariate Data
Work through Ch 4 questions and chapter review




                                   K McMullen 2012

More Related Content

PDF
PPTX
Alg II 2-5 Linear Models
PPTX
Further3 summarising univariate data
PPT
Multiple regression presentation
PPTX
Regression vs correlation and causation
PPT
Week 1 lecture b how do economists work
PPTX
Correlation
PPT
Population 3.5 - Spearman’S Rank
Alg II 2-5 Linear Models
Further3 summarising univariate data
Multiple regression presentation
Regression vs correlation and causation
Week 1 lecture b how do economists work
Correlation
Population 3.5 - Spearman’S Rank

What's hot (17)

PPT
Regression and Co-Relation
PPTX
Correlation and regression
PPTX
Partial correlation
PPTX
Correlation
PPTX
Correlation and regression
PDF
Correlation analysis
PPTX
Multiple Linear Regression
PPTX
Correlation and partial correlation
PPT
Correlation - Biostatistics
PPT
Statistical Relationships
PPTX
Chapter 3.1
PPTX
Measures of relationships
PPTX
Covariance vs Correlation
PPTX
Regression
PPT
Correlation
PPT
Consumer Spending causing unemployment analysis
PPT
Spearman Rank Correlation Presentation
Regression and Co-Relation
Correlation and regression
Partial correlation
Correlation
Correlation and regression
Correlation analysis
Multiple Linear Regression
Correlation and partial correlation
Correlation - Biostatistics
Statistical Relationships
Chapter 3.1
Measures of relationships
Covariance vs Correlation
Regression
Correlation
Consumer Spending causing unemployment analysis
Spearman Rank Correlation Presentation
Ad

Viewers also liked (20)

PPTX
Sampling For Multivariate Data Analysis
PPT
Sampling presentation
PPT
Stat lesson 5.1 probability distributions
PPT
Chapter05
PPTX
Statistics
PPTX
Chapter 2: Collection of Data
PPTX
Absolute Measures of dispersion
PPT
Chapter 06
PPT
PPT
Rm 6 Sampling Design
PPTX
Bayes Theorem
PPTX
Probability basics and bayes' theorem
PDF
Measure of dispersion part I (Range, Quartile Deviation, Interquartile devi...
PPT
Index Number
PDF
Correlation and Simple Regression
PPT
Statistical ppt
PPSX
Introduction to statistics...ppt rahul
PPTX
Sample and sampling techniques
PPT
Introduction To Statistics
PPT
Sampling Design
Sampling For Multivariate Data Analysis
Sampling presentation
Stat lesson 5.1 probability distributions
Chapter05
Statistics
Chapter 2: Collection of Data
Absolute Measures of dispersion
Chapter 06
Rm 6 Sampling Design
Bayes Theorem
Probability basics and bayes' theorem
Measure of dispersion part I (Range, Quartile Deviation, Interquartile devi...
Index Number
Correlation and Simple Regression
Statistical ppt
Introduction to statistics...ppt rahul
Sample and sampling techniques
Introduction To Statistics
Sampling Design
Ad

Similar to Further6 displaying bivariate data (20)

PPT
Frequency Tables - Statistics
PPTX
Most of the players scored 40+ runs in a match and only a few
PDF
Maths A - Chapter 11
PDF
An intro to applied multi stat with r by everitt et al
PDF
An introductiontoappliedmultivariateanalysiswithr everit
PDF
Dr. Shivu___Machine Learning_Module 2pdf
PDF
Deepak_DAI101_Data_Anal_lecture6 (1).pdf
PPTX
Bus 173_4.pptx
PPT
Econometrics
PDF
the didactic material of Statistics II .pdf
PPTX
MODULE-2.pptx machine learning notes for vtu 6th sem cse
PPTX
MODULE-3edited.pptx machine learning modulk
PPTX
C11BD 22-23 data ana-Exploration II.pptx
PPTX
QR II Lect 15 (Bivariate analysis and scatter plot, correlation).pptx
PPTX
Advanced Excel, Day 4
PPTX
1133629601 400440
PPTX
Basic Statistics in 1 hour.pptx
PPTX
Chap2-Data.pptx. It is all about data in data mining.
PPTX
UNIT IV.pptx
Frequency Tables - Statistics
Most of the players scored 40+ runs in a match and only a few
Maths A - Chapter 11
An intro to applied multi stat with r by everitt et al
An introductiontoappliedmultivariateanalysiswithr everit
Dr. Shivu___Machine Learning_Module 2pdf
Deepak_DAI101_Data_Anal_lecture6 (1).pdf
Bus 173_4.pptx
Econometrics
the didactic material of Statistics II .pdf
MODULE-2.pptx machine learning notes for vtu 6th sem cse
MODULE-3edited.pptx machine learning modulk
C11BD 22-23 data ana-Exploration II.pptx
QR II Lect 15 (Bivariate analysis and scatter plot, correlation).pptx
Advanced Excel, Day 4
1133629601 400440
Basic Statistics in 1 hour.pptx
Chap2-Data.pptx. It is all about data in data mining.
UNIT IV.pptx

More from kmcmullen (16)

PPTX
Further 9 similar triangles
PPTX
Methods8 trigonometric functions
PPTX
Further8 data transformation
PPTX
Methods7 exponential equations
PPTX
Further7 regression analysis
PPTX
Methods6 modulus function
PPTX
Further5 normal distribution
PPTX
Methods5 transformations
PPTX
Types of functions2
PPTX
Methods3 types of functions1
PPTX
Further4 box plots, 5 number summary and outliers
PPTX
Further1 types of data
PPTX
Further2 displaying univariate data
PPTX
Methods2 polynomial functions
PPTX
Methods1 relations and functions
KEY
Methods1 relations and functions
Further 9 similar triangles
Methods8 trigonometric functions
Further8 data transformation
Methods7 exponential equations
Further7 regression analysis
Methods6 modulus function
Further5 normal distribution
Methods5 transformations
Types of functions2
Methods3 types of functions1
Further4 box plots, 5 number summary and outliers
Further1 types of data
Further2 displaying univariate data
Methods2 polynomial functions
Methods1 relations and functions
Methods1 relations and functions

Recently uploaded (20)

PPTX
Microbial diseases, their pathogenesis and prophylaxis
PPTX
PPT- ENG7_QUARTER1_LESSON1_WEEK1. IMAGERY -DESCRIPTIONS pptx.pptx
PPTX
Introduction_to_Human_Anatomy_and_Physiology_for_B.Pharm.pptx
PDF
STATICS OF THE RIGID BODIES Hibbelers.pdf
PPTX
IMMUNITY IMMUNITY refers to protection against infection, and the immune syst...
PDF
Basic Mud Logging Guide for educational purpose
PDF
01-Introduction-to-Information-Management.pdf
PPTX
Cell Structure & Organelles in detailed.
PPTX
school management -TNTEU- B.Ed., Semester II Unit 1.pptx
PDF
The Lost Whites of Pakistan by Jahanzaib Mughal.pdf
PDF
Module 4: Burden of Disease Tutorial Slides S2 2025
PDF
O5-L3 Freight Transport Ops (International) V1.pdf
PPTX
GDM (1) (1).pptx small presentation for students
PDF
RMMM.pdf make it easy to upload and study
PDF
Sports Quiz easy sports quiz sports quiz
PPTX
Final Presentation General Medicine 03-08-2024.pptx
PDF
ANTIBIOTICS.pptx.pdf………………… xxxxxxxxxxxxx
PPTX
Renaissance Architecture: A Journey from Faith to Humanism
PDF
Physiotherapy_for_Respiratory_and_Cardiac_Problems WEBBER.pdf
PDF
Chapter 2 Heredity, Prenatal Development, and Birth.pdf
Microbial diseases, their pathogenesis and prophylaxis
PPT- ENG7_QUARTER1_LESSON1_WEEK1. IMAGERY -DESCRIPTIONS pptx.pptx
Introduction_to_Human_Anatomy_and_Physiology_for_B.Pharm.pptx
STATICS OF THE RIGID BODIES Hibbelers.pdf
IMMUNITY IMMUNITY refers to protection against infection, and the immune syst...
Basic Mud Logging Guide for educational purpose
01-Introduction-to-Information-Management.pdf
Cell Structure & Organelles in detailed.
school management -TNTEU- B.Ed., Semester II Unit 1.pptx
The Lost Whites of Pakistan by Jahanzaib Mughal.pdf
Module 4: Burden of Disease Tutorial Slides S2 2025
O5-L3 Freight Transport Ops (International) V1.pdf
GDM (1) (1).pptx small presentation for students
RMMM.pdf make it easy to upload and study
Sports Quiz easy sports quiz sports quiz
Final Presentation General Medicine 03-08-2024.pptx
ANTIBIOTICS.pptx.pdf………………… xxxxxxxxxxxxx
Renaissance Architecture: A Journey from Faith to Humanism
Physiotherapy_for_Respiratory_and_Cardiac_Problems WEBBER.pdf
Chapter 2 Heredity, Prenatal Development, and Birth.pdf

Further6 displaying bivariate data

  • 2. Displaying Bivariate Data Bivariate Data: data with two variables (two quantities or qualities that change) Generally one variable depends on the other The dependent variable depends on the independent variable Eg. Height and Weight Eg. Hours studied and test result Tend to focus more on dependent and independent variables when plotting scatterplots K McMullen 2012
  • 3. Displaying Bivariate Data Back-to-back stem plots: are used to display the relationship between a numerical variable and a two-valued categorical variable They are used to compare data sets using summary statistics such as measures of centre and measures of spread Eg. Comparing Further Maths study scores (numerical variable) with gender (male or female- two-valued categorical variable) K McMullen 2012
  • 4. Displaying Bivariate Data Parallel box plots: are used to display the relationship between a numerical variable and a categorical variable with two or more categories They are used to compare sets of data using summary statistics such as measures of centre and measures of spread- also think of the 5 number summary Remember that parallel box plots must be placed on the same axis (you can also do this on CAS) Eg. The results achieved by 4 different further maths classes K McMullen 2012
  • 5. Displaying Bivariate Data Two-way frequency tables: are used to display the relationship between two categorical variables and can be represented graphically as a segmented bar chart Remember that it is easier to compare data sets if you are working with percentages instead of totals In a frequency table you should place your independent variable along the top row and your dependent variable along the left column (this will mean that all your columns must add to 100% if done correctly) K McMullen 2012
  • 6. Displaying Bivariate Data Scatterplots: are used to display the relationship (correlation) between two numerical variables The dependent variable is displayed on the vertical axis The independent variable is displayed on the horizontal axis The relationship between variables on a scatterplot can be described in terms of: Strength (strong, moderate, weak) Direction (positive, negative) Form (linear, non-linear) K McMullen 2012
  • 7. Displaying Bivariate Data Scatterplots- continued Pearson’s product-moment correlation coefficient (r) is used to measure the strength of the scatterplot The values of r range between -1 (perfect negative) to 1 (perfect positive) You can approximate the value of r (look at formula on p. 101) but you can also calculate it using CAS (obviously more reliable) To interpret r look and copy the table on page 100 of your textbook K McMullen 2012
  • 8. Displaying Bivariate Data Scatterplots- continued • The coefficient of determination (r2): this provides information about the degree to which one variable can be predicted from another variable provided that the variables have a linear correlation • The coefficient of determination is calculated by squaring the correlation coefficient (r) • When commenting using r2 always convert your value into a percentage • Comments “The coefficient of determination tells us that rr% of the variation in the dependent variable is explained by the variation in the independent variable” K McMullen 2012
  • 9. Displaying Bivariate Data • You must remember the difference between correlation and causation • To interpret your scatterplot you must stick to the variables given and don’t make any unnecessary assumptions • If your scatterplot is negative then: “As IV increases the DV decreases) • If your scatterplot is positive then: “As IV increases the DV increases) K McMullen 2012
  • 10. Displaying Bivariate Data Example: Age and arm span of teenage boys Comment: As the age of teenage boys increases the length of their arm span also increases Assumption: As teenage boys get taller their arm span increases Obviously they get taller but height is not a variable and therefore you should not comment on it K McMullen 2012
  • 11. Displaying Bivariate Data Eg. The number of cigarettes smoked and fitness level Comment: As the number of cigarettes increase the fitness level of participants decreased Assumption: Smoking cigarettes causes fitness levels to decrease You must remember that there can be other factors the can account for low levels of fitness such as lack of exercise or weight etc K McMullen 2012
  • 12. Displaying Bivariate Data Eg. People catching public transport and the sales of designer handbags Comment: As the number of people catching public transport increase the number of people buying designer handbags decreases Assumption: A high proportion of people catching public transport has caused a decline in the sales of designer handbags These two variables are clearly unrelated even though there can be some correlation. You need to always question the validity of stats- what else could have caused public transport use to increase and designer handbags sales to decrease? K McMullen 2012
  • 13. Displaying Bivariate Data Work through Ch 4 questions and chapter review K McMullen 2012