Displaying and describing categorical data

 The three rules of data analysis won’t be difficult to
remember:
1. Make a picture—things may be revealed that are not
obvious in the raw data. These will be things to think
about.
2. Make a picture—important features of and patterns in
the data will show up. You may also see things that
you did not expect.
3. Make a picture—the best way to tell others about your
data is with a well-chosen picture.
The Three Rules of Data Analysis

 We can “pile” the data by counting the number of
data values in each category of interest.
 We can organize these counts into a frequency table,
which records the totals and the category names.
Frequency Tables
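
The "piling" step can be sketched in a few lines of Python. This is only an illustration: the list of labels below is hypothetical and does not come from the slides, and `frequency_table` is a name chosen here for convenience.

```python
from collections import Counter

# Hypothetical raw data: one category label per case (not from the slides).
ticket_classes = ["Crew", "Third", "First", "Crew", "Second", "Third", "Crew", "Third"]

def frequency_table(values):
    """Count how many data values fall in each category."""
    return dict(Counter(values))

table = frequency_table(ticket_classes)

# Print the category names, their counts, and the overall total.
for category, count in sorted(table.items()):
    print(f"{category:<8}{count:>4}")
print(f"{'Total':<8}{sum(table.values()):>4}")
```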

 A relative frequency table is similar, but gives the
percentages (instead of counts) for each category.
 Cumulative relative frequency: the sum of the relative
frequencies for a category and all previous categories.
Relative Frequency Tables

 Both types of tables show how cases are distributed across
the categories.
 They describe the distribution of a categorical variable
because they name the possible categories and tell how
frequently each occurs.
 To calculate the relative frequency:
 Divide the count by the total number of cases
 Multiply by 100 to express as a percentage
 To calculate the cumulative relative frequency:
 Add the current category’s relative frequency to the sum of
all previous relative frequencies
Creating Frequency Tables
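
The two calculations above might look like this in Python. The counts are an assumption: they are the commonly cited Titanic totals by ticket class, which the slides themselves do not list. The function names are likewise chosen here for illustration.

```python
from itertools import accumulate

# Assumed counts (commonly cited Titanic totals; not given in the slides).
counts = {"First": 325, "Second": 285, "Third": 706, "Crew": 885}

def relative_frequencies(counts):
    """Divide each count by the total number of cases, then multiply by 100."""
    total = sum(counts.values())
    return {cat: 100 * n / total for cat, n in counts.items()}

def cumulative_relative_frequencies(rel):
    """Running sum: each category's value plus all previous categories' values."""
    return dict(zip(rel, accumulate(rel.values())))

rel = relative_frequencies(counts)
cum = cumulative_relative_frequencies(rel)

# Columns: category, count, relative frequency, cumulative relative frequency.
for cat in counts:
    print(f"{cat:<8}{counts[cat]:>6}{rel[cat]:>8.1f}%{cum[cat]:>8.1f}%")
```

The relative frequencies should sum to 100%, so the last cumulative value is a quick check on the arithmetic. The running total depends on the order in which the categories are listed.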

 You might think that
a good way to show
the Titanic data is
with this display. Is it?
What’s Wrong With This Picture?

 The ship display makes it look like most of the
people on the Titanic were crew members, with a few
passengers along for the ride.
 When we look at each ship, we see the area taken up
by the ship, instead of the length of the ship.
 The ship display violates the area principle:
 The area occupied by a part of the graph should
correspond to the magnitude of the value it
represents.
The Area Principle
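
A small calculation suggests why picture-based displays can mislead. It assumes, as a simplification, that each picture is scaled in both length and height in proportion to its value; the slides only say that the eye reads area rather than length. The values are hypothetical.

```python
def pictogram_area_ratio(value_a, value_b):
    """Area ratio when a picture's length AND height both scale with the value."""
    scale = value_a / value_b
    return scale ** 2

def bar_area_ratio(value_a, value_b):
    """Area ratio for bars of equal width, where only the length scales."""
    return value_a / value_b

# Hypothetical values: one category is twice as large as the other.
print(pictogram_area_ratio(200, 100))  # picture looks 4 times as large
print(bar_area_ratio(200, 100))        # bar looks 2 times as large
```

Under that assumption, a value twice as large occupies four times the area, which overstates the difference. A bar of fixed width keeps area proportional to the value.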

 A bar chart displays the distribution of a categorical
variable, showing the counts for each category next to
each other for easy comparison.
 A bar chart stays true
to the area principle.
 Thus, a better display
for the ship data is:
 Bar charts have spaces
between the categories,
while histograms do not.
Bar Charts
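
As a rough sketch, a bar chart can be drawn in plain text so that every bar has the same width and a length proportional to its count. The counts are assumed (the commonly cited Titanic totals, which the slides do not list), and `text_bar_chart` is a hypothetical helper, not a library function.

```python
def text_bar_chart(counts, width=40, mark="#"):
    """Return one text line per category; bar length is proportional to the count."""
    largest = max(counts.values())
    lines = []
    for category, count in counts.items():
        bar = mark * round(width * count / largest)
        lines.append(f"{category:<8}|{bar} {count}")
    return lines

# Assumed counts (commonly cited Titanic totals; not given in the slides).
counts = {"First": 325, "Second": 285, "Third": 706, "Crew": 885}

print("\n".join(text_bar_chart(counts)))
```

Each category sits on its own line, which mirrors the spaces between bars in a drawn bar chart.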

 A relative frequency bar chart displays the proportion
(or percentage) of cases in each category.
 A relative frequency bar chart also stays true to the
area principle.
 Replacing counts
with percentages
in the ship data:
Bar Charts (cont.)
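
Replacing counts with percentages rescales every bar by the same factor, so the shape of the chart should not change. The sketch below checks this with assumed counts (the commonly cited Titanic totals, not listed in the slides); `bar_lengths` is a hypothetical helper.

```python
def bar_lengths(values, width=40):
    """Bar length in characters for each category, scaled so the largest bar fills `width`."""
    largest = max(values.values())
    return {cat: round(width * v / largest) for cat, v in values.items()}

# Assumed counts (commonly cited Titanic totals; not given in the slides).
counts = {"First": 325, "Second": 285, "Third": 706, "Crew": 885}
total = sum(counts.values())
percentages = {cat: 100 * n / total for cat, n in counts.items()}

print(bar_lengths(counts))
print(bar_lengths(percentages))
```

Only the axis labels differ between the two charts: one reads in counts, the other in percentages.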
