Chap 2-1
Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc.
Chapter 2
Describing Data: Graphical
Statistics for
Business and Economics
6th Edition
Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-2
Chapter Goals
After completing this chapter, you should be able to:
 Identify types of data and levels of measurement
 Create and interpret graphs to describe categorical variables:
 frequency distribution, bar chart, pie chart, Pareto diagram
 Create a line chart to describe time-series data
 Create and interpret graphs to describe numerical variables:
 frequency distribution, histogram, ogive, stem-and-leaf display
 Construct and interpret graphs to describe relationships between
variables:
 Scatter plot, cross table
 Describe appropriate and inappropriate ways to display data
graphically
Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-3
Types of Data
Data
Categorical Numerical
Discrete Continuous
Examples:
 Marital Status
 Are you registered to
vote?
 Eye Color
(Defined categories or
groups)
Examples:
 Number of Children
 Defects per hour
(Counted items)
Examples:
 Weight
 Voltage
(Measured characteristics)
Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-4
Measurement Levels
Interval Data
Ordinal Data
Nominal Data
Quantitative Data
Qualitative Data
Categories (no
ordering or direction)
Ordered Categories
(rankings, order, or
scaling)
Differences between
measurements but no
true zero
Ratio Data
Differences between
measurements, true
zero exists
Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-5
Graphical
Presentation of Data
 Data in raw form are usually not easy to use
for decision making
 Some type of organization is needed
 Table
 Graph
 The type of graph to use depends on the
variable being summarized
Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-6
Graphical
Presentation of Data
 Techniques reviewed in this chapter:
Categorical
Variables
Numerical
Variables
• Frequency distribution
• Bar chart
• Pie chart
• Pareto diagram
• Line chart
• Frequency distribution
• Histogram and ogive
• Stem-and-leaf display
• Scatter plot
(continued)
Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-7
Tables and Graphs for
Categorical Variables
Categorical
Data
Graphing Data
Pie
Chart
Pareto
Diagram
Bar
Chart
Frequency
Distribution
Table
Tabulating Data
Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-8
The Frequency
Distribution Table
Example: Hospital Patients by Unit
Hospital Unit Number of Patients
Cardiac Care 1,052
Emergency 2,245
Intensive Care 340
Maternity 552
Surgery 4,630
(Variables are
categorical)
Summarize data by category
Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-9
Bar and Pie Charts
 Bar charts and Pie charts are often used
for qualitative (category) data
 Height of bar or size of pie slice shows the
frequency or percentage for each
category
Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-10
Bar Chart Example
Hospital Patients by Unit
0
1000
2000
3000
4000
5000
Cardiac
Care
Emergency
Intensive
Care
Maternity
Surgery
Number
of
patients
per
year
Hospital Number
Unit of Patients
Cardiac Care 1,052
Emergency 2,245
Intensive Care 340
Maternity 552
Surgery 4,630
Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-11
Hospital Patients by Unit
Emergency
25%
Maternity
6%
Surgery
53%
Cardiac Care
12%
Intensive Care
4%
Pie Chart Example
(Percentages
are rounded to
the nearest
percent)
Hospital Number % of
Unit of Patients Total
Cardiac Care 1,052 11.93
Emergency 2,245 25.46
Intensive Care 340 3.86
Maternity 552 6.26
Surgery 4,630 52.50
Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-12
Pareto Diagram
 Used to portray categorical data
 A bar chart, where categories are shown in
descending order of frequency
 A cumulative polygon is often shown in the
same graph
 Used to separate the “vital few” from the “trivial
many”
Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-13
Example: 400 defective items are examined
for cause of defect:
Source of
Manufacturing Error Number of defects
Bad Weld 34
Poor Alignment 223
Missing Part 25
Paint Flaw 78
Electrical Short 19
Cracked case 21
Total 400
Pareto Diagram Example
Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-14
Step 1: Sort by defect cause, in descending order
Step 2: Determine % in each category
Source of
Manufacturing Error Number of defects % of Total Defects
Poor Alignment 223 55.75
Paint Flaw 78 19.50
Bad Weld 34 8.50
Missing Part 25 6.25
Cracked case 21 5.25
Electrical Short 19 4.75
Total 400 100%
Pareto Diagram Example
(continued)
Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-15
Pareto Diagram Example
cumulative
%
(line
graph)
%
of
defects
in
each
category
(bar
graph)
Pareto Diagram: Cause of Manufacturing Defect
0%
10%
20%
30%
40%
50%
60%
Poor Alignment Paint Flaw Bad Weld Missing Part Cracked case Electrical Short
0%
10%
20%
30%
40%
50%
60%
70%
80%
90%
100%
Step 3: Show results graphically
(continued)
Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-16
Graphs for Time-Series Data
 A line chart (time-series plot) is used to show
the values of a variable over time
 Time is measured on the horizontal axis
 The variable of interest is measured on the
vertical axis
Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-17
Line Chart Example
Magazine Subscriptions by Year
0
50
100
150
200
250
300
350
1990
1991
1992
1993
1994
1995
1996
1997
1998
1999
2000
2001
2002
2003
2004
2005
2006
Thousands
of
subscribers
Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-18
Numerical Data
Stem-and-Leaf
Display
Histogram Ogive
Frequency Distributions
and
Cumulative Distributions
Graphs to Describe
Numerical Variables
Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-19
What is a Frequency Distribution?
 A frequency distribution is a list or a table …
 containing class groupings (categories or
ranges within which the data fall) ...
 and the corresponding frequencies with which
data fall within each class or category
Frequency Distributions
Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-20
Why Use Frequency Distributions?
 A frequency distribution is a way to
summarize data
 The distribution condenses the raw data
into a more useful form...
 and allows for a quick visual interpretation
of the data
Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-21
Class Intervals
and Class Boundaries
 Each class grouping has the same width
 Determine the width of each interval by
 Use at least 5 but no more than 15-20 intervals
 Intervals never overlap
 Round up the interval width to get desirable
interval endpoints
intervals
desired
of
number
number
smallest
number
largest
width
interval
w



Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-22
Frequency Distribution Example
Example: A manufacturer of insulation randomly
selects 20 winter days and records the daily
high temperature
24, 35, 17, 21, 24, 37, 26, 46, 58, 30,
32, 13, 12, 38, 41, 43, 44, 27, 53, 27
Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-23
 Sort raw data in ascending order:
12, 13, 17, 21, 24, 24, 26, 27, 27, 30, 32, 35, 37, 38, 41, 43, 44, 46, 53, 58
 Find range: 58 - 12 = 46
 Select number of classes: 5 (usually between 5 and 15)
 Compute interval width: 10 (46/5 then round up)
 Determine interval boundaries: 10 but less than 20, 20 but
less than 30, . . . , 60 but less than 70
 Count observations & assign to classes
Frequency Distribution Example
(continued)
Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-24
Frequency Distribution Example
Interval Frequency
10 but less than 20 3 .15 15
20 but less than 30 6 .30 30
30 but less than 40 5 .25 25
40 but less than 50 4 .20 20
50 but less than 60 2 .10 10
Total 20 1.00 100
Relative
Frequency Percentage
Data in ordered array:
12, 13, 17, 21, 24, 24, 26, 27, 27, 30, 32, 35, 37, 38, 41, 43, 44, 46, 53, 58
(continued)
Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-25
Histogram
 A graph of the data in a frequency distribution
is called a histogram
 The interval endpoints are shown on the
horizontal axis
 the vertical axis is either frequency, relative
frequency, or percentage
 Bars of the appropriate heights are used to
represent the number of observations within
each class
Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-26
Histogram: Daily High Temperature
0
3
6
5
4
2
0
0
1
2
3
4
5
6
7
0 10 20 30 40 50 60
Frequency
Temperature in Degrees
Histogram Example
(No gaps
between
bars)
Interval
10 but less than 20 3
20 but less than 30 6
30 but less than 40 5
40 but less than 50 4
50 but less than 60 2
Frequency
0 10 20 30 40 50 60 70
Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-27
Histograms in Excel
Select
Tools/Data Analysis
1
Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-28
Choose Histogram
2
3
Input data range and bin
range (bin range is a cell
range containing the upper
interval endpoints for each class
grouping)
Select Chart Output
and click “OK”
Histograms in Excel
(continued)
(
Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-29
Questions for Grouping Data
into Intervals
 1. How wide should each interval be?
(How many classes should be used?)
 2. How should the endpoints of the
intervals be determined?
 Often answered by trial and error, subject to
user judgment
 The goal is to create a distribution that is
neither too "jagged" nor too "blocky”
 Goal is to appropriately show the pattern of
variation in the data
Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-30
How Many Class Intervals?
 Many (Narrow class intervals)
 may yield a very jagged distribution
with gaps from empty classes
 Can give a poor indication of how
frequency varies across classes
 Few (Wide class intervals)
 may compress variation too much and
yield a blocky distribution
 can obscure important patterns of
variation. 0
2
4
6
8
10
12
0 30 60 More
Temperature
Frequency
0
0.5
1
1.5
2
2.5
3
3.5
4
8
12
16
20
24
28
32
36
40
44
48
52
56
60
More
Temperature
Frequency
(X axis labels are upper class endpoints)
Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-31
The Cumulative
Frequency Distribuiton
Class
10 but less than 20 3 15 3 15
20 but less than 30 6 30 9 45
30 but less than 40 5 25 14 70
40 but less than 50 4 20 18 90
50 but less than 60 2 10 20 100
Total 20 100
Percentage
Cumulative
Percentage
Data in ordered array:
12, 13, 17, 21, 24, 24, 26, 27, 27, 30, 32, 35, 37, 38, 41, 43, 44, 46, 53, 58
Frequency
Cumulative
Frequency
Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-32
The Ogive
Graphing Cumulative Frequencies
Ogive: Daily High Temperature
0
20
40
60
80
100
10 20 30 40 50 60
Cumulative
Percentage
Interval endpoints
Interval
Less than 10 10 0
10 but less than 20 20 15
20 but less than 30 30 45
30 but less than 40 40 70
40 but less than 50 50 90
50 but less than 60 60 100
Cumulative
Percentage
Upper
interval
endpoint
Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-33
Distribution Shape
 The shape of the distribution is said to be
symmetric if the observations are balanced,
or evenly distributed, about the center.
Symmetric Distribution
0
1
2
3
4
5
6
7
8
9
10
1 2 3 4 5 6 7 8 9
Frequency
Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-34
Distribution Shape
 The shape of the distribution is said to be
skewed if the observations are not
symmetrically distributed around the center.
(continued)
Positively Skewed Distribution
0
2
4
6
8
10
12
1 2 3 4 5 6 7 8 9
Frequency
Negatively Skewed Distribution
0
2
4
6
8
10
12
1 2 3 4 5 6 7 8 9
Frequency
A positively skewed distribution
(skewed to the right) has a tail that
extends to the right in the direction of
positive values.
A negatively skewed distribution
(skewed to the left) has a tail that
extends to the left in the direction of
negative values.
Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-35
Stem-and-Leaf Diagram
 A simple way to see distribution details in a
data set
METHOD: Separate the sorted data series
into leading digits (the stem) and
the trailing digits (the leaves)
Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-36
Example
 Here, use the 10’s digit for the stem unit:
Data in ordered array:
21, 24, 24, 26, 27, 27, 30, 32, 38, 41
 21 is shown as
 38 is shown as
Stem Leaf
2 1
3 8
Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-37
Example
 Completed stem-and-leaf diagram:
Stem Leaves
2 1 4 4 6 7 7
3 0 2 8
4 1
(continued)
Data in ordered array:
21, 24, 24, 26, 27, 27, 30, 32, 38, 41
Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-38
Using other stem units
 Using the 100’s digit as the stem:
 Round off the 10’s digit to form the leaves
 613 would become 6 1
 776 would become 7 8
 . . .
 1224 becomes 12 2
Stem Leaf
Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-39
Using other stem units
 Using the 100’s digit as the stem:
 The completed stem-and-leaf display:
Stem Leaves
(continued)
6 1 3 6
7 2 2 5 8
8 3 4 6 6 9 9
9 1 3 3 6 8
10 3 5 6
11 4 7
12 2
Data:
613, 632, 658, 717,
722, 750, 776, 827,
841, 859, 863, 891,
894, 906, 928, 933,
955, 982, 1034,
1047,1056, 1140,
1169, 1224
Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-40
Relationships Between Variables
 Graphs illustrated so far have involved only a
single variable
 When two variables exist other techniques are
used:
Categorical
(Qualitative)
Variables
Numerical
(Quantitative)
Variables
Cross tables Scatter plots
Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-41
 Scatter Diagrams are used for paired
observations taken from two
numerical variables
 The Scatter Diagram:
 one variable is measured on the vertical
axis and the other variable is measured
on the horizontal axis
Scatter Diagrams
Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-42
Scatter Diagram Example
Cost per Dayvs. Production Volume
0
50
100
150
200
250
0 10 20 30 40 50 60 70
Volume per Day
Cost
per
Day
Volume
per day
Cost per
day
23 125
26 140
29 146
33 160
38 167
42 170
50 188
55 195
60 200
Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-43
Scatter Diagrams in Excel
Select the chart wizard
1
2
Select XY(Scatter) option,
then click “Next”
When prompted, enter the
data range, desired
legend, and desired
destination to complete
the scatter diagram
3
Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-44
Cross Tables
 Cross Tables (or contingency tables) list the
number of observations for every combination
of values for two categorical or ordinal
variables
 If there are r categories for the first variable
(rows) and c categories for the second
variable (columns), the table is called an r x c
cross table
Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-45
Cross Table Example
 4 x 3 Cross Table for Investment Choices by Investor
(values in $1000’s)
Investment Investor A Investor B Investor C Total
Category
Stocks 46.5 55 27.5 129
Bonds 32.0 44 19.0 95
CD 15.5 20 13.5 49
Savings 16.0 28 7.0 51
Total 110.0 147 67.0 324
Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-46
 Side by side bar charts
(continued)
Graphing
Multivariate Categorical Data
C omparing Investors
0 10 20 30 40 50 60
S toc k s
B onds
CD
S avings
Inves tor A Inves tor B Inves tor C
Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-47
Side-by-Side Chart Example
 Sales by quarter for three sales territories:
0
10
20
30
40
50
60
1st Qtr 2nd Qtr 3rd Qtr 4th Qtr
East
West
North
1st Qtr 2nd Qtr 3rd Qtr 4th Qtr
East 20.4 27.4 59 20.4
West 30.6 38.6 34.6 31.6
North 45.9 46.9 45 43.9
Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-48
Data Presentation Errors
Goals for effective data presentation:
 Present data to display essential information
 Communicate complex ideas clearly and
accurately
 Avoid distortion that might convey the wrong
message
Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-49
 Unequal histogram interval widths
 Compressing or distorting the
vertical axis
 Providing no zero point on the
vertical axis
 Failing to provide a relative basis
in comparing data between
groups
Data Presentation Errors
(continued)
Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-50
Chapter Summary
 Reviewed types of data and measurement levels
 Data in raw form are usually not easy to use for decision
making -- Some type of organization is needed:
 Table  Graph
 Techniques reviewed in this chapter:
 Frequency distribution
 Bar chart
 Pie chart
 Pareto diagram
 Line chart
 Frequency distribution
 Histogram and ogive
 Stem-and-leaf display
 Scatter plot
 Cross tables and
side-by-side bar charts

More Related Content

PPTX
5.7 poisson regression in the analysis of cohort data
 
PDF
A Study on Banking Information System.
PPTX
PPT
Descriptive statistics
PPTX
Introduction to Statistics
PPT
Introduction to statistics
PDF
Using excel to for descriptive statistics
PDF
Chapter 2 part3-Least-Squares Regression
5.7 poisson regression in the analysis of cohort data
 
A Study on Banking Information System.
Descriptive statistics
Introduction to Statistics
Introduction to statistics
Using excel to for descriptive statistics
Chapter 2 part3-Least-Squares Regression

What's hot (20)

PPTX
Data stewardship
PDF
Chi square test and its types
PPT
Introduction To Statistics
PPT
Transaction processing system
PPTX
Toward an organizational E-readiness Model
PDF
Introduction to Business and Data Analysis Undergraduate.pdf
PPT
Presenting Data in Tables and Charts
PDF
Introduction to Generalized Linear Models
PPT
Regression analysis ppt
PPTX
Business Intelligence and Business Analytics
PDF
Emerging Trends in Data Architecture – What’s the Next Big Thing?
PDF
Introduction to Business Analytics.pdf
PDF
Data visualisation & analytics with Tableau
PPTX
Chapter 6 telecommunications and networks
PDF
Principal Component Analysis and Clustering
PPT
Strategic information system planning
PPT
Probability and statistics
PPTX
Simple understanding of biostatistics
PPTX
Descriptive statistics
PDF
Data Analysis using SPSS: Part 1
Data stewardship
Chi square test and its types
Introduction To Statistics
Transaction processing system
Toward an organizational E-readiness Model
Introduction to Business and Data Analysis Undergraduate.pdf
Presenting Data in Tables and Charts
Introduction to Generalized Linear Models
Regression analysis ppt
Business Intelligence and Business Analytics
Emerging Trends in Data Architecture – What’s the Next Big Thing?
Introduction to Business Analytics.pdf
Data visualisation & analytics with Tableau
Chapter 6 telecommunications and networks
Principal Component Analysis and Clustering
Strategic information system planning
Probability and statistics
Simple understanding of biostatistics
Descriptive statistics
Data Analysis using SPSS: Part 1
Ad

Similar to Newbold_chap02.ppt (20)

PPT
Basic Stat Notes
PPTX
Lecture 1 - Overview.pptx
PPTX
Presentation and-analysis-of-business-data
PPTX
Presentation and-analysis-of-business-data
PPTX
Presentation and-analysis-of-business-data
PPTX
Presentation and-analysis-of-business-data
PPTX
Presentation and analysis of business data
PPTX
Presentation and-analysis-of-business-data
PDF
Chapter 1 - Displaying Descriptive Statistics.pdf
PPTX
Charts and graphs
PPTX
Chap01 describing data; graphical
PDF
2. Descriptive Statistics.pdf
PPTX
Type of data @ Web Mining Discussion
PPTX
2. AAdata presentation edited edited tutor srudents(1).pptx
PPTX
Type of data @ web mining discussion
PPT
Chap002
PPT
Chapter 2_Presentation of Data.ppt mean, median, mode, variance
PPTX
2.3 Graphs that enlighten and graphs that deceive
PPTX
lecture 4 Graphs.pptx
PPT
Stat11t chapter2
Basic Stat Notes
Lecture 1 - Overview.pptx
Presentation and-analysis-of-business-data
Presentation and-analysis-of-business-data
Presentation and-analysis-of-business-data
Presentation and-analysis-of-business-data
Presentation and analysis of business data
Presentation and-analysis-of-business-data
Chapter 1 - Displaying Descriptive Statistics.pdf
Charts and graphs
Chap01 describing data; graphical
2. Descriptive Statistics.pdf
Type of data @ Web Mining Discussion
2. AAdata presentation edited edited tutor srudents(1).pptx
Type of data @ web mining discussion
Chap002
Chapter 2_Presentation of Data.ppt mean, median, mode, variance
2.3 Graphs that enlighten and graphs that deceive
lecture 4 Graphs.pptx
Stat11t chapter2
Ad

More from cfisicaster (20)

PDF
slidesWaveRegular.pdf
PDF
WavesNotesAnswers.pdf
PDF
WavesLoading.pdf
PDF
WavesTransformation.pdf
PDF
WavesAppendix.pdf
PDF
WavesLinear.pdf
PDF
WavesExamplesAnswers.pdf
PDF
slidesWaveTransformation.pdf
PDF
WavesStatistics.pdf
PDF
slidesWaveLoading.pdf
PDF
WavesExamples.pdf
PDF
slidesWaveStatistics.pdf
PDF
WavesSchedule.pdf
PDF
Richard I. Levine - Estadistica para administración (2009, Pearson Educación)...
PDF
Mario F. Triola - Estadística (2006, Pearson_Educación) - libgen.li.pdf
PDF
David R. Anderson - Estadistica para administracion y economia (2010) - libge...
PDF
Richard I. Levin, David S. Rubin - Estadística para administradores (2004, Pe...
PDF
N. Schlager - Study Materials for MIT Course [8.02T] - Electricity and Magnet...
PDF
Teruo Matsushita - Electricity and Magnetism_ New Formulation by Introduction...
PDF
Fisica2.pdf
slidesWaveRegular.pdf
WavesNotesAnswers.pdf
WavesLoading.pdf
WavesTransformation.pdf
WavesAppendix.pdf
WavesLinear.pdf
WavesExamplesAnswers.pdf
slidesWaveTransformation.pdf
WavesStatistics.pdf
slidesWaveLoading.pdf
WavesExamples.pdf
slidesWaveStatistics.pdf
WavesSchedule.pdf
Richard I. Levine - Estadistica para administración (2009, Pearson Educación)...
Mario F. Triola - Estadística (2006, Pearson_Educación) - libgen.li.pdf
David R. Anderson - Estadistica para administracion y economia (2010) - libge...
Richard I. Levin, David S. Rubin - Estadística para administradores (2004, Pe...
N. Schlager - Study Materials for MIT Course [8.02T] - Electricity and Magnet...
Teruo Matsushita - Electricity and Magnetism_ New Formulation by Introduction...
Fisica2.pdf

Recently uploaded (20)

PPTX
ELIAS-SEZIURE AND EPilepsy semmioan session.pptx
PDF
Trump Administration's workforce development strategy
PDF
Hazard Identification & Risk Assessment .pdf
PPTX
Unit 4 Computer Architecture Multicore Processor.pptx
DOC
Soft-furnishing-By-Architect-A.F.M.Mohiuddin-Akhand.doc
PPTX
Onco Emergencies - Spinal cord compression Superior vena cava syndrome Febr...
PDF
My India Quiz Book_20210205121199924.pdf
PPTX
Introduction to pro and eukaryotes and differences.pptx
PPTX
Share_Module_2_Power_conflict_and_negotiation.pptx
PDF
FORM 1 BIOLOGY MIND MAPS and their schemes
PDF
Chinmaya Tiranga quiz Grand Finale.pdf
PPTX
202450812 BayCHI UCSC-SV 20250812 v17.pptx
PDF
International_Financial_Reporting_Standa.pdf
PDF
Vision Prelims GS PYQ Analysis 2011-2022 www.upscpdf.com.pdf
PPTX
Virtual and Augmented Reality in Current Scenario
PDF
CISA (Certified Information Systems Auditor) Domain-Wise Summary.pdf
PDF
medical_surgical_nursing_10th_edition_ignatavicius_TEST_BANK_pdf.pdf
PPTX
B.Sc. DS Unit 2 Software Engineering.pptx
PDF
Τίμαιος είναι φιλοσοφικός διάλογος του Πλάτωνα
PDF
Empowerment Technology for Senior High School Guide
ELIAS-SEZIURE AND EPilepsy semmioan session.pptx
Trump Administration's workforce development strategy
Hazard Identification & Risk Assessment .pdf
Unit 4 Computer Architecture Multicore Processor.pptx
Soft-furnishing-By-Architect-A.F.M.Mohiuddin-Akhand.doc
Onco Emergencies - Spinal cord compression Superior vena cava syndrome Febr...
My India Quiz Book_20210205121199924.pdf
Introduction to pro and eukaryotes and differences.pptx
Share_Module_2_Power_conflict_and_negotiation.pptx
FORM 1 BIOLOGY MIND MAPS and their schemes
Chinmaya Tiranga quiz Grand Finale.pdf
202450812 BayCHI UCSC-SV 20250812 v17.pptx
International_Financial_Reporting_Standa.pdf
Vision Prelims GS PYQ Analysis 2011-2022 www.upscpdf.com.pdf
Virtual and Augmented Reality in Current Scenario
CISA (Certified Information Systems Auditor) Domain-Wise Summary.pdf
medical_surgical_nursing_10th_edition_ignatavicius_TEST_BANK_pdf.pdf
B.Sc. DS Unit 2 Software Engineering.pptx
Τίμαιος είναι φιλοσοφικός διάλογος του Πλάτωνα
Empowerment Technology for Senior High School Guide

Newbold_chap02.ppt

  • 1. Chap 2-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 2 Describing Data: Graphical Statistics for Business and Economics 6th Edition
  • 2. Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-2 Chapter Goals After completing this chapter, you should be able to:  Identify types of data and levels of measurement  Create and interpret graphs to describe categorical variables:  frequency distribution, bar chart, pie chart, Pareto diagram  Create a line chart to describe time-series data  Create and interpret graphs to describe numerical variables:  frequency distribution, histogram, ogive, stem-and-leaf display  Construct and interpret graphs to describe relationships between variables:  Scatter plot, cross table  Describe appropriate and inappropriate ways to display data graphically
  • 3. Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-3 Types of Data Data Categorical Numerical Discrete Continuous Examples:  Marital Status  Are you registered to vote?  Eye Color (Defined categories or groups) Examples:  Number of Children  Defects per hour (Counted items) Examples:  Weight  Voltage (Measured characteristics)
  • 4. Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-4 Measurement Levels Interval Data Ordinal Data Nominal Data Quantitative Data Qualitative Data Categories (no ordering or direction) Ordered Categories (rankings, order, or scaling) Differences between measurements but no true zero Ratio Data Differences between measurements, true zero exists
  • 5. Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-5 Graphical Presentation of Data  Data in raw form are usually not easy to use for decision making  Some type of organization is needed  Table  Graph  The type of graph to use depends on the variable being summarized
  • 6. Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-6 Graphical Presentation of Data  Techniques reviewed in this chapter: Categorical Variables Numerical Variables • Frequency distribution • Bar chart • Pie chart • Pareto diagram • Line chart • Frequency distribution • Histogram and ogive • Stem-and-leaf display • Scatter plot (continued)
  • 7. Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-7 Tables and Graphs for Categorical Variables Categorical Data Graphing Data Pie Chart Pareto Diagram Bar Chart Frequency Distribution Table Tabulating Data
  • 8. Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-8 The Frequency Distribution Table Example: Hospital Patients by Unit Hospital Unit Number of Patients Cardiac Care 1,052 Emergency 2,245 Intensive Care 340 Maternity 552 Surgery 4,630 (Variables are categorical) Summarize data by category
  • 9. Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-9 Bar and Pie Charts  Bar charts and Pie charts are often used for qualitative (category) data  Height of bar or size of pie slice shows the frequency or percentage for each category
  • 10. Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-10 Bar Chart Example Hospital Patients by Unit 0 1000 2000 3000 4000 5000 Cardiac Care Emergency Intensive Care Maternity Surgery Number of patients per year Hospital Number Unit of Patients Cardiac Care 1,052 Emergency 2,245 Intensive Care 340 Maternity 552 Surgery 4,630
  • 11. Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-11 Hospital Patients by Unit Emergency 25% Maternity 6% Surgery 53% Cardiac Care 12% Intensive Care 4% Pie Chart Example (Percentages are rounded to the nearest percent) Hospital Number % of Unit of Patients Total Cardiac Care 1,052 11.93 Emergency 2,245 25.46 Intensive Care 340 3.86 Maternity 552 6.26 Surgery 4,630 52.50
  • 12. Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-12 Pareto Diagram  Used to portray categorical data  A bar chart, where categories are shown in descending order of frequency  A cumulative polygon is often shown in the same graph  Used to separate the “vital few” from the “trivial many”
  • 13. Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-13 Example: 400 defective items are examined for cause of defect: Source of Manufacturing Error Number of defects Bad Weld 34 Poor Alignment 223 Missing Part 25 Paint Flaw 78 Electrical Short 19 Cracked case 21 Total 400 Pareto Diagram Example
  • 14. Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-14 Step 1: Sort by defect cause, in descending order Step 2: Determine % in each category Source of Manufacturing Error Number of defects % of Total Defects Poor Alignment 223 55.75 Paint Flaw 78 19.50 Bad Weld 34 8.50 Missing Part 25 6.25 Cracked case 21 5.25 Electrical Short 19 4.75 Total 400 100% Pareto Diagram Example (continued)
  • 15. Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-15 Pareto Diagram Example cumulative % (line graph) % of defects in each category (bar graph) Pareto Diagram: Cause of Manufacturing Defect 0% 10% 20% 30% 40% 50% 60% Poor Alignment Paint Flaw Bad Weld Missing Part Cracked case Electrical Short 0% 10% 20% 30% 40% 50% 60% 70% 80% 90% 100% Step 3: Show results graphically (continued)
  • 16. Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-16 Graphs for Time-Series Data  A line chart (time-series plot) is used to show the values of a variable over time  Time is measured on the horizontal axis  The variable of interest is measured on the vertical axis
  • 17. Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-17 Line Chart Example Magazine Subscriptions by Year 0 50 100 150 200 250 300 350 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004 2005 2006 Thousands of subscribers
  • 18. Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-18 Numerical Data Stem-and-Leaf Display Histogram Ogive Frequency Distributions and Cumulative Distributions Graphs to Describe Numerical Variables
  • 19. Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-19 What is a Frequency Distribution?  A frequency distribution is a list or a table …  containing class groupings (categories or ranges within which the data fall) ...  and the corresponding frequencies with which data fall within each class or category Frequency Distributions
  • 20. Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-20 Why Use Frequency Distributions?  A frequency distribution is a way to summarize data  The distribution condenses the raw data into a more useful form...  and allows for a quick visual interpretation of the data
  • 21. Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-21 Class Intervals and Class Boundaries  Each class grouping has the same width  Determine the width of each interval by  Use at least 5 but no more than 15-20 intervals  Intervals never overlap  Round up the interval width to get desirable interval endpoints intervals desired of number number smallest number largest width interval w   
  • 22. Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-22 Frequency Distribution Example Example: A manufacturer of insulation randomly selects 20 winter days and records the daily high temperature 24, 35, 17, 21, 24, 37, 26, 46, 58, 30, 32, 13, 12, 38, 41, 43, 44, 27, 53, 27
  • 23. Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-23  Sort raw data in ascending order: 12, 13, 17, 21, 24, 24, 26, 27, 27, 30, 32, 35, 37, 38, 41, 43, 44, 46, 53, 58  Find range: 58 - 12 = 46  Select number of classes: 5 (usually between 5 and 15)  Compute interval width: 10 (46/5 then round up)  Determine interval boundaries: 10 but less than 20, 20 but less than 30, . . . , 60 but less than 70  Count observations & assign to classes Frequency Distribution Example (continued)
  • 24. Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-24 Frequency Distribution Example Interval Frequency 10 but less than 20 3 .15 15 20 but less than 30 6 .30 30 30 but less than 40 5 .25 25 40 but less than 50 4 .20 20 50 but less than 60 2 .10 10 Total 20 1.00 100 Relative Frequency Percentage Data in ordered array: 12, 13, 17, 21, 24, 24, 26, 27, 27, 30, 32, 35, 37, 38, 41, 43, 44, 46, 53, 58 (continued)
  • 25. Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-25 Histogram  A graph of the data in a frequency distribution is called a histogram  The interval endpoints are shown on the horizontal axis  the vertical axis is either frequency, relative frequency, or percentage  Bars of the appropriate heights are used to represent the number of observations within each class
  • 26. Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-26 Histogram: Daily High Temperature 0 3 6 5 4 2 0 0 1 2 3 4 5 6 7 0 10 20 30 40 50 60 Frequency Temperature in Degrees Histogram Example (No gaps between bars) Interval 10 but less than 20 3 20 but less than 30 6 30 but less than 40 5 40 but less than 50 4 50 but less than 60 2 Frequency 0 10 20 30 40 50 60 70
  • 27. Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-27 Histograms in Excel Select Tools/Data Analysis 1
  • 28. Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-28 Choose Histogram 2 3 Input data range and bin range (bin range is a cell range containing the upper interval endpoints for each class grouping) Select Chart Output and click “OK” Histograms in Excel (continued) (
  • 29. Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-29 Questions for Grouping Data into Intervals  1. How wide should each interval be? (How many classes should be used?)  2. How should the endpoints of the intervals be determined?  Often answered by trial and error, subject to user judgment  The goal is to create a distribution that is neither too "jagged" nor too "blocky”  Goal is to appropriately show the pattern of variation in the data
  • 30. Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-30 How Many Class Intervals?  Many (Narrow class intervals)  may yield a very jagged distribution with gaps from empty classes  Can give a poor indication of how frequency varies across classes  Few (Wide class intervals)  may compress variation too much and yield a blocky distribution  can obscure important patterns of variation. 0 2 4 6 8 10 12 0 30 60 More Temperature Frequency 0 0.5 1 1.5 2 2.5 3 3.5 4 8 12 16 20 24 28 32 36 40 44 48 52 56 60 More Temperature Frequency (X axis labels are upper class endpoints)
  • 31. Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-31 The Cumulative Frequency Distribuiton Class 10 but less than 20 3 15 3 15 20 but less than 30 6 30 9 45 30 but less than 40 5 25 14 70 40 but less than 50 4 20 18 90 50 but less than 60 2 10 20 100 Total 20 100 Percentage Cumulative Percentage Data in ordered array: 12, 13, 17, 21, 24, 24, 26, 27, 27, 30, 32, 35, 37, 38, 41, 43, 44, 46, 53, 58 Frequency Cumulative Frequency
  • 32. Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-32 The Ogive Graphing Cumulative Frequencies Ogive: Daily High Temperature 0 20 40 60 80 100 10 20 30 40 50 60 Cumulative Percentage Interval endpoints Interval Less than 10 10 0 10 but less than 20 20 15 20 but less than 30 30 45 30 but less than 40 40 70 40 but less than 50 50 90 50 but less than 60 60 100 Cumulative Percentage Upper interval endpoint
  • 33. Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-33 Distribution Shape  The shape of the distribution is said to be symmetric if the observations are balanced, or evenly distributed, about the center. Symmetric Distribution 0 1 2 3 4 5 6 7 8 9 10 1 2 3 4 5 6 7 8 9 Frequency
  • 34. Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-34 Distribution Shape  The shape of the distribution is said to be skewed if the observations are not symmetrically distributed around the center. (continued) Positively Skewed Distribution 0 2 4 6 8 10 12 1 2 3 4 5 6 7 8 9 Frequency Negatively Skewed Distribution 0 2 4 6 8 10 12 1 2 3 4 5 6 7 8 9 Frequency A positively skewed distribution (skewed to the right) has a tail that extends to the right in the direction of positive values. A negatively skewed distribution (skewed to the left) has a tail that extends to the left in the direction of negative values.
  • 35. Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-35 Stem-and-Leaf Diagram  A simple way to see distribution details in a data set METHOD: Separate the sorted data series into leading digits (the stem) and the trailing digits (the leaves)
  • 36. Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-36 Example  Here, use the 10’s digit for the stem unit: Data in ordered array: 21, 24, 24, 26, 27, 27, 30, 32, 38, 41  21 is shown as  38 is shown as Stem Leaf 2 1 3 8
  • 37. Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-37 Example  Completed stem-and-leaf diagram: Stem Leaves 2 1 4 4 6 7 7 3 0 2 8 4 1 (continued) Data in ordered array: 21, 24, 24, 26, 27, 27, 30, 32, 38, 41
  • 38. Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-38 Using other stem units  Using the 100’s digit as the stem:  Round off the 10’s digit to form the leaves  613 would become 6 1  776 would become 7 8  . . .  1224 becomes 12 2 Stem Leaf
  • 39. Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-39 Using other stem units  Using the 100’s digit as the stem:  The completed stem-and-leaf display: Stem Leaves (continued) 6 1 3 6 7 2 2 5 8 8 3 4 6 6 9 9 9 1 3 3 6 8 10 3 5 6 11 4 7 12 2 Data: 613, 632, 658, 717, 722, 750, 776, 827, 841, 859, 863, 891, 894, 906, 928, 933, 955, 982, 1034, 1047,1056, 1140, 1169, 1224
  • 40. Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-40 Relationships Between Variables  Graphs illustrated so far have involved only a single variable  When two variables exist other techniques are used: Categorical (Qualitative) Variables Numerical (Quantitative) Variables Cross tables Scatter plots
  • 41. Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-41  Scatter Diagrams are used for paired observations taken from two numerical variables  The Scatter Diagram:  one variable is measured on the vertical axis and the other variable is measured on the horizontal axis Scatter Diagrams
  • 42. Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-42 Scatter Diagram Example Cost per Dayvs. Production Volume 0 50 100 150 200 250 0 10 20 30 40 50 60 70 Volume per Day Cost per Day Volume per day Cost per day 23 125 26 140 29 146 33 160 38 167 42 170 50 188 55 195 60 200
  • 43. Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-43 Scatter Diagrams in Excel Select the chart wizard 1 2 Select XY(Scatter) option, then click “Next” When prompted, enter the data range, desired legend, and desired destination to complete the scatter diagram 3
  • 44. Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-44 Cross Tables  Cross Tables (or contingency tables) list the number of observations for every combination of values for two categorical or ordinal variables  If there are r categories for the first variable (rows) and c categories for the second variable (columns), the table is called an r x c cross table
  • 45. Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-45 Cross Table Example  4 x 3 Cross Table for Investment Choices by Investor (values in $1000’s) Investment Investor A Investor B Investor C Total Category Stocks 46.5 55 27.5 129 Bonds 32.0 44 19.0 95 CD 15.5 20 13.5 49 Savings 16.0 28 7.0 51 Total 110.0 147 67.0 324
  • 46. Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-46  Side by side bar charts (continued) Graphing Multivariate Categorical Data C omparing Investors 0 10 20 30 40 50 60 S toc k s B onds CD S avings Inves tor A Inves tor B Inves tor C
  • 47. Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-47 Side-by-Side Chart Example  Sales by quarter for three sales territories: 0 10 20 30 40 50 60 1st Qtr 2nd Qtr 3rd Qtr 4th Qtr East West North 1st Qtr 2nd Qtr 3rd Qtr 4th Qtr East 20.4 27.4 59 20.4 West 30.6 38.6 34.6 31.6 North 45.9 46.9 45 43.9
  • 48. Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-48 Data Presentation Errors Goals for effective data presentation:  Present data to display essential information  Communicate complex ideas clearly and accurately  Avoid distortion that might convey the wrong message
  • 49. Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-49  Unequal histogram interval widths  Compressing or distorting the vertical axis  Providing no zero point on the vertical axis  Failing to provide a relative basis in comparing data between groups Data Presentation Errors (continued)
  • 50. Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 2-50 Chapter Summary  Reviewed types of data and measurement levels  Data in raw form are usually not easy to use for decision making -- Some type of organization is needed:  Table  Graph  Techniques reviewed in this chapter:  Frequency distribution  Bar chart  Pie chart  Pareto diagram  Line chart  Frequency distribution  Histogram and ogive  Stem-and-leaf display  Scatter plot  Cross tables and side-by-side bar charts