SlideShare a Scribd company logo
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 3-1
Chapter 3
Numerical Descriptive Measures
Statistics for Managers
Using Microsoft®
Excel
4th
Edition
Statistics for Managers Using
Microsoft Excel, 4e © 2004
Prentice-Hall, Inc. Chap 3-2
After completing this chapter, you should be able to:
 Compute and interpret the mean, median, and mode for a
set of data
 Find the range, variance, standard deviation, and
coefficient of variation and know what these values mean
 Apply the empirical rule and the Bienaymé - Chebshev rule
to describe the variation of population values around the
mean
 Construct and interpret a box-and-whiskers plot
 Compute and explain the correlation coefficient
 Use numerical measures along with graphs, charts, and
tables to describe data
Chapter Goals
Statistics for Managers Using
Microsoft Excel, 4e © 2004
Prentice-Hall, Inc. Chap 3-3
Chapter Topics
 Measures of central tendency, variation, and
shape
 Mean, median, mode, geometric mean
 Quartiles
 Range, inter- quartile range, variance and standard
deviation, coefficient of variation
 Symmetric and skewed distributions
 Population summary measures
 Mean, variance, and standard deviation
 The empirical rule and Bienaymé - Chebyshev rule
Statistics for Managers Using
Microsoft Excel, 4e © 2004
Prentice-Hall, Inc. Chap 3-4
Chapter Topics
 Five number summary and box-and-whisker
plots
 Covariance and coefficient of correlation
 Pitfalls in numerical descriptive measures and
ethical considerations
(continued)
Statistics for Managers Using
Microsoft Excel, 4e © 2004
Prentice-Hall, Inc. Chap 3-5
Summary Measures
Arithmetic Mean
Median
Mode
Describing Data Numerically
Variance
Standard Deviation
Coefficient of Variation
Range
Inter - quartile Range
Geometric Mean
Skewness
Central Tendency Variation ShapeQuartiles
Statistics for Managers Using
Microsoft Excel, 4e © 2004
Prentice-Hall, Inc. Chap 3-6
Measures of Central Tendency
Central Tendency
Arithmetic Mean Median Mode Geometric Mean
n
X
X
n
i
i∑=
= 1
n/1
n21G )XXX(X ×××= 
Overview
Midpoint of
ranked
values
Most
frequently
observed
value
Statistics for Managers Using
Microsoft Excel, 4e © 2004
Prentice-Hall, Inc. Chap 3-7
Arithmetic Mean
 The arithmetic mean (mean) is the most
common measure of central tendency
 For a sample of size n:
Sample size
n
XXX
n
X
X n21
n
1i
i
+++
==
∑= 
Observed values
Statistics for Managers Using
Microsoft Excel, 4e © 2004
Prentice-Hall, Inc. Chap 3-8
Arithmetic Mean
 The most common measure of central tendency
 Mean = sum of values divided by the number of values
 Affected by extreme values (outliers)
(continued)
0 1 2 3 4 5 6 7 8 9 10
Mean = 3
0 1 2 3 4 5 6 7 8 9 10
Mean = 4
3
5
15
5
54321
==
++++
4
5
20
5
104321
==
++++
Statistics for Managers Using
Microsoft Excel, 4e © 2004
Prentice-Hall, Inc. Chap 3-9
Median
 In an ordered array, the median is the “middle”
number (50% above, 50% below)
 Not affected by extreme values
0 1 2 3 4 5 6 7 8 9 10
Median = 3
0 1 2 3 4 5 6 7 8 9 10
Median = 3
Statistics for Managers Using
Microsoft Excel, 4e © 2004
Prentice-Hall, Inc. Chap 3-10
Finding the Median
 The location of the median:
 If the number of values is odd, the median is the middle number
 If the number of values is even, the median is the average of
the two middle numbers
 Note that is not the value of the median, only the
position of the median in the ranked data
dataorderedtheinposition
2
1n
positionMedian
+
=
2
1n +
Statistics for Managers Using
Microsoft Excel, 4e © 2004
Prentice-Hall, Inc. Chap 3-11
Mode
 A measure of central tendency
 Value that occurs most often
 Not affected by extreme values
 Used for either numerical or categorical data
 There may may be no mode
 There may be several modes
0 1 2 3 4 5 6 7 8 9 10 11 12 13 14
Mode = 9
0 1 2 3 4 5 6
No Mode
Statistics for Managers Using
Microsoft Excel, 4e © 2004
Prentice-Hall, Inc. Chap 3-12
 Five houses on a hill by the beach
Review Example
$2,000 K
$500 K
$300 K
$100 K
$100 K
House Prices:
$2,000,000
500,000
300,000
100,000
100,000
Statistics for Managers Using
Microsoft Excel, 4e © 2004
Prentice-Hall, Inc. Chap 3-13
Review Example:
Summary Statistics
 Mean: ($3,000,000/5)
= $600,000
 Median: middle value of ranked data
= $300,000
 Mode: most frequent value
= $100,000
House Prices:
$2,000,000
500,000
300,000
100,000
100,000
Sum 3,000,000
Statistics for Managers Using
Microsoft Excel, 4e © 2004
Prentice-Hall, Inc. Chap 3-14
 Mean is generally used, unless
extreme values (outliers) exist
 Then median is often used, since
the median is not sensitive to
extreme values.
 Example: Median home prices may be
reported for a region – less sensitive to
outliers
Which measure of location
is the “best”?
Statistics for Managers Using
Microsoft Excel, 4e © 2004
Prentice-Hall, Inc. Chap 3-15
Geometric Mean
 Geometric mean
 Used to measure the rate of change of a variable
over time
 Geometric mean rate of return
 Measures the status of an investment over time
 Where Ri is the rate of return in time period i
n/1
n21G )XXX(X ×××= 
1)]R1()R1()R1[(R n/1
n21G −+××+×+= 
Statistics for Managers Using
Microsoft Excel, 4e © 2004
Prentice-Hall, Inc. Chap 3-16
Example
An investment of $100,000 declined to $50,000 at the
end of year one and rebounded to $100,000 at end
of year two:
000,100$X000,50$X000,100$X 321 ===
50% decrease 100% increase
The overall two-year return is zero, since it started and
ended at the same level.
Statistics for Managers Using
Microsoft Excel, 4e © 2004
Prentice-Hall, Inc. Chap 3-17
Example
Use the 1-year returns to compute the arithmetic
mean and the geometric mean:
%0111)]2()50[(.
1%))]100(1(%))50(1[(
1)]R1()R1()R1[(R
2/12/1
2/1
n/1
n21G
=−=−×=
−+×−+=
−+××+×+= 
%25
2
%)100(%)50(
X =
+−
=
Arithmetic
mean rate
of return:
Geometric
mean rate
of return:
Misleading result
More
accurate
result
(continued)
Statistics for Managers Using
Microsoft Excel, 4e © 2004
Prentice-Hall, Inc. Chap 3-18
Quartiles
 Quartiles split the ranked data into 4 segments with
an equal number of values per segment
25% 25% 25% 25%
 The first quartile, Q1, is the value for which 25% of the
observations are smaller and 75% are larger
 Q2 is the same as the median (50% are smaller, 50% are
larger)
 Only 25% of the observations are greater than the third
quartile
Q1 Q2 Q3
Statistics for Managers Using
Microsoft Excel, 4e © 2004
Prentice-Hall, Inc. Chap 3-19
Quartile Formulas
Find a quartile by determining the value in the
appropriate position in the ranked data, where
First quartile position: Q1 = (n+1)/4
Second quartile position: Q2 = (n+1)/2 (the median position)
Third quartile position: Q3 = 3(n+1)/4
where n is the number of observed values
Statistics for Managers Using
Microsoft Excel, 4e © 2004
Prentice-Hall, Inc. Chap 3-20
(n = 9)
Q1 = is in the (9+1)/4 = 2.5 position of the ranked data
so use the value half way between the 2nd
and 3rd
values,
so Q1 = 12.5
Quartiles
Sample Data in Ordered Array: 11 12 13 16 16 17 18 21 22
 Example: Find the first quartile
Q1 and Q3 are measures of noncentral location
Q2 = median, a measure of central tendency
Statistics for Managers Using
Microsoft Excel, 4e © 2004
Prentice-Hall, Inc. Chap 3-21
Same center,
different variation
Measures of Variation
Variation
Variance Standard
Deviation
Coefficient
of Variation
Range Interquartile
Range
 Measures of variation give
information on the spread or
variability of the data
values.
Statistics for Managers Using
Microsoft Excel, 4e © 2004
Prentice-Hall, Inc. Chap 3-22
Range
 Simplest measure of variation
 Difference between the largest and the smallest
observations:
Range = Xlargest – Xsmallest
0 1 2 3 4 5 6 7 8 9 10 11 12 13 14
Range = 14 - 1 = 13
Example:
Statistics for Managers Using
Microsoft Excel, 4e © 2004
Prentice-Hall, Inc. Chap 3-23
 Ignores the way in which data are distributed
 Sensitive to outliers
7 8 9 10 11 12
Range = 12 - 7 = 5
7 8 9 10 11 12
Range = 12 - 7 = 5
Disadvantages of the Range
1,1,1,1,1,1,1,1,1,1,1,2,2,2,2,2,2,2,2,3,3,3,3,4,5
1,1,1,1,1,1,1,1,1,1,1,2,2,2,2,2,2,2,2,3,3,3,3,4,120
Range = 5 - 1 = 4
Range = 120 - 1 = 119
Statistics for Managers Using
Microsoft Excel, 4e © 2004
Prentice-Hall, Inc. Chap 3-24
Inter - quartile Range
 Can eliminate some outlier problems by using
the inter - quartile range
 Eliminate some high- and low-valued
observations and calculate the range from the
remaining values
 Interquartile range = 3rd
quartile – 1st
quartile
= Q3 – Q1
Statistics for Managers Using
Microsoft Excel, 4e © 2004
Prentice-Hall, Inc. Chap 3-25
Interquartile Range
Median
(Q2)
X
maximumX
minimum Q1 Q3
Example:
25% 25% 25% 25%
12 30 45 57 70
Interquartile range
= 57 – 30 = 27
Statistics for Managers Using
Microsoft Excel, 4e © 2004
Prentice-Hall, Inc. Chap 3-26
 Average (approximately) of squared deviations
of values from the mean
 Sample variance:
Variance
1-n
)X(X
S
n
1i
2
i
2
∑=
−
=
Where = arithmetic mean
n = sample size
Xi = ith
value of the variable X
X
Statistics for Managers Using
Microsoft Excel, 4e © 2004
Prentice-Hall, Inc. Chap 3-27
Standard Deviation
 Most commonly used measure of variation
 Shows variation about the mean
 Has the same units as the original data
 Sample standard deviation:
1-n
)X(X
S
n
1i
2
i∑=
−
=
Statistics for Managers Using
Microsoft Excel, 4e © 2004
Prentice-Hall, Inc. Chap 3-28
Calculation Example:
Sample Standard Deviation
Sample
Data (Xi) : 10 12 14 15 17 18 18 24
n = 8 Mean = X = 16
4.2426
7
126
18
16)(2416)(1416)(1216)(10
1n
)X(24)X(14)X(12)X(10
S
2222
2222
==
−
−++−+−+−
=
−
−++−+−+−
=


A measure of the “average”
scatter around the mean
Statistics for Managers Using
Microsoft Excel, 4e © 2004
Prentice-Hall, Inc. Chap 3-29
Measuring variation
Small standard deviation
Large standard deviation
Statistics for Managers Using
Microsoft Excel, 4e © 2004
Prentice-Hall, Inc. Chap 3-30
Comparing Standard Deviations
Mean = 15.5
S = 3.33811 12 13 14 15 16 17 18 19 20 21
11 12 13 14 15 16 17 18 19 20 21
Data B
Data A
Mean = 15.5
S = 0.926
11 12 13 14 15 16 17 18 19 20 21
Mean = 15.5
S = 4.570
Data C
Statistics for Managers Using
Microsoft Excel, 4e © 2004
Prentice-Hall, Inc. Chap 3-31
Advantages of Variance and
Standard Deviation
 Each value in the data set is used in the
calculation
 Values far from the mean are given extra
weight
(because deviations from the mean are squared)
Statistics for Managers Using
Microsoft Excel, 4e © 2004
Prentice-Hall, Inc. Chap 3-32
Coefficient of Variation
 Measures relative variation
 Always in percentage (%)
 Shows variation relative to mean
 Can be used to compare two or more sets of
data measured in different units
100%
X
S
CV ⋅







=
Statistics for Managers Using
Microsoft Excel, 4e © 2004
Prentice-Hall, Inc. Chap 3-33
Comparing Coefficient
of Variation
 Stock A:
 Average price last year = $50
 Standard deviation = $5
 Stock B:
 Average price last year = $100
 Standard deviation = $5
Both stocks
have the same
standard
deviation, but
stock B is less
variable relative
to its price
10%100%
$50
$5
100%
X
S
CVA =⋅=⋅







=
5%100%
$100
$5
100%
X
S
CVB =⋅=⋅







=
Statistics for Managers Using
Microsoft Excel, 4e © 2004
Prentice-Hall, Inc. Chap 3-34
Shape of a Distribution
 Describes how data is distributed
 Measures of shape
 Symmetric or skewed
Mean = MedianMean < Median Median < Mean
Right-SkewedLeft-Skewed Symmetric
Statistics for Managers Using
Microsoft Excel, 4e © 2004
Prentice-Hall, Inc. Chap 3-35
Using Microsoft Excel
 Descriptive Statistics can be obtained
from Microsoft®
Excel
 Use menu choice:
tools / data analysis / descriptive statistics
 Enter details in dialog box
Statistics for Managers Using
Microsoft Excel, 4e © 2004
Prentice-Hall, Inc. Chap 3-36
Using Excel
Use menu choice:
tools / data analysis /
descriptive statistics
Statistics for Managers Using
Microsoft Excel, 4e © 2004
Prentice-Hall, Inc. Chap 3-37
 Enter dialog box
details
 Check box for
summary statistics
 Click OK
Using Excel
(continued)
Statistics for Managers Using
Microsoft Excel, 4e © 2004
Prentice-Hall, Inc. Chap 3-38
Excel output
Microsoft Excel
descriptive statistics output,
using the house price data:
House Prices:
$2,000,000
500,000
300,000
100,000
100,000
Statistics for Managers Using
Microsoft Excel, 4e © 2004
Prentice-Hall, Inc. Chap 3-39
Population Summary Measures
 Population summary measures are called parameters
 The population mean is the sum of the values in the
population divided by the population size, N
N
XXX
N
X
N21
N
1i
i
+++
==µ
∑= 
μ = population mean
N = population size
Xi = ith
value of the variable X
Where
Statistics for Managers Using
Microsoft Excel, 4e © 2004
Prentice-Hall, Inc. Chap 3-40
 Average of squared deviations of values from
the mean
 Population variance:
Population Variance
N
μ)(X
σ
N
1i
2
i
2
∑=
−
=
Where μ = population mean
N = population size
Xi = ith
value of the variable X
Statistics for Managers Using
Microsoft Excel, 4e © 2004
Prentice-Hall, Inc. Chap 3-41
Population Standard Deviation
 Most commonly used measure of variation
 Shows variation about the mean
 Has the same units as the original data
 Population standard deviation:
N
μ)(X
σ
N
1i
2
i∑=
−
=
Statistics for Managers Using
Microsoft Excel, 4e © 2004
Prentice-Hall, Inc. Chap 3-42
 If the data distribution is bell-shaped, then
the interval:
 contains about 68% of the values in
the population or the sample
The Empirical Rule
1σμ ±
μ
68%
1σμ ±
Statistics for Managers Using
Microsoft Excel, 4e © 2004
Prentice-Hall, Inc. Chap 3-43
 contains about 95% of the values in
the population or the sample
 contains about 99.7% of the values
in the population or the sample
The Empirical Rule
2σμ ±
3σμ ±
3σμ ±
99.7%95%
2σμ ±
Statistics for Managers Using
Microsoft Excel, 4e © 2004
Prentice-Hall, Inc. Chap 3-44
 Regardless of how the data are distributed,
at least (1 - 1/k2
) of the values will fall within
k standard deviations of the mean (for k > 1)
 Examples:
(1 - 1/12
) = 0% ……..... k=1 (μ ± 1σ)
(1 - 1/22
) = 75% …........ k=2 (μ ± 2σ)
(1 - 1/32
) = 89% ………. k=3 (μ ± 3σ)
Bienaymé - Chebyshev Rule
withinAt least
Statistics for Managers Using
Microsoft Excel, 4e © 2004
Prentice-Hall, Inc. Chap 3-45
Exploratory Data Analysis
 Box-and-Whisker Plot: A Graphical display of
data using 5-number summary:
Minimum -- Q1 -- Median -- Q3 -- Maximum
Example:
Minimum 1st Median 3rd Maximum
Quartile Quartile
Minimum 1st Median 3rd Maximum
Quartile Quartile
25% 25% 25% 25%
Statistics for Managers Using
Microsoft Excel, 4e © 2004
Prentice-Hall, Inc. Chap 3-46
Shape of Box-and-Whisker Plots
 The Box and central line are centered between the
endpoints if data are symmetric around the median
 A Box-and-Whisker plot can be shown in either vertical
or horizontal format
Min Q1 Median Q3 Max
Statistics for Managers Using
Microsoft Excel, 4e © 2004
Prentice-Hall, Inc. Chap 3-47
Distribution Shape and
Box-and-Whisker Plot
Right-SkewedLeft-Skewed Symmetric
Q1 Q2 Q3 Q1 Q2 Q3 Q1 Q2 Q3
Statistics for Managers Using
Microsoft Excel, 4e © 2004
Prentice-Hall, Inc. Chap 3-48
Box-and-Whisker Plot Example
 Below is a Box-and-Whisker plot for the following
data:
0 2 2 2 3 3 4 5 5 10 27
 This data is right skewed, as the plot depicts
0 2 3 5 270 2 3 5 27
Min Q1 Q2 Q3 Max
Statistics for Managers Using
Microsoft Excel, 4e © 2004
Prentice-Hall, Inc. Chap 3-49
The Sample Covariance
 The sample covariance measures the strength of the
linear relationship between two variables (called
bivariate data)
 The sample covariance:
 Only concerned with the strength of the relationship
 No causal effect is implied
1n
)YY)(XX(
)Y,X(cov
n
1i
ii
−
−−
=
∑=
Statistics for Managers Using
Microsoft Excel, 4e © 2004
Prentice-Hall, Inc. Chap 3-50
 Covariance between two random variables:
cov(X,Y) > 0 X and Y tend to move in the same direction
cov(X,Y) < 0 X and Y tend to move in opposite directions
cov(X,Y) = 0 X and Y are independent
Interpreting Covariance
Statistics for Managers Using
Microsoft Excel, 4e © 2004
Prentice-Hall, Inc. Chap 3-51
Coefficient of Correlation
 Measures the relative strength of the linear
relationship between two variables
 Sample coefficient of correlation:
YX
n
1i
2
i
n
1i
2
i
n
1i
ii
SS
)Y,X(cov
)YY()XX(
)YY)(XX(
r =
−−
−−
=
∑∑
∑
==
=
Statistics for Managers Using
Microsoft Excel, 4e © 2004
Prentice-Hall, Inc. Chap 3-52
Features of
Correlation Coefficient, r
 Unit free
 Ranges between –1 and 1
 The closer to –1, the stronger the negative linear
relationship
 The closer to 1, the stronger the positive linear
relationship
 The closer to 0, the weaker any positive linear
relationship
Statistics for Managers Using
Microsoft Excel, 4e © 2004
Prentice-Hall, Inc. Chap 3-53
Scatter Plots of Data with Various
Correlation Coefficients
Y
X
Y
X
Y
X
Y
X
Y
X
r = -1 r = -.6 r = 0
r = +.3r = +1
Y
X
r = 0
Statistics for Managers Using
Microsoft Excel, 4e © 2004
Prentice-Hall, Inc. Chap 3-54
Using Excel to Find
the Correlation Coefficient
 Select
Tools/Data Analysis
 Choose Correlation from
the selection menu
 Click OK . . .
Statistics for Managers Using
Microsoft Excel, 4e © 2004
Prentice-Hall, Inc. Chap 3-55
Using Excel to Find
the Correlation Coefficient
 Input data range and select
appropriate options
 Click OK to get output
(continued)
Statistics for Managers Using
Microsoft Excel, 4e © 2004
Prentice-Hall, Inc. Chap 3-56
Interpreting the Result
 r = .733
 There is a relatively
strong positive linear
relationship between
test score #1
and test score #2
 Students who scored high on the first test tended
to score high on second test
Scatter Plot of Test Scores
70
75
80
85
90
95
100
70 75 80 85 90 95 100
Test #1 Score
Test#2Score
Statistics for Managers Using
Microsoft Excel, 4e © 2004
Prentice-Hall, Inc. Chap 3-57
Pitfalls in Numerical
Descriptive Measures
 Data analysis is objective
 Should report the summary measures that best meet
the assumptions about the data set
 Data interpretation is subjective
 Should be done in fair, neutral and clear manner
Statistics for Managers Using
Microsoft Excel, 4e © 2004
Prentice-Hall, Inc. Chap 3-58
Ethical Considerations
Numerical descriptive measures:
 Should document both good and bad results
 Should be presented in a fair, objective and
neutral manner
 Should not use inappropriate summary
measures to distort facts
Statistics for Managers Using
Microsoft Excel, 4e © 2004
Prentice-Hall, Inc. Chap 3-59
Chapter Summary
 Described measures of central tendency
 Mean, median, mode, geometric mean
 Discussed quartiles
 Described measures of variation
 Range, interquartile range, variance and standard
deviation, coefficient of variation
 Illustrated shape of distribution
 Symmetric, skewed, box-and-whisker plots
Statistics for Managers Using
Microsoft Excel, 4e © 2004
Prentice-Hall, Inc. Chap 3-60
Chapter Summary
 Discussed covariance and correlation
coefficient
 Addressed pitfalls in numerical descriptive
measures and ethical considerations
(continued)

More Related Content

PPT
7E's APPROCH
PPTX
Measures of Variation (Ungrouped Data)
PPTX
Measures of Variation
PPTX
ESP 8 Modyul 10
PPTX
MEASURES OF POSITION GROUPED DATA.pptx
PPTX
Using Excel Functions
PPT
parabola class 12
PDF
Measures of central tendency
7E's APPROCH
Measures of Variation (Ungrouped Data)
Measures of Variation
ESP 8 Modyul 10
MEASURES OF POSITION GROUPED DATA.pptx
Using Excel Functions
parabola class 12
Measures of central tendency

What's hot (20)

PPT
Chap03 numerical descriptive measures
PDF
Tabel distribusi frekuensi
PPTX
Probability Distribution
PPT
Chap04 basic probability
PPT
Chap02 presenting data in chart & tables
PPTX
Basic Concepts of Probability
DOCX
Langkah langkah analisis korespondensi
PDF
Lecture 15 - Hypothesis Testing (1).pdf
PPTX
Estimating population mean
PDF
1. STATISTIK INDUSTRI - PENDAHULUAN
PPT
Sampling and sampling distribution
PPT
Discrete Probability Distributions
PPTX
Measures of Variation
PPTX
Introduction to Statistics and Probability
PPTX
Ukuran pemusatan data
PPTX
Chap04 discrete random variables and probability distribution
PPTX
Variance and standard deviation of a discrete random variable
PPTX
Normal as Approximation to Binomial
DOCX
Statistics and Probability Q4_M1_LAS .docx
Chap03 numerical descriptive measures
Tabel distribusi frekuensi
Probability Distribution
Chap04 basic probability
Chap02 presenting data in chart & tables
Basic Concepts of Probability
Langkah langkah analisis korespondensi
Lecture 15 - Hypothesis Testing (1).pdf
Estimating population mean
1. STATISTIK INDUSTRI - PENDAHULUAN
Sampling and sampling distribution
Discrete Probability Distributions
Measures of Variation
Introduction to Statistics and Probability
Ukuran pemusatan data
Chap04 discrete random variables and probability distribution
Variance and standard deviation of a discrete random variable
Normal as Approximation to Binomial
Statistics and Probability Q4_M1_LAS .docx
Ad

Similar to Numerical Descriptive Measures (20)

PPT
Bbs10 ppt ch03
PPT
Newbold_chap03.ppt
PPT
Lesson 6 measures of central tendency
PPT
Confidence Interval Estimation
PPT
Chap07 interval estimation
PPT
Chap08 fundamentals of hypothesis
PPT
Business Statistics Chapter 3
PPTX
Statistical Methods: Measures of Central Tendency.pptx
PPT
Fundamentals of Testing Hypothesis
PPTX
Chap02 describing data; numerical
PPT
Chap02 presenting data in chart & tables
PPTX
chap08.pptx
PPT
Chap10 anova
PPT
Presenting Data in Tables and Charts
PPT
Analysis of Variance
PPT
Chap12 simple regression
PPT
Analysis of measure-of-central-tendency.ppt
PPTX
2 Descriptive Statistics Psychology.pptx
PPT
Mean_Median_Mode.ppthhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhh...
PPT
medidas de tendencia central en ingles.ppt
Bbs10 ppt ch03
Newbold_chap03.ppt
Lesson 6 measures of central tendency
Confidence Interval Estimation
Chap07 interval estimation
Chap08 fundamentals of hypothesis
Business Statistics Chapter 3
Statistical Methods: Measures of Central Tendency.pptx
Fundamentals of Testing Hypothesis
Chap02 describing data; numerical
Chap02 presenting data in chart & tables
chap08.pptx
Chap10 anova
Presenting Data in Tables and Charts
Analysis of Variance
Chap12 simple regression
Analysis of measure-of-central-tendency.ppt
2 Descriptive Statistics Psychology.pptx
Mean_Median_Mode.ppthhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhh...
medidas de tendencia central en ingles.ppt
Ad

More from Yesica Adicondro (20)

PPTX
Strategi Tata Letak
PPT
Konsep Balanced Score Card
PPTX
Makalah kelompok Analisis Taksi Bakri
DOCX
Makalah kelompok Analisis Taksi Bakri
PPTX
Makalah Analisis PT Kereta API Indonesia
DOC
Makalah Analisis PT Kereta API Indonesia
PPTX
Makalah kelompok 3 gudang garam
DOC
Makalah Perusahaan Gudang Garam
PPTX
Makalah kelompok 2 garuda citilink PPT
DOCX
Makalah kelompok 2 garuda citilink
PDF
Dmfi leaflet indonesian
PDF
Dmfi booklet indonesian
PPTX
Makalah kinerja operasi Indonesia PPT
DOCX
Makalah kinerja operasi Indonesia
PPTX
Business process reengineering PPT
DOCX
Business process reengineering Makalah
PPTX
PPT Balanced Scorecard
DOCX
Makalah Balanced Scorecard
DOCX
Analisis Manajemen strategik PT garuda citilink
PPTX
analisis PPT PT Japfa
Strategi Tata Letak
Konsep Balanced Score Card
Makalah kelompok Analisis Taksi Bakri
Makalah kelompok Analisis Taksi Bakri
Makalah Analisis PT Kereta API Indonesia
Makalah Analisis PT Kereta API Indonesia
Makalah kelompok 3 gudang garam
Makalah Perusahaan Gudang Garam
Makalah kelompok 2 garuda citilink PPT
Makalah kelompok 2 garuda citilink
Dmfi leaflet indonesian
Dmfi booklet indonesian
Makalah kinerja operasi Indonesia PPT
Makalah kinerja operasi Indonesia
Business process reengineering PPT
Business process reengineering Makalah
PPT Balanced Scorecard
Makalah Balanced Scorecard
Analisis Manajemen strategik PT garuda citilink
analisis PPT PT Japfa

Recently uploaded (20)

PDF
Introduction to Business Data Analytics.
PPTX
05. PRACTICAL GUIDE TO MICROSOFT EXCEL.pptx
PPT
Quality review (1)_presentation of this 21
PDF
Launch Your Data Science Career in Kochi – 2025
PPTX
Global journeys: estimating international migration
PDF
Mega Projects Data Mega Projects Data
PDF
Clinical guidelines as a resource for EBP(1).pdf
PPTX
Business Acumen Training GuidePresentation.pptx
PPTX
oil_refinery_comprehensive_20250804084928 (1).pptx
PPTX
IBA_Chapter_11_Slides_Final_Accessible.pptx
PPTX
CEE 2 REPORT G7.pptxbdbshjdgsgjgsjfiuhsd
PDF
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
PPTX
DISORDERS OF THE LIVER, GALLBLADDER AND PANCREASE (1).pptx
PDF
.pdf is not working space design for the following data for the following dat...
PPTX
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
PDF
Fluorescence-microscope_Botany_detailed content
PDF
Galatica Smart Energy Infrastructure Startup Pitch Deck
PPTX
Introduction to Knowledge Engineering Part 1
PPTX
Computer network topology notes for revision
PPTX
Moving the Public Sector (Government) to a Digital Adoption
Introduction to Business Data Analytics.
05. PRACTICAL GUIDE TO MICROSOFT EXCEL.pptx
Quality review (1)_presentation of this 21
Launch Your Data Science Career in Kochi – 2025
Global journeys: estimating international migration
Mega Projects Data Mega Projects Data
Clinical guidelines as a resource for EBP(1).pdf
Business Acumen Training GuidePresentation.pptx
oil_refinery_comprehensive_20250804084928 (1).pptx
IBA_Chapter_11_Slides_Final_Accessible.pptx
CEE 2 REPORT G7.pptxbdbshjdgsgjgsjfiuhsd
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
DISORDERS OF THE LIVER, GALLBLADDER AND PANCREASE (1).pptx
.pdf is not working space design for the following data for the following dat...
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
Fluorescence-microscope_Botany_detailed content
Galatica Smart Energy Infrastructure Startup Pitch Deck
Introduction to Knowledge Engineering Part 1
Computer network topology notes for revision
Moving the Public Sector (Government) to a Digital Adoption

Numerical Descriptive Measures

  • 1. Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 3-1 Chapter 3 Numerical Descriptive Measures Statistics for Managers Using Microsoft® Excel 4th Edition
  • 2. Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 3-2 After completing this chapter, you should be able to:  Compute and interpret the mean, median, and mode for a set of data  Find the range, variance, standard deviation, and coefficient of variation and know what these values mean  Apply the empirical rule and the Bienaymé - Chebshev rule to describe the variation of population values around the mean  Construct and interpret a box-and-whiskers plot  Compute and explain the correlation coefficient  Use numerical measures along with graphs, charts, and tables to describe data Chapter Goals
  • 3. Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 3-3 Chapter Topics  Measures of central tendency, variation, and shape  Mean, median, mode, geometric mean  Quartiles  Range, inter- quartile range, variance and standard deviation, coefficient of variation  Symmetric and skewed distributions  Population summary measures  Mean, variance, and standard deviation  The empirical rule and Bienaymé - Chebyshev rule
  • 4. Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 3-4 Chapter Topics  Five number summary and box-and-whisker plots  Covariance and coefficient of correlation  Pitfalls in numerical descriptive measures and ethical considerations (continued)
  • 5. Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 3-5 Summary Measures Arithmetic Mean Median Mode Describing Data Numerically Variance Standard Deviation Coefficient of Variation Range Inter - quartile Range Geometric Mean Skewness Central Tendency Variation ShapeQuartiles
  • 6. Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 3-6 Measures of Central Tendency Central Tendency Arithmetic Mean Median Mode Geometric Mean n X X n i i∑= = 1 n/1 n21G )XXX(X ×××=  Overview Midpoint of ranked values Most frequently observed value
  • 7. Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 3-7 Arithmetic Mean  The arithmetic mean (mean) is the most common measure of central tendency  For a sample of size n: Sample size n XXX n X X n21 n 1i i +++ == ∑=  Observed values
  • 8. Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 3-8 Arithmetic Mean  The most common measure of central tendency  Mean = sum of values divided by the number of values  Affected by extreme values (outliers) (continued) 0 1 2 3 4 5 6 7 8 9 10 Mean = 3 0 1 2 3 4 5 6 7 8 9 10 Mean = 4 3 5 15 5 54321 == ++++ 4 5 20 5 104321 == ++++
  • 9. Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 3-9 Median  In an ordered array, the median is the “middle” number (50% above, 50% below)  Not affected by extreme values 0 1 2 3 4 5 6 7 8 9 10 Median = 3 0 1 2 3 4 5 6 7 8 9 10 Median = 3
  • 10. Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 3-10 Finding the Median  The location of the median:  If the number of values is odd, the median is the middle number  If the number of values is even, the median is the average of the two middle numbers  Note that is not the value of the median, only the position of the median in the ranked data dataorderedtheinposition 2 1n positionMedian + = 2 1n +
  • 11. Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 3-11 Mode  A measure of central tendency  Value that occurs most often  Not affected by extreme values  Used for either numerical or categorical data  There may may be no mode  There may be several modes 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 Mode = 9 0 1 2 3 4 5 6 No Mode
  • 12. Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 3-12  Five houses on a hill by the beach Review Example $2,000 K $500 K $300 K $100 K $100 K House Prices: $2,000,000 500,000 300,000 100,000 100,000
  • 13. Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 3-13 Review Example: Summary Statistics  Mean: ($3,000,000/5) = $600,000  Median: middle value of ranked data = $300,000  Mode: most frequent value = $100,000 House Prices: $2,000,000 500,000 300,000 100,000 100,000 Sum 3,000,000
  • 14. Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 3-14  Mean is generally used, unless extreme values (outliers) exist  Then median is often used, since the median is not sensitive to extreme values.  Example: Median home prices may be reported for a region – less sensitive to outliers Which measure of location is the “best”?
  • 15. Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 3-15 Geometric Mean  Geometric mean  Used to measure the rate of change of a variable over time  Geometric mean rate of return  Measures the status of an investment over time  Where Ri is the rate of return in time period i n/1 n21G )XXX(X ×××=  1)]R1()R1()R1[(R n/1 n21G −+××+×+= 
  • 16. Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 3-16 Example An investment of $100,000 declined to $50,000 at the end of year one and rebounded to $100,000 at end of year two: 000,100$X000,50$X000,100$X 321 === 50% decrease 100% increase The overall two-year return is zero, since it started and ended at the same level.
  • 17. Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 3-17 Example Use the 1-year returns to compute the arithmetic mean and the geometric mean: %0111)]2()50[(. 1%))]100(1(%))50(1[( 1)]R1()R1()R1[(R 2/12/1 2/1 n/1 n21G =−=−×= −+×−+= −+××+×+=  %25 2 %)100(%)50( X = +− = Arithmetic mean rate of return: Geometric mean rate of return: Misleading result More accurate result (continued)
  • 18. Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 3-18 Quartiles  Quartiles split the ranked data into 4 segments with an equal number of values per segment 25% 25% 25% 25%  The first quartile, Q1, is the value for which 25% of the observations are smaller and 75% are larger  Q2 is the same as the median (50% are smaller, 50% are larger)  Only 25% of the observations are greater than the third quartile Q1 Q2 Q3
  • 19. Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 3-19 Quartile Formulas Find a quartile by determining the value in the appropriate position in the ranked data, where First quartile position: Q1 = (n+1)/4 Second quartile position: Q2 = (n+1)/2 (the median position) Third quartile position: Q3 = 3(n+1)/4 where n is the number of observed values
  • 20. Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 3-20 (n = 9) Q1 = is in the (9+1)/4 = 2.5 position of the ranked data so use the value half way between the 2nd and 3rd values, so Q1 = 12.5 Quartiles Sample Data in Ordered Array: 11 12 13 16 16 17 18 21 22  Example: Find the first quartile Q1 and Q3 are measures of noncentral location Q2 = median, a measure of central tendency
  • 21. Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 3-21 Same center, different variation Measures of Variation Variation Variance Standard Deviation Coefficient of Variation Range Interquartile Range  Measures of variation give information on the spread or variability of the data values.
  • 22. Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 3-22 Range  Simplest measure of variation  Difference between the largest and the smallest observations: Range = Xlargest – Xsmallest 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 Range = 14 - 1 = 13 Example:
  • 23. Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 3-23  Ignores the way in which data are distributed  Sensitive to outliers 7 8 9 10 11 12 Range = 12 - 7 = 5 7 8 9 10 11 12 Range = 12 - 7 = 5 Disadvantages of the Range 1,1,1,1,1,1,1,1,1,1,1,2,2,2,2,2,2,2,2,3,3,3,3,4,5 1,1,1,1,1,1,1,1,1,1,1,2,2,2,2,2,2,2,2,3,3,3,3,4,120 Range = 5 - 1 = 4 Range = 120 - 1 = 119
  • 24. Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 3-24 Inter - quartile Range  Can eliminate some outlier problems by using the inter - quartile range  Eliminate some high- and low-valued observations and calculate the range from the remaining values  Interquartile range = 3rd quartile – 1st quartile = Q3 – Q1
  • 25. Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 3-25 Interquartile Range Median (Q2) X maximumX minimum Q1 Q3 Example: 25% 25% 25% 25% 12 30 45 57 70 Interquartile range = 57 – 30 = 27
  • 26. Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 3-26  Average (approximately) of squared deviations of values from the mean  Sample variance: Variance 1-n )X(X S n 1i 2 i 2 ∑= − = Where = arithmetic mean n = sample size Xi = ith value of the variable X X
  • 27. Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 3-27 Standard Deviation  Most commonly used measure of variation  Shows variation about the mean  Has the same units as the original data  Sample standard deviation: 1-n )X(X S n 1i 2 i∑= − =
  • 28. Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 3-28 Calculation Example: Sample Standard Deviation Sample Data (Xi) : 10 12 14 15 17 18 18 24 n = 8 Mean = X = 16 4.2426 7 126 18 16)(2416)(1416)(1216)(10 1n )X(24)X(14)X(12)X(10 S 2222 2222 == − −++−+−+− = − −++−+−+− =   A measure of the “average” scatter around the mean
  • 29. Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 3-29 Measuring variation Small standard deviation Large standard deviation
  • 30. Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 3-30 Comparing Standard Deviations Mean = 15.5 S = 3.33811 12 13 14 15 16 17 18 19 20 21 11 12 13 14 15 16 17 18 19 20 21 Data B Data A Mean = 15.5 S = 0.926 11 12 13 14 15 16 17 18 19 20 21 Mean = 15.5 S = 4.570 Data C
  • 31. Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 3-31 Advantages of Variance and Standard Deviation  Each value in the data set is used in the calculation  Values far from the mean are given extra weight (because deviations from the mean are squared)
  • 32. Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 3-32 Coefficient of Variation  Measures relative variation  Always in percentage (%)  Shows variation relative to mean  Can be used to compare two or more sets of data measured in different units 100% X S CV ⋅        =
  • 33. Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 3-33 Comparing Coefficient of Variation  Stock A:  Average price last year = $50  Standard deviation = $5  Stock B:  Average price last year = $100  Standard deviation = $5 Both stocks have the same standard deviation, but stock B is less variable relative to its price 10%100% $50 $5 100% X S CVA =⋅=⋅        = 5%100% $100 $5 100% X S CVB =⋅=⋅        =
  • 34. Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 3-34 Shape of a Distribution  Describes how data is distributed  Measures of shape  Symmetric or skewed Mean = MedianMean < Median Median < Mean Right-SkewedLeft-Skewed Symmetric
  • 35. Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 3-35 Using Microsoft Excel  Descriptive Statistics can be obtained from Microsoft® Excel  Use menu choice: tools / data analysis / descriptive statistics  Enter details in dialog box
  • 36. Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 3-36 Using Excel Use menu choice: tools / data analysis / descriptive statistics
  • 37. Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 3-37  Enter dialog box details  Check box for summary statistics  Click OK Using Excel (continued)
  • 38. Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 3-38 Excel output Microsoft Excel descriptive statistics output, using the house price data: House Prices: $2,000,000 500,000 300,000 100,000 100,000
  • 39. Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 3-39 Population Summary Measures  Population summary measures are called parameters  The population mean is the sum of the values in the population divided by the population size, N N XXX N X N21 N 1i i +++ ==µ ∑=  μ = population mean N = population size Xi = ith value of the variable X Where
  • 40. Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 3-40  Average of squared deviations of values from the mean  Population variance: Population Variance N μ)(X σ N 1i 2 i 2 ∑= − = Where μ = population mean N = population size Xi = ith value of the variable X
  • 41. Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 3-41 Population Standard Deviation  Most commonly used measure of variation  Shows variation about the mean  Has the same units as the original data  Population standard deviation: N μ)(X σ N 1i 2 i∑= − =
  • 42. Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 3-42  If the data distribution is bell-shaped, then the interval:  contains about 68% of the values in the population or the sample The Empirical Rule 1σμ ± μ 68% 1σμ ±
  • 43. Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 3-43  contains about 95% of the values in the population or the sample  contains about 99.7% of the values in the population or the sample The Empirical Rule 2σμ ± 3σμ ± 3σμ ± 99.7%95% 2σμ ±
  • 44. Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 3-44  Regardless of how the data are distributed, at least (1 - 1/k2 ) of the values will fall within k standard deviations of the mean (for k > 1)  Examples: (1 - 1/12 ) = 0% ……..... k=1 (μ ± 1σ) (1 - 1/22 ) = 75% …........ k=2 (μ ± 2σ) (1 - 1/32 ) = 89% ………. k=3 (μ ± 3σ) Bienaymé - Chebyshev Rule withinAt least
  • 45. Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 3-45 Exploratory Data Analysis  Box-and-Whisker Plot: A Graphical display of data using 5-number summary: Minimum -- Q1 -- Median -- Q3 -- Maximum Example: Minimum 1st Median 3rd Maximum Quartile Quartile Minimum 1st Median 3rd Maximum Quartile Quartile 25% 25% 25% 25%
  • 46. Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 3-46 Shape of Box-and-Whisker Plots  The Box and central line are centered between the endpoints if data are symmetric around the median  A Box-and-Whisker plot can be shown in either vertical or horizontal format Min Q1 Median Q3 Max
  • 47. Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 3-47 Distribution Shape and Box-and-Whisker Plot Right-SkewedLeft-Skewed Symmetric Q1 Q2 Q3 Q1 Q2 Q3 Q1 Q2 Q3
  • 48. Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 3-48 Box-and-Whisker Plot Example  Below is a Box-and-Whisker plot for the following data: 0 2 2 2 3 3 4 5 5 10 27  This data is right skewed, as the plot depicts 0 2 3 5 270 2 3 5 27 Min Q1 Q2 Q3 Max
  • 49. Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 3-49 The Sample Covariance  The sample covariance measures the strength of the linear relationship between two variables (called bivariate data)  The sample covariance:  Only concerned with the strength of the relationship  No causal effect is implied 1n )YY)(XX( )Y,X(cov n 1i ii − −− = ∑=
  • 50. Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 3-50  Covariance between two random variables: cov(X,Y) > 0 X and Y tend to move in the same direction cov(X,Y) < 0 X and Y tend to move in opposite directions cov(X,Y) = 0 X and Y are independent Interpreting Covariance
  • 51. Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 3-51 Coefficient of Correlation  Measures the relative strength of the linear relationship between two variables  Sample coefficient of correlation: YX n 1i 2 i n 1i 2 i n 1i ii SS )Y,X(cov )YY()XX( )YY)(XX( r = −− −− = ∑∑ ∑ == =
  • 52. Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 3-52 Features of Correlation Coefficient, r  Unit free  Ranges between –1 and 1  The closer to –1, the stronger the negative linear relationship  The closer to 1, the stronger the positive linear relationship  The closer to 0, the weaker any positive linear relationship
  • 53. Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 3-53 Scatter Plots of Data with Various Correlation Coefficients Y X Y X Y X Y X Y X r = -1 r = -.6 r = 0 r = +.3r = +1 Y X r = 0
  • 54. Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 3-54 Using Excel to Find the Correlation Coefficient  Select Tools/Data Analysis  Choose Correlation from the selection menu  Click OK . . .
  • 55. Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 3-55 Using Excel to Find the Correlation Coefficient  Input data range and select appropriate options  Click OK to get output (continued)
  • 56. Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 3-56 Interpreting the Result  r = .733  There is a relatively strong positive linear relationship between test score #1 and test score #2  Students who scored high on the first test tended to score high on second test Scatter Plot of Test Scores 70 75 80 85 90 95 100 70 75 80 85 90 95 100 Test #1 Score Test#2Score
  • 57. Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 3-57 Pitfalls in Numerical Descriptive Measures  Data analysis is objective  Should report the summary measures that best meet the assumptions about the data set  Data interpretation is subjective  Should be done in fair, neutral and clear manner
  • 58. Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 3-58 Ethical Considerations Numerical descriptive measures:  Should document both good and bad results  Should be presented in a fair, objective and neutral manner  Should not use inappropriate summary measures to distort facts
  • 59. Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 3-59 Chapter Summary  Described measures of central tendency  Mean, median, mode, geometric mean  Discussed quartiles  Described measures of variation  Range, interquartile range, variance and standard deviation, coefficient of variation  Illustrated shape of distribution  Symmetric, skewed, box-and-whisker plots
  • 60. Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 3-60 Chapter Summary  Discussed covariance and correlation coefficient  Addressed pitfalls in numerical descriptive measures and ethical considerations (continued)