Measures of Dispersion                              Variance and the standard deviation
  •    Concept and objective                                      Variance is the average squared deviation from the mean
  •    Range, inter-quartile range (IQR =Q3 – Q1)
  •    Variance and standard deviation (sd)                                 1    N

  •    Computation
                                                                   σ2 =
                                                                            N
                                                                                ∑(X
                                                                                i =1
                                                                                        i   − µ )2
                                                                                                          x    x                x       x
  •    Chebyshev’s Inequality                                                                             X1   Xi    µ          X2      XN
  •    Relative dispersion -- the coefficient of
                                                                       σ, the standard deviation is the (+ve) square root of the variance
       variation
                                                                  In the sample variance calculation, use the denominator (n-1)
                           σ
                  C.V. =     × 100%
                           µ                                                     1 n
                                                                        S2 =         ∑ ( X i − X )2
                                                                               n − 1 i =1




            Chebyshev’s Inequality
                                                                          Chebyshev’s Inequality:
      • At least (1 - 1/k2) proportion of the data must be
        within k standard deviation of the mean.                               Illustration
      • Here k is a number (not necessarily an integer)
        greater than 1.
      • The statement is valid for ANY distribution,
        discrete or continuous, symmetric or otherwise.
      • If a r.v. X has a mean µ and a standard deviation σ
      then a equivalent probability statement is :
                                                                µ-3σ         µ-2σ       µ-σ           µ    µ+σ           µ+2σ        µ+3σ
                                           1
             P [ | X − µ | > kσ ] ≤                                                            At least 75%
                                           k2
                                                                                       At least 88.89%




       Understanding Standard deviation
                                                                    The Coefficient of Variation:
294 MLA of WB has an average wealth of 68 Lakh                     A measure of relative dispersion
and s.d. = 10 Lakh. What does that tell you?                                                   σ    S
                                                                                                 or                 × 100 %
  In particular, what can you say about % of MLAs                                              µ    X
    having wealth                                             • Unit free
  • between 58L and 78L ?                                     • Amenable to comparison
                                                              • Often expressed in terms of percentages
  • between 53L and 83L ?
  • Less than 53 L or more than 83L?
  • between 50L and 1 crore ?
  • More than 1 crore?




                                                                                                                                             1
Box Plot                                                                           Box Plot
                                                                                                                Elements of a Box Plot
                                                                                                     Smallest data                          Largest data point
                                                                                                     point not below                        not exceeding        Suspected
                                                                                 Outlier             inner fence                            inner fence          outlier




  o                X                                                 X    *      o               X                                                 X
                                                                                                                                                                  *


                                                                                                 Inner                 Q1   Median
                                                                                 Outer                                               Q3             Inner                    Outer
                                                                                 Fence           Fence                                              Fence                    Fence
                                                                                                Q1-1.5(IQR)                 Interquartile         Q3+1.5(IQR)
                                                                                                                             Range
                                                                                 Q1-3(IQR)                                                                            Q3+3(IQR)




           Review Descriptive Statistics
                                                                                              Poll Forecasting – Exit polls
• Graphical representations, frequency
  distribution                                                                             • Exit Polls in US election 2000 in the critical
                                                                                             state of Florida
• Measures of central tendency and dispersion
      – Computation and interpretation                                                     • Indian Election 2004, 2009,
      – Chebychev’s result                                                                 • UP 2012
• Skewness and Kurtosis                                                                    • Karnataka 2008
• Outliers
      – What are they? How to detect?
      – What to do if there are outliers?




             A simple example: Overview                                                        Some of the questions to be
  Population = all projects undertaken by a company                                                    answered
  An unknown proportion π of them took longer than scheduled
                                          *                                                • To what degree, the randomness in Y can be
   * Delay                                         *
                                *      ** *                  *                               attributed to sampling fluctuations?
                                          *         *
                            *
                                        * *
                                                                                           • How close is Y = p to π ?
                                 *              *                                                           n
                                                        **
                            *          * *      *                *                         • If we want p to be within ±0.05 of π , how
                                                                                             many projects do we need to look at?
A random sample of n projects are selected. Y of them are found to be delayed.

  (sample outcome) Y is random, but the randomness depends on π.

  Given the value of Y, one can make an objective inference about π.




                                                                                                                                                                                     2
Myth and Mystery of Probability
                                                        Overview of Probability
• What is chance of getting any rain today in   • Approaches for defining probability
  the campus?
• What is the probability that India will win   • Basic Probability rules
  WC2015?
• What is the chance that India’s space         • Conditional probability and notion of
  mission will send a human being to moon         independence
  by 2020?

                                                • Bayes’ rule




      Approaches for defining
                                                              Probability Laws
           probability
• Classical approach                            •   0 ≤ P[A] ≤1;
                                                •   P[impossible event]=0; P[Sure event]=1
• (Asymptotic) Relative frequency approach      •   P[A or B] = P[A] + P[B] - P[AB]
                                                •   In particular, P[not A] = 1- P[A]
• Subjective probability                        •   Look at the Venn diagram and write down
                                                    other formulae like
                                                       • P[A] = P[A and B] + P[A and (not B)]




                                                                                                3

More Related Content

PDF
Real time information reconstruction -The Prediction Market
PDF
Dynamic Trading Volume
PDF
Hedging, Arbitrage, and Optimality with Superlinear Frictions
PDF
Transaction Costs Made Tractable
PDF
multiple linear reggression model
PPTX
Topic 6 stat probability theory
PPT
MTH120_Chapter7
Real time information reconstruction -The Prediction Market
Dynamic Trading Volume
Hedging, Arbitrage, and Optimality with Superlinear Frictions
Transaction Costs Made Tractable
multiple linear reggression model
Topic 6 stat probability theory
MTH120_Chapter7

What's hot (20)

PDF
Interview Preparation
PDF
UT Austin - Portugal Lectures on Portfolio Choice
PDF
Fundamental Theorem of Asset Pricing
PPT
9. administración y pronóstico de la demanda
PDF
Who Should Sell Stocks?
PDF
Detecting Drug Effects in the Brain
PDF
Parameter Estimation for Semiparametric Models with CMARS and Its Applications
PPT
Risk08a
PDF
Pr 2-bayesian decision
PDF
Slides edf-1
PDF
Transactional Data Mining
DOC
Final
PDF
/.Amd mnt/lotus/host/home/jaishakthi/presentation/tencon/tencon
PPT
Black scholas theory for venu(RVSKVK)
PDF
AP Calculus - Semester Review
PDF
Statistics lecture 10(ch10)
PDF
Fuzzy logic in ATM
PDF
Chapter 3 projection
PDF
60 Years Birthday, 30 Years of Ground Breaking Innovation: A Tribute to Bruno...
PDF
Statistics lecture 9 (chapter 8)
Interview Preparation
UT Austin - Portugal Lectures on Portfolio Choice
Fundamental Theorem of Asset Pricing
9. administración y pronóstico de la demanda
Who Should Sell Stocks?
Detecting Drug Effects in the Brain
Parameter Estimation for Semiparametric Models with CMARS and Its Applications
Risk08a
Pr 2-bayesian decision
Slides edf-1
Transactional Data Mining
Final
/.Amd mnt/lotus/host/home/jaishakthi/presentation/tencon/tencon
Black scholas theory for venu(RVSKVK)
AP Calculus - Semester Review
Statistics lecture 10(ch10)
Fuzzy logic in ATM
Chapter 3 projection
60 Years Birthday, 30 Years of Ground Breaking Innovation: A Tribute to Bruno...
Statistics lecture 9 (chapter 8)
Ad

Viewers also liked (14)

PPTX
M2 l1 relations functions
PDF
ProjectX-workbook.Jan27
PDF
Clustering - Scheda corso LEN
PDF
Tqm: total quality management - Scheda corso LEN
PPTX
MRC Presentation Community Outreach
PDF
Linea
PDF
MS PowerPoint 2003 - Scheda corso LEN
PDF
3D Visualization Techniques
PDF
Shahid Lecture-12- MKAG1273
PDF
GoogleCTF 2016 [No Big Deal] Write-Up (ver.korean)
PDF
VNMC Mekong Dam Study: Modelling
PPTX
Agrobacterium mediated transformation
PDF
Mpc 006 - 02-01 product moment coefficient of correlation
M2 l1 relations functions
ProjectX-workbook.Jan27
Clustering - Scheda corso LEN
Tqm: total quality management - Scheda corso LEN
MRC Presentation Community Outreach
Linea
MS PowerPoint 2003 - Scheda corso LEN
3D Visualization Techniques
Shahid Lecture-12- MKAG1273
GoogleCTF 2016 [No Big Deal] Write-Up (ver.korean)
VNMC Mekong Dam Study: Modelling
Agrobacterium mediated transformation
Mpc 006 - 02-01 product moment coefficient of correlation
Ad

Similar to Session 2 (20)

PPT
Basic statistics
PPT
Correlation of Discrete-Time Signals.ppt
PPT
Randomness conductors
PPTX
Hypergeometric Distribution
PPTX
Hypergeometric Distribution
PDF
Exchange confirm
PDF
Lecture on solving1
PDF
Perceptron Arboles De Decision
PDF
PPTX
Statr sessions 9 to 10
PDF
Ssp notes
DOC
Simplex3
PPT
PPTX
Modeling and interpretation
PPT
Normal distribution stat
PDF
アーベル論理でセッション型
PPTX
Probability, Discrete Probability, Normal Probabilty
PDF
Social Network Analysis
PDF
Mhf4 U Trig
Basic statistics
Correlation of Discrete-Time Signals.ppt
Randomness conductors
Hypergeometric Distribution
Hypergeometric Distribution
Exchange confirm
Lecture on solving1
Perceptron Arboles De Decision
Statr sessions 9 to 10
Ssp notes
Simplex3
Modeling and interpretation
Normal distribution stat
アーベル論理でセッション型
Probability, Discrete Probability, Normal Probabilty
Social Network Analysis
Mhf4 U Trig

More from vivek_shaw (20)

PPTX
Lecture 11 market structure- perfect competition
PPTX
Lecture 9 costs
PPTX
Lecture 8 production, optimal inputs
PPTX
Lecture 8 production, optimal inputs (1)
PPTX
Lecture 3 dds sand elasticity
PPTX
Game theory 3
PPTX
Game theory 1
PPTX
Game theory 2
DOCX
Ford motors
PPTX
Consumertheory1
PPT
Consumer theory 2
PPTX
Class2 market, demand and supply
PPTX
Auctions 1
PPTX
Costs2
PPTX
Policy implications
PPT
Man org session 3-org and technology_5th july 2012
PPT
Man org session 14_org decision making_16th august 2012
PPT
Man org session 12_org learning_3rd august 2012
PPT
Man org session 11 interorganizational relationships_2nd august 2012
PPT
Man org session 10_org control_27th july 2012
Lecture 11 market structure- perfect competition
Lecture 9 costs
Lecture 8 production, optimal inputs
Lecture 8 production, optimal inputs (1)
Lecture 3 dds sand elasticity
Game theory 3
Game theory 1
Game theory 2
Ford motors
Consumertheory1
Consumer theory 2
Class2 market, demand and supply
Auctions 1
Costs2
Policy implications
Man org session 3-org and technology_5th july 2012
Man org session 14_org decision making_16th august 2012
Man org session 12_org learning_3rd august 2012
Man org session 11 interorganizational relationships_2nd august 2012
Man org session 10_org control_27th july 2012

Session 2

  • 1. Measures of Dispersion Variance and the standard deviation • Concept and objective Variance is the average squared deviation from the mean • Range, inter-quartile range (IQR =Q3 – Q1) • Variance and standard deviation (sd) 1 N • Computation σ2 = N ∑(X i =1 i − µ )2 x x x x • Chebyshev’s Inequality X1 Xi µ X2 XN • Relative dispersion -- the coefficient of σ, the standard deviation is the (+ve) square root of the variance variation In the sample variance calculation, use the denominator (n-1) σ C.V. = × 100% µ 1 n S2 = ∑ ( X i − X )2 n − 1 i =1 Chebyshev’s Inequality Chebyshev’s Inequality: • At least (1 - 1/k2) proportion of the data must be within k standard deviation of the mean. Illustration • Here k is a number (not necessarily an integer) greater than 1. • The statement is valid for ANY distribution, discrete or continuous, symmetric or otherwise. • If a r.v. X has a mean µ and a standard deviation σ then a equivalent probability statement is : µ-3σ µ-2σ µ-σ µ µ+σ µ+2σ µ+3σ 1 P [ | X − µ | > kσ ] ≤ At least 75% k2 At least 88.89% Understanding Standard deviation The Coefficient of Variation: 294 MLA of WB has an average wealth of 68 Lakh A measure of relative dispersion and s.d. = 10 Lakh. What does that tell you? σ S or × 100 % In particular, what can you say about % of MLAs µ X having wealth • Unit free • between 58L and 78L ? • Amenable to comparison • Often expressed in terms of percentages • between 53L and 83L ? • Less than 53 L or more than 83L? • between 50L and 1 crore ? • More than 1 crore? 1
  • 2. Box Plot Box Plot Elements of a Box Plot Smallest data Largest data point point not below not exceeding Suspected Outlier inner fence inner fence outlier o X X * o X X * Inner Q1 Median Outer Q3 Inner Outer Fence Fence Fence Fence Q1-1.5(IQR) Interquartile Q3+1.5(IQR) Range Q1-3(IQR) Q3+3(IQR) Review Descriptive Statistics Poll Forecasting – Exit polls • Graphical representations, frequency distribution • Exit Polls in US election 2000 in the critical state of Florida • Measures of central tendency and dispersion – Computation and interpretation • Indian Election 2004, 2009, – Chebychev’s result • UP 2012 • Skewness and Kurtosis • Karnataka 2008 • Outliers – What are they? How to detect? – What to do if there are outliers? A simple example: Overview Some of the questions to be Population = all projects undertaken by a company answered An unknown proportion π of them took longer than scheduled * • To what degree, the randomness in Y can be * Delay * * ** * * attributed to sampling fluctuations? * * * * * • How close is Y = p to π ? * * n ** * * * * * • If we want p to be within ±0.05 of π , how many projects do we need to look at? A random sample of n projects are selected. Y of them are found to be delayed. (sample outcome) Y is random, but the randomness depends on π. Given the value of Y, one can make an objective inference about π. 2
  • 3. Myth and Mystery of Probability Overview of Probability • What is chance of getting any rain today in • Approaches for defining probability the campus? • What is the probability that India will win • Basic Probability rules WC2015? • What is the chance that India’s space • Conditional probability and notion of mission will send a human being to moon independence by 2020? • Bayes’ rule Approaches for defining Probability Laws probability • Classical approach • 0 ≤ P[A] ≤1; • P[impossible event]=0; P[Sure event]=1 • (Asymptotic) Relative frequency approach • P[A or B] = P[A] + P[B] - P[AB] • In particular, P[not A] = 1- P[A] • Subjective probability • Look at the Venn diagram and write down other formulae like • P[A] = P[A and B] + P[A and (not B)] 3