SlideShare a Scribd company logo
Estimation S Q U A N T T E C H I N T E U Q I A S E V I T 1 0 S
Why Estimation ? [ From ] Inference For a given population Various statistical parameters are GIVEN or KNOWN Mean, Standard Deviation etc Task was to interpret them and take managerial decisions How many shirts to be stocked in the store ? Is the machine setup faulty ? Should we fix it [ To ] Estimation For the given population or sample Various statistical parameters are NOT KNOWN So managerial decisions cannot be taken UNLESS we can estimate the parameters
Two kinds of Estimates Point Estimate A single number that is used to estimate a given population parameter Mean Age = 22.3 Interval Estimate A range of values used to estimate a population parameter Mean Age is between 21.5 and 23 Difficulty of point estimate It is either right or wrong ! No way to know the quantum of error in the estimate Needs to accompanied by another estimate of the error that could have happened !!
Estimator Estimator A sample statistic that is used to estimate a population parameter Estimate A specific value of the statistic that is observed
Criteria for a Good Estimator Unbiased Example : Mean of  the sampling distribution of sample means   taken from the population   is equal to the  population mean  itself. Efficient Depends on the standard error of the statistic Standard error = standard deviation of the sampling distribution If standard error is low, estimator is efficient Consistent When sample size increases the value of the statistic comes closer and closer to the value of the parameter Sufficient Uses all  information that can be extracted from sample
Point Estimate Estimate of Mean Sample Mean S  x x =  n Estimate of Standard Deviation S  (x – x) 2 s 2  =  n – 1 We cannot use the lower statistic because S  (x – x) 2 s 2  =  n because it can be shown that it has a bias !
Where are the errors here ? Potential number of patrons at a very popular musical concert that is always sold out .. Estimator : Average number of tickets sold Telephone calls are billed by whole minutes even if the duration is a fraction. What is the average length of a call ? Estimator : Average billing for all calls made over a day / Rate per minute
Interval Estimate An interval estimate describes a range of values within which a population parameter is likely to lie Consider an interval estimate for the mean Start with a point estimate Find the likely error of this estimate Standard error is standard deviation of the estimator Make an interval estimate Defined in terms of the estimate and the standard error Find the probability that mean will fall in this interval estimate
Example Estimate the average battery life of a car in months From a sample size  of 200 we get x = 36 Standard error of sample mean  s s x   =  = 0.707  assuming   s  = 10 Now we can make an interval estimates like x –  s x   <   m   <  x +  s x   =>  35.293 <  m  < 36.707 x  – 2 s x  <  m   <  x  + 2 s x  =>  34.586 <  m  < 37.414 x  – 3 s x  <  m   <  x  + 3 s x  =>  33.879 <  m  < 38.121
Back to Probability Sampling distribution of the mean is also  normal  with Mean = 36.0 Standard Deviation ( Standard Error) = 0.707 So probability of the real mean lying between the limits given by the interval estimate is known ! 68.3% 95.5% 99.7%
Interval Estimate of Mean Probabilities are as follows  68.3% =>  35.293 <  m  < 36.707 95. 5 % =>  34.586 <  m  < 37.414 99.7% =>  33.879 <  m  < 38.121 Here we note that the probabilities are odd, fractional kind of numbers ... So how can we have simpler probabilities like 50% or 90% probability ?
Confidence Interval We observe that in a  normal  distribution 90% of the values lie within 1.64 s  of mean 99% of the values lie within 2.58 s  of mean So we redefine our interval estimates as 90% =>  34.84 <  m  < 37.16 99% =>  34.18 <  m  < 37.82
Confidence Intervals Original Limits  68.3% =>  35.293 <  m  < 36.707 95. 5 % =>  34.586 <  m  < 37.414 99.7% =>  33.879 <  m  < 38.121 More convenient limits 90% =>  34.84 <  m  < 37.16 99% =>  34.18 <  m  < 37.82 Limits 1 s 2 s 3 s 1.64 s 2.58 s
Is a “higher” confidence interval always better ? I am 99.999% sure that the average age of this class lies between 1 year and 50 years Does this really help you in anyway ? I am 95% sure that the average age of this class lies between 23 and 26 years This gives me a far better idea of where the average age of the class lies This information is better than the first information
95% confident that the mean battery life lies between 30 – 42 months Does NOT mean that  There is 95% probability that the mean life of all our batteries falls within the interval established from this one sample It DOES mean that  If we select many random samples of the same size and calculate a confidence interval for each of these samples then 95% of these intervals will contain the population mean
Calculation of Confidence Interval Example A large automotive parts wholesalers needs an  estimate  of the  mean life  that he can expect from a windshield wiper under normal driving conditions It is  known  that the  standard deviation  of the population life is 6  months Observations from 1 simple random sample of 100 blades is as follows  Sample Size n = 100 Sample Mean x = 21 months Population standard deviation  s  = 6 months
95% Confidence Interval Standard Error s s x   =  6 =  = 0.6 months 95% confidence level will include 47.5 % on each side of the mean Sample size is > 30 so we can assume that the sample mean follows a  normal distribution In a normal distribution 95% values lie within 1.96 times the standard deviation  95 % values of the sample mean lie within 1.96 times the standard error
Upper Confidence Limit Lower Confidence Limit Upper Confidence Limit x + 1.96  s x = 21 + 1.96 ( 0.6) = 22.18 months Lower Confidence Limit x – 1.96  s x = 21 – 1.96 ( 0.6) = 19.82 months Two major assumptions Standard Deviation of Population is known In reality this is may not be known The sampling distribution follows the normal distribution This assumption is valid only if the sample size is more than 30.
Standard deviation is not known  When Standard Deviation Known Standard Error of the Sample mean s s x   =  When Standard Deviation is  not  Known Standard Error of the sample mean  s s x   =  S  (x - x) 2 s  = n - 1 ^ ^ ^
How do we get this interval [Usually] we are trying to estimate the population mean  m We have an estimator E which [ in most cases ] is the sample mean. E follows a distribution that has mean  m  and standard error  s We create a statistic Q =  (E –  m )/ s Q follows a some distribution ( normal ? T ? ) We identify two values Q 1 , Q 2  such that probability of Q falling between Q 1  and Q 2  is equal to required confidence P Interval is E – Q 1 s  <  m  < E + Q 2 s
What is our goal ? What is known ? E,  s , P What is to be calculated Q 1 , Q 2 What is the objective To be confident that  Probability of  m   Lying between E – Q 1 s  and E + Q 2 s Is equal to P
What are the steps Identify an estimator What distribution does the estimator follow ? Is the standard deviation known ?  If not what is the estimator for the standard deviation Is the sample size big enough ? Get a value of the estimate Get a value of the standard error for the estimator Set an appropriate confidence level in terms of probability From the graph / table of the sampling distribution get the upper and lower limits in terms of estimate and the standard error
Confidence Intervals Revisited Original Limits  68.3% =>  35.293 <  m  < 36.707 95. 5 % =>  34.586 <  m  < 37.414 99.7% =>  33.879 <  m  < 38.121 More convenient limits 90% =>  34.84 <  m  < 37.16 99% =>  34.18 <  m  < 37.82 Point to note ... m  and  s  come from the estimate How do we connect  68.3%, 90%, 95.5%, 99%, 99.7% 1, 1.64, 2, 2.58, 3 Limits 1 s 2 s 3 s 1.64 s 2.58 s By looking at the  probability distribution function   of the estimator
Sample Size in Estimation What should be the sample size such that with a known population standard deviation, the sample size should be adequate to ensure an adequate confidence interval ?
Which distribution does the estimator follow ? So far ... and usually .. we assume that the estimator follows the normal distribution That is how we get 68.3% => 1.0 s 90.0% => 1.64 s 95.5% => 2.0 s 99.0% => 2.58 s 99.7% => 3.00 s 68.3% 95.5% 99.7%
The Student's t distribution Used when Standard deviation of the population is NOT known AND Sample size is less than 30 When this happens we cannot use the Normal distribution but must look up the tables for the T distribution
t-distribution instead of normal =NORMSINV( D5 +0.5) =TINV( $F5 ; G$2 )
Usage of t-table The probability that we are working with  IS NOT the probability that the estimated value will fall inside the confidence interval  INSTEAD It is the probability that the estimated value will fall OUTSIDE the confidence interval This probability is defined as  a Confidence = 1 –  a Degree of Freedom 1 – sample size
Binomial Distribution / Proportions We have a binomial distribution with p as the success probability 20% student  population  are engineers 45% of employees  population  are married We need to have an estimate of p Estimator is proportion  p  from  sample  Assumptions  Estimator follows normal distribution m  = np Standard error of estimate
Example Sample = 75 Fraction graduate p = 0.4 Fraction not graduate q = 0.6 Estimate of p = 0.4 Standard Error for Estimator = 0.057 99% confidence interval Z = 2.58 LCL  0.4 – 0.057 * 2.58 = 0.253 UCL  0.4 + 0.057 * 2.58 = 0.547

More Related Content

PPTX
CONCEPTS OF COST
PPTX
PPT
Law of demand
PPTX
Revenue concept
PPTX
Business cycle Managerial Economics
PPT
Producer Surplus basics
PPTX
Law of supply
PPT
Macroeconomic Models
CONCEPTS OF COST
Law of demand
Revenue concept
Business cycle Managerial Economics
Producer Surplus basics
Law of supply
Macroeconomic Models

What's hot (20)

PPT
Least cost combination
PPTX
Money market & its instruments
PPTX
Producer equilibrium
PPS
National income concepts
PPTX
Types of risks
PPTX
Market efficiency and emh
PDF
chapter 1. The Fundamentals of Managerial Economics
PPT
Gross National Product
PPTX
Isoquants and Isocosts.pptx
PDF
Economies of scale
PPTX
Producers equilibrium
PPTX
Microeconomics: Production and Cost
PPT
Commodity ppt
PPTX
demand curve
PPT
Cost Concepts
 
PPTX
Trading system in stock exchange
DOC
Derivatives & risk management
PPT
Monopoly
PPTX
Perfect competition
Least cost combination
Money market & its instruments
Producer equilibrium
National income concepts
Types of risks
Market efficiency and emh
chapter 1. The Fundamentals of Managerial Economics
Gross National Product
Isoquants and Isocosts.pptx
Economies of scale
Producers equilibrium
Microeconomics: Production and Cost
Commodity ppt
demand curve
Cost Concepts
 
Trading system in stock exchange
Derivatives & risk management
Monopoly
Perfect competition
Ad

Viewers also liked (20)

PPTX
Bitcoin, Blockchain and the Crypto Contracts - Part 2
PPTX
TRANSMISSION PLANNING CRITERIA
PPT
Estimation
PPTX
Introduction to z-Scores
PPTX
The t Test for Two Related Samples
PPTX
Z-scores: Location of Scores and Standardized Distributions
PPTX
Probability
PPTX
Probability and Samples: The Distribution of Sample Means
PPTX
Repeated-Measures and Two-Factor Analysis of Variance
PPTX
The t Test for Two Independent Samples
PPTX
Introduction to Analysis of Variance
PPTX
Introduction to Hypothesis Testing
PPTX
Measures of Central Tendency
PPTX
The Chi-Square Statistic: Tests for Goodness of Fit and Independence
PPTX
Introduction to Statistics
PPTX
Introduction to the t Statistic
PPTX
Frequency Distributions
PPTX
Correlation and Regression
PPTX
Measures of Variability
PPTX
Rural finance
Bitcoin, Blockchain and the Crypto Contracts - Part 2
TRANSMISSION PLANNING CRITERIA
Estimation
Introduction to z-Scores
The t Test for Two Related Samples
Z-scores: Location of Scores and Standardized Distributions
Probability
Probability and Samples: The Distribution of Sample Means
Repeated-Measures and Two-Factor Analysis of Variance
The t Test for Two Independent Samples
Introduction to Analysis of Variance
Introduction to Hypothesis Testing
Measures of Central Tendency
The Chi-Square Statistic: Tests for Goodness of Fit and Independence
Introduction to Statistics
Introduction to the t Statistic
Frequency Distributions
Correlation and Regression
Measures of Variability
Rural finance
Ad

Similar to QT1 - 07 - Estimation (20)

PPTX
M1-4 Estimasi Titik dan Intervaltttt.pptx
DOCX
Module 7 Interval estimatorsMaster for Business Statistics.docx
PDF
Ch3_Statistical Analysis and Random Error Estimation.pdf
PPTX
STATISTICAL-INTERVALS 2202 Math 403.pptx
PPTX
Estimation and confidence interval
PPTX
Point and Interval Estimation
PPT
Ch. 2 (B) Theory of Estimation ABOUT MARKETING
PDF
Estimation and hypothesis testing (2).pdf
PPTX
CH 06; ESTIMATION OF Sample and Population Mean
PPTX
Data Analysis - Confirmatory Data Analysis.pptx
PPTX
Intro to data science
PPTX
Introduction To Data Science Using R
PPT
Chapter 1(5)Measurement and Error
PDF
Lec 5 statistical intervals
PPTX
statistics chapter 4 PowerPoint for accounting studens.ppt
PPT
ERM-4b-finalERM-4b-finaERM-4b-finaERM-4b-fina.ppt
PPTX
Measures of Variation
PDF
Estimasi Parameter (Teori Estimasi)
PPTX
Estimating-parameters.pptxhajjanakkakwknss
PDF
Taxi for Professor Evans
M1-4 Estimasi Titik dan Intervaltttt.pptx
Module 7 Interval estimatorsMaster for Business Statistics.docx
Ch3_Statistical Analysis and Random Error Estimation.pdf
STATISTICAL-INTERVALS 2202 Math 403.pptx
Estimation and confidence interval
Point and Interval Estimation
Ch. 2 (B) Theory of Estimation ABOUT MARKETING
Estimation and hypothesis testing (2).pdf
CH 06; ESTIMATION OF Sample and Population Mean
Data Analysis - Confirmatory Data Analysis.pptx
Intro to data science
Introduction To Data Science Using R
Chapter 1(5)Measurement and Error
Lec 5 statistical intervals
statistics chapter 4 PowerPoint for accounting studens.ppt
ERM-4b-finalERM-4b-finaERM-4b-finaERM-4b-fina.ppt
Measures of Variation
Estimasi Parameter (Teori Estimasi)
Estimating-parameters.pptxhajjanakkakwknss
Taxi for Professor Evans

More from Prithwis Mukerjee (20)

PDF
Bitcoin, Blockchain and Crypto Contracts - Part 3
PDF
Internet of Things
PDF
Thought controlled devices
PDF
Cloudcasting
PDF
Currency, Commodity and Bitcoins
PDF
Data Science
PPT
05 OLAP v6 weekend
ODP
04 Dimensional Analysis - v6
PDF
Thought control
PPT
World of data @ praxis 2013 v2
ODP
BIS 08a - Application Development - II Version 2
PPT
Lecture02 - Data Mining & Analytics
ODP
ইন্টার্নেট কি এবং কেন ?
PPT
Data mining clustering-2009-v0
PPT
Data mining classification-2009-v0
PPT
Data mining arm-2009-v0
PPT
Data mining intro-2009-v2
PPT
PPT
Business Intelligence Industry Perspective Session I
PPT
OLAP Cubes in Datawarehousing
Bitcoin, Blockchain and Crypto Contracts - Part 3
Internet of Things
Thought controlled devices
Cloudcasting
Currency, Commodity and Bitcoins
Data Science
05 OLAP v6 weekend
04 Dimensional Analysis - v6
Thought control
World of data @ praxis 2013 v2
BIS 08a - Application Development - II Version 2
Lecture02 - Data Mining & Analytics
ইন্টার্নেট কি এবং কেন ?
Data mining clustering-2009-v0
Data mining classification-2009-v0
Data mining arm-2009-v0
Data mining intro-2009-v2
Business Intelligence Industry Perspective Session I
OLAP Cubes in Datawarehousing

Recently uploaded (20)

PDF
Supply Chain Operations Speaking Notes -ICLT Program
PPTX
Cell Structure & Organelles in detailed.
PDF
ANTIBIOTICS.pptx.pdf………………… xxxxxxxxxxxxx
PPTX
IMMUNITY IMMUNITY refers to protection against infection, and the immune syst...
PPTX
Microbial diseases, their pathogenesis and prophylaxis
PDF
Microbial disease of the cardiovascular and lymphatic systems
PPTX
Renaissance Architecture: A Journey from Faith to Humanism
PDF
Basic Mud Logging Guide for educational purpose
PPTX
Introduction to Child Health Nursing – Unit I | Child Health Nursing I | B.Sc...
PPTX
The Healthy Child – Unit II | Child Health Nursing I | B.Sc Nursing 5th Semester
PDF
Abdominal Access Techniques with Prof. Dr. R K Mishra
PPTX
Institutional Correction lecture only . . .
PPTX
Pharma ospi slides which help in ospi learning
PPTX
PPH.pptx obstetrics and gynecology in nursing
PDF
Origin of periodic table-Mendeleev’s Periodic-Modern Periodic table
PDF
Saundersa Comprehensive Review for the NCLEX-RN Examination.pdf
PDF
Business Ethics Teaching Materials for college
PDF
Chapter 2 Heredity, Prenatal Development, and Birth.pdf
PPTX
PPT- ENG7_QUARTER1_LESSON1_WEEK1. IMAGERY -DESCRIPTIONS pptx.pptx
PDF
The Lost Whites of Pakistan by Jahanzaib Mughal.pdf
Supply Chain Operations Speaking Notes -ICLT Program
Cell Structure & Organelles in detailed.
ANTIBIOTICS.pptx.pdf………………… xxxxxxxxxxxxx
IMMUNITY IMMUNITY refers to protection against infection, and the immune syst...
Microbial diseases, their pathogenesis and prophylaxis
Microbial disease of the cardiovascular and lymphatic systems
Renaissance Architecture: A Journey from Faith to Humanism
Basic Mud Logging Guide for educational purpose
Introduction to Child Health Nursing – Unit I | Child Health Nursing I | B.Sc...
The Healthy Child – Unit II | Child Health Nursing I | B.Sc Nursing 5th Semester
Abdominal Access Techniques with Prof. Dr. R K Mishra
Institutional Correction lecture only . . .
Pharma ospi slides which help in ospi learning
PPH.pptx obstetrics and gynecology in nursing
Origin of periodic table-Mendeleev’s Periodic-Modern Periodic table
Saundersa Comprehensive Review for the NCLEX-RN Examination.pdf
Business Ethics Teaching Materials for college
Chapter 2 Heredity, Prenatal Development, and Birth.pdf
PPT- ENG7_QUARTER1_LESSON1_WEEK1. IMAGERY -DESCRIPTIONS pptx.pptx
The Lost Whites of Pakistan by Jahanzaib Mughal.pdf

QT1 - 07 - Estimation

  • 1. Estimation S Q U A N T T E C H I N T E U Q I A S E V I T 1 0 S
  • 2. Why Estimation ? [ From ] Inference For a given population Various statistical parameters are GIVEN or KNOWN Mean, Standard Deviation etc Task was to interpret them and take managerial decisions How many shirts to be stocked in the store ? Is the machine setup faulty ? Should we fix it [ To ] Estimation For the given population or sample Various statistical parameters are NOT KNOWN So managerial decisions cannot be taken UNLESS we can estimate the parameters
  • 3. Two kinds of Estimates Point Estimate A single number that is used to estimate a given population parameter Mean Age = 22.3 Interval Estimate A range of values used to estimate a population parameter Mean Age is between 21.5 and 23 Difficulty of point estimate It is either right or wrong ! No way to know the quantum of error in the estimate Needs to accompanied by another estimate of the error that could have happened !!
  • 4. Estimator Estimator A sample statistic that is used to estimate a population parameter Estimate A specific value of the statistic that is observed
  • 5. Criteria for a Good Estimator Unbiased Example : Mean of the sampling distribution of sample means taken from the population is equal to the population mean itself. Efficient Depends on the standard error of the statistic Standard error = standard deviation of the sampling distribution If standard error is low, estimator is efficient Consistent When sample size increases the value of the statistic comes closer and closer to the value of the parameter Sufficient Uses all information that can be extracted from sample
  • 6. Point Estimate Estimate of Mean Sample Mean S x x = n Estimate of Standard Deviation S (x – x) 2 s 2 = n – 1 We cannot use the lower statistic because S (x – x) 2 s 2 = n because it can be shown that it has a bias !
  • 7. Where are the errors here ? Potential number of patrons at a very popular musical concert that is always sold out .. Estimator : Average number of tickets sold Telephone calls are billed by whole minutes even if the duration is a fraction. What is the average length of a call ? Estimator : Average billing for all calls made over a day / Rate per minute
  • 8. Interval Estimate An interval estimate describes a range of values within which a population parameter is likely to lie Consider an interval estimate for the mean Start with a point estimate Find the likely error of this estimate Standard error is standard deviation of the estimator Make an interval estimate Defined in terms of the estimate and the standard error Find the probability that mean will fall in this interval estimate
  • 9. Example Estimate the average battery life of a car in months From a sample size of 200 we get x = 36 Standard error of sample mean s s x = = 0.707 assuming s = 10 Now we can make an interval estimates like x – s x < m < x + s x => 35.293 < m < 36.707 x – 2 s x < m < x + 2 s x => 34.586 < m < 37.414 x – 3 s x < m < x + 3 s x => 33.879 < m < 38.121
  • 10. Back to Probability Sampling distribution of the mean is also normal with Mean = 36.0 Standard Deviation ( Standard Error) = 0.707 So probability of the real mean lying between the limits given by the interval estimate is known ! 68.3% 95.5% 99.7%
  • 11. Interval Estimate of Mean Probabilities are as follows 68.3% => 35.293 < m < 36.707 95. 5 % => 34.586 < m < 37.414 99.7% => 33.879 < m < 38.121 Here we note that the probabilities are odd, fractional kind of numbers ... So how can we have simpler probabilities like 50% or 90% probability ?
  • 12. Confidence Interval We observe that in a normal distribution 90% of the values lie within 1.64 s of mean 99% of the values lie within 2.58 s of mean So we redefine our interval estimates as 90% => 34.84 < m < 37.16 99% => 34.18 < m < 37.82
  • 13. Confidence Intervals Original Limits 68.3% => 35.293 < m < 36.707 95. 5 % => 34.586 < m < 37.414 99.7% => 33.879 < m < 38.121 More convenient limits 90% => 34.84 < m < 37.16 99% => 34.18 < m < 37.82 Limits 1 s 2 s 3 s 1.64 s 2.58 s
  • 14. Is a “higher” confidence interval always better ? I am 99.999% sure that the average age of this class lies between 1 year and 50 years Does this really help you in anyway ? I am 95% sure that the average age of this class lies between 23 and 26 years This gives me a far better idea of where the average age of the class lies This information is better than the first information
  • 15. 95% confident that the mean battery life lies between 30 – 42 months Does NOT mean that There is 95% probability that the mean life of all our batteries falls within the interval established from this one sample It DOES mean that If we select many random samples of the same size and calculate a confidence interval for each of these samples then 95% of these intervals will contain the population mean
  • 16. Calculation of Confidence Interval Example A large automotive parts wholesalers needs an estimate of the mean life that he can expect from a windshield wiper under normal driving conditions It is known that the standard deviation of the population life is 6 months Observations from 1 simple random sample of 100 blades is as follows Sample Size n = 100 Sample Mean x = 21 months Population standard deviation s = 6 months
  • 17. 95% Confidence Interval Standard Error s s x = 6 = = 0.6 months 95% confidence level will include 47.5 % on each side of the mean Sample size is > 30 so we can assume that the sample mean follows a normal distribution In a normal distribution 95% values lie within 1.96 times the standard deviation 95 % values of the sample mean lie within 1.96 times the standard error
  • 18. Upper Confidence Limit Lower Confidence Limit Upper Confidence Limit x + 1.96 s x = 21 + 1.96 ( 0.6) = 22.18 months Lower Confidence Limit x – 1.96 s x = 21 – 1.96 ( 0.6) = 19.82 months Two major assumptions Standard Deviation of Population is known In reality this is may not be known The sampling distribution follows the normal distribution This assumption is valid only if the sample size is more than 30.
  • 19. Standard deviation is not known When Standard Deviation Known Standard Error of the Sample mean s s x = When Standard Deviation is not Known Standard Error of the sample mean s s x = S (x - x) 2 s = n - 1 ^ ^ ^
  • 20. How do we get this interval [Usually] we are trying to estimate the population mean m We have an estimator E which [ in most cases ] is the sample mean. E follows a distribution that has mean m and standard error s We create a statistic Q = (E – m )/ s Q follows a some distribution ( normal ? T ? ) We identify two values Q 1 , Q 2 such that probability of Q falling between Q 1 and Q 2 is equal to required confidence P Interval is E – Q 1 s < m < E + Q 2 s
  • 21. What is our goal ? What is known ? E, s , P What is to be calculated Q 1 , Q 2 What is the objective To be confident that Probability of m Lying between E – Q 1 s and E + Q 2 s Is equal to P
  • 22. What are the steps Identify an estimator What distribution does the estimator follow ? Is the standard deviation known ? If not what is the estimator for the standard deviation Is the sample size big enough ? Get a value of the estimate Get a value of the standard error for the estimator Set an appropriate confidence level in terms of probability From the graph / table of the sampling distribution get the upper and lower limits in terms of estimate and the standard error
  • 23. Confidence Intervals Revisited Original Limits 68.3% => 35.293 < m < 36.707 95. 5 % => 34.586 < m < 37.414 99.7% => 33.879 < m < 38.121 More convenient limits 90% => 34.84 < m < 37.16 99% => 34.18 < m < 37.82 Point to note ... m and s come from the estimate How do we connect 68.3%, 90%, 95.5%, 99%, 99.7% 1, 1.64, 2, 2.58, 3 Limits 1 s 2 s 3 s 1.64 s 2.58 s By looking at the probability distribution function of the estimator
  • 24. Sample Size in Estimation What should be the sample size such that with a known population standard deviation, the sample size should be adequate to ensure an adequate confidence interval ?
  • 25. Which distribution does the estimator follow ? So far ... and usually .. we assume that the estimator follows the normal distribution That is how we get 68.3% => 1.0 s 90.0% => 1.64 s 95.5% => 2.0 s 99.0% => 2.58 s 99.7% => 3.00 s 68.3% 95.5% 99.7%
  • 26. The Student's t distribution Used when Standard deviation of the population is NOT known AND Sample size is less than 30 When this happens we cannot use the Normal distribution but must look up the tables for the T distribution
  • 27. t-distribution instead of normal =NORMSINV( D5 +0.5) =TINV( $F5 ; G$2 )
  • 28. Usage of t-table The probability that we are working with IS NOT the probability that the estimated value will fall inside the confidence interval INSTEAD It is the probability that the estimated value will fall OUTSIDE the confidence interval This probability is defined as a Confidence = 1 – a Degree of Freedom 1 – sample size
  • 29. Binomial Distribution / Proportions We have a binomial distribution with p as the success probability 20% student population are engineers 45% of employees population are married We need to have an estimate of p Estimator is proportion p from sample Assumptions Estimator follows normal distribution m = np Standard error of estimate
  • 30. Example Sample = 75 Fraction graduate p = 0.4 Fraction not graduate q = 0.6 Estimate of p = 0.4 Standard Error for Estimator = 0.057 99% confidence interval Z = 2.58 LCL 0.4 – 0.057 * 2.58 = 0.253 UCL 0.4 + 0.057 * 2.58 = 0.547