SlideShare a Scribd company logo
6
Most read
7
Most read
10
Most read
MEAN SQUARED ERROR (MSE) OF AN
ESTIMATOR
By
Suruchi Somwanshi
M.Sc. (Mathematics)
M.Sc. (Statistics)
CONTENT
Introduction
Mean Squared Error
Mean Squared Error Consistency
Consistent Estimator
Unbiased Estimator
Tips
INTRODUCTION
Let ෡
𝜃 be the estimator of the unknown parameter θ
from the random sample X1, X2, · · · , Xn.
Then clearly the deviation from ෡
𝜃 to the true value of
θ, | ෡
𝜃 − θ|, measures the quality of the estimator, or
equivalently, we can use { ෡
𝜃 - }2 for the ease of
computation.
Since ෡
𝜃 is a random variable, we should take average
to evaluation the quality of the estimator.
Thus, we introduce the following Definition:
The mean square error (MSE) of an estimator ෡
𝜃 of a
parameter θ is the function of θ defined by E{෡
𝜃 −}2, and this is
denoted as 𝑀𝑆𝐸෡
𝜃 .
This is also called the risk function of an estimator, with
{෡
𝜃 −}2 called the quadratic loss function.
The expectation is with respect to the random variables X1, · ·
· , Xn since they are the only random components in the
expression.
Notice that the MSE measures the average squared
difference between the estimator ෡
𝜃 and the parameter θ, a
somewhat reasonable measure of performance for an
estimator.
 In general, any increasing function of the absolute distance
| ෡
𝜃 − θ| would serve to measure the goodness of an
estimator (mean absolute error, E{ ෡
𝜃 −} 2 ), is a reasonable
alternative.
But MSE has at least two advantages over other distance
measures: First, it is analytically tractable and, secondly, it
has the interpretation
MEAN SQUARED ERROR (MSE)
The Mean Square Error (MSE) of an estimator for estimating  is
𝑀𝑆𝐸𝜃
መ
𝜃 = 𝐸( መ
𝜃 − 𝜃)2
= 𝑉𝑎𝑟 መ
𝜃 + (𝐵𝑖𝑎𝑠𝜃
መ
𝜃 )2
If 𝑀𝑆𝐸𝜃
መ
𝜃 is smaller, መ
𝜃 is a better estimator of 𝜃.
For two estimators, መ
𝜃1 and መ
𝜃2 𝑜𝑓𝜃,
If 𝑀𝑆𝐸𝜃
መ
𝜃1 < 𝑀𝑆𝐸𝜃
መ
𝜃2 , 𝜃 ∈ Ω
መ
𝜃1 is better estimator of 𝜃 than መ
𝜃2.
𝑀𝑆𝐸෡
𝜃 = E{෡
𝜃 −}2
= Var(෡
𝜃 ) + {𝐸(෡
𝜃 )−}2
= Var(෡
𝜃 ) + {𝐵𝑎𝑖𝑠 𝑜𝑓 ෡
𝜃 }2
This is so because
E{෡
𝜃 −}2 = E(෡
𝜃 2 ) + E(2) − 2θE(෡
𝜃 )
= Var(෡
𝜃 ) + {𝐸(෡
𝜃 )}2 + −2 − 2θE(෡
𝜃 )
= Var(෡
𝜃 ) + {𝐸(෡
𝜃 )−}2
MEAN SQUARED ERROR CONSISTENCY
෡
𝜃 is called mean squared error consistent (or consistent in
quadratic mean) if,
E{෡
𝜃 - }2→ 0 as n → .
Theorem: ෡
𝜃 is consistent in MSE iff
i) Var(෡
𝜃 )→ 0 as n → .

 =

→
)
ˆ
(
lim
) E
ii
n
If E{෡
𝜃 −}2→ 0 as n → , ෡
𝜃 is also a CE of .
CONSISTENT ESTIMATOR (CE): An estimator which
converges in probability to an unknown parameter  for all
   is called a CE of .
ˆ .
p
 
⎯⎯
→
For large n, a CE tends to be closer to the unknown
population parameter.
MLEs are generally CEs.
UNBIASED ESTIMATOR (UE):
We know that, the bias of an estimator ෡
𝜃 of a parameter θ is
the difference between the expected value of ෡
𝜃 and θ; that is,
Bias (෡
𝜃 ) = E(෡
𝜃 )−θ.
And an estimator whose bias is identically equal to 0 is called
unbiased estimator and satisfies
E(෡
𝜃 ) = θ for all θ.
For an unbiased estimator ෡
𝜃 , we have
𝑀𝑆𝐸෡
𝜃 = E{෡
𝜃 −}2 = Var(෡
𝜃 ) + {𝐸(෡
𝜃 )−}2 = Var(෡
𝜃 )
and so, if an estimator is unbiased, its MSE is equal to its
variance.
TIPS
MSE has two components,
1. One measures the variability of the estimator (precision)
and
2. Other measures its bias (accuracy).
An estimator that has good MSE properties has small
combined variance and bias.
To find an estimator with good MSE properties, we need to
find estimators that control both variance and bias.
THANK YOU

More Related Content

PDF
Bias and Mean square error
PDF
Estimation theory 1
PPTX
Point Estimation
PPTX
Normal Distribution
PPTX
Repeated Measures ANOVA
PPTX
Negative binomial distribution
PPTX
Binomial and Poission Probablity distribution
Bias and Mean square error
Estimation theory 1
Point Estimation
Normal Distribution
Repeated Measures ANOVA
Negative binomial distribution
Binomial and Poission Probablity distribution

What's hot (20)

PPTX
Sampling Distribution
PPTX
Point estimation
PPTX
What is a single sample t test?
PPTX
Bayes Theorem
PPTX
Regression (Linear Regression and Logistic Regression) by Akanksha Bali
PPTX
Introduction to principal component analysis (pca)
PPTX
Introduction to the t Statistic
PPTX
Spearman rank correlation coefficient
PPTX
Binomial distribution
PPTX
Uniform Distribution
PDF
Statistics lecture 8 (chapter 7)
ODP
Introduction to Principle Component Analysis
PPTX
Normal as Approximation to Binomial
PPT
Continuous Random variable
PPTX
Sampling Distributions
PPTX
Hypothesis testing , T test , chi square test, z test
PPT
Simple linear regression
PDF
Ordinal Logistic Regression
PPTX
Discrete and Continuous Random Variables
PPTX
Binomial probability distribution
Sampling Distribution
Point estimation
What is a single sample t test?
Bayes Theorem
Regression (Linear Regression and Logistic Regression) by Akanksha Bali
Introduction to principal component analysis (pca)
Introduction to the t Statistic
Spearman rank correlation coefficient
Binomial distribution
Uniform Distribution
Statistics lecture 8 (chapter 7)
Introduction to Principle Component Analysis
Normal as Approximation to Binomial
Continuous Random variable
Sampling Distributions
Hypothesis testing , T test , chi square test, z test
Simple linear regression
Ordinal Logistic Regression
Discrete and Continuous Random Variables
Binomial probability distribution
Ad

Similar to Mean Squared Error (MSE) of an Estimator (20)

PDF
Statistical parameters
PDF
Simple Linear Regression
PDF
Econometrics 1 Slide from the masters degree 1
PDF
IUT Probability and Statistics - Chapter 02_Part-1.pdf
PDF
Estimation Theory Class (Summary and Revision)
PDF
Bio-L8- Correlation and Regression Analysis.pdf
PPTX
Dmitrii Tihonkih - The Iterative Closest Points Algorithm and Affine Transfo...
PPT
Statistics08_Cut_Regression.jdnkdjvbjddj
PPTX
State estimation
PPTX
Linear regression, costs & gradient descent
PPTX
Basic of Statistical Inference Part-III: The Theory of Estimation from Dexlab...
PPTX
Simple Regression.pptx
PDF
InnerSoft STATS - Methods and formulas help
PDF
ch02.pdf
PPTX
Tutorial on EM algorithm – Part 1
PDF
Chapter 14 Part I
PDF
A Mathematical Model for the Enhancement of Stress Induced Hypoglycaemia by A...
PDF
A Mathematical Model for the Enhancement of Stress Induced Hypoglycaemia by A...
PDF
Class of Estimators of Population Median Using New Parametric Relationship fo...
Statistical parameters
Simple Linear Regression
Econometrics 1 Slide from the masters degree 1
IUT Probability and Statistics - Chapter 02_Part-1.pdf
Estimation Theory Class (Summary and Revision)
Bio-L8- Correlation and Regression Analysis.pdf
Dmitrii Tihonkih - The Iterative Closest Points Algorithm and Affine Transfo...
Statistics08_Cut_Regression.jdnkdjvbjddj
State estimation
Linear regression, costs & gradient descent
Basic of Statistical Inference Part-III: The Theory of Estimation from Dexlab...
Simple Regression.pptx
InnerSoft STATS - Methods and formulas help
ch02.pdf
Tutorial on EM algorithm – Part 1
Chapter 14 Part I
A Mathematical Model for the Enhancement of Stress Induced Hypoglycaemia by A...
A Mathematical Model for the Enhancement of Stress Induced Hypoglycaemia by A...
Class of Estimators of Population Median Using New Parametric Relationship fo...
Ad

Recently uploaded (20)

PPT
The World of Physical Science, • Labs: Safety Simulation, Measurement Practice
PPTX
INTRODUCTION TO EVS | Concept of sustainability
PDF
bbec55_b34400a7914c42429908233dbd381773.pdf
PPTX
DRUG THERAPY FOR SHOCK gjjjgfhhhhh.pptx.
PDF
CAPERS-LRD-z9:AGas-enshroudedLittleRedDotHostingaBroad-lineActive GalacticNuc...
PDF
VARICELLA VACCINATION: A POTENTIAL STRATEGY FOR PREVENTING MULTIPLE SCLEROSIS
PPTX
Derivatives of integument scales, beaks, horns,.pptx
PPTX
Comparative Structure of Integument in Vertebrates.pptx
PDF
Formation of Supersonic Turbulence in the Primordial Star-forming Cloud
PPTX
cpcsea ppt.pptxssssssssssssssjjdjdndndddd
PPTX
neck nodes and dissection types and lymph nodes levels
PDF
The scientific heritage No 166 (166) (2025)
PPTX
Taita Taveta Laboratory Technician Workshop Presentation.pptx
PPTX
Introduction to Fisheries Biotechnology_Lesson 1.pptx
PDF
Mastering Bioreactors and Media Sterilization: A Complete Guide to Sterile Fe...
PDF
Placing the Near-Earth Object Impact Probability in Context
PDF
HPLC-PPT.docx high performance liquid chromatography
PPTX
Cell Membrane: Structure, Composition & Functions
PDF
IFIT3 RNA-binding activity primores influenza A viruz infection and translati...
PPTX
Protein & Amino Acid Structures Levels of protein structure (primary, seconda...
The World of Physical Science, • Labs: Safety Simulation, Measurement Practice
INTRODUCTION TO EVS | Concept of sustainability
bbec55_b34400a7914c42429908233dbd381773.pdf
DRUG THERAPY FOR SHOCK gjjjgfhhhhh.pptx.
CAPERS-LRD-z9:AGas-enshroudedLittleRedDotHostingaBroad-lineActive GalacticNuc...
VARICELLA VACCINATION: A POTENTIAL STRATEGY FOR PREVENTING MULTIPLE SCLEROSIS
Derivatives of integument scales, beaks, horns,.pptx
Comparative Structure of Integument in Vertebrates.pptx
Formation of Supersonic Turbulence in the Primordial Star-forming Cloud
cpcsea ppt.pptxssssssssssssssjjdjdndndddd
neck nodes and dissection types and lymph nodes levels
The scientific heritage No 166 (166) (2025)
Taita Taveta Laboratory Technician Workshop Presentation.pptx
Introduction to Fisheries Biotechnology_Lesson 1.pptx
Mastering Bioreactors and Media Sterilization: A Complete Guide to Sterile Fe...
Placing the Near-Earth Object Impact Probability in Context
HPLC-PPT.docx high performance liquid chromatography
Cell Membrane: Structure, Composition & Functions
IFIT3 RNA-binding activity primores influenza A viruz infection and translati...
Protein & Amino Acid Structures Levels of protein structure (primary, seconda...

Mean Squared Error (MSE) of an Estimator

  • 1. MEAN SQUARED ERROR (MSE) OF AN ESTIMATOR By Suruchi Somwanshi M.Sc. (Mathematics) M.Sc. (Statistics)
  • 2. CONTENT Introduction Mean Squared Error Mean Squared Error Consistency Consistent Estimator Unbiased Estimator Tips
  • 3. INTRODUCTION Let ෡ 𝜃 be the estimator of the unknown parameter θ from the random sample X1, X2, · · · , Xn. Then clearly the deviation from ෡ 𝜃 to the true value of θ, | ෡ 𝜃 − θ|, measures the quality of the estimator, or equivalently, we can use { ෡ 𝜃 - }2 for the ease of computation. Since ෡ 𝜃 is a random variable, we should take average to evaluation the quality of the estimator.
  • 4. Thus, we introduce the following Definition: The mean square error (MSE) of an estimator ෡ 𝜃 of a parameter θ is the function of θ defined by E{෡ 𝜃 −}2, and this is denoted as 𝑀𝑆𝐸෡ 𝜃 . This is also called the risk function of an estimator, with {෡ 𝜃 −}2 called the quadratic loss function. The expectation is with respect to the random variables X1, · · · , Xn since they are the only random components in the expression.
  • 5. Notice that the MSE measures the average squared difference between the estimator ෡ 𝜃 and the parameter θ, a somewhat reasonable measure of performance for an estimator.  In general, any increasing function of the absolute distance | ෡ 𝜃 − θ| would serve to measure the goodness of an estimator (mean absolute error, E{ ෡ 𝜃 −} 2 ), is a reasonable alternative. But MSE has at least two advantages over other distance measures: First, it is analytically tractable and, secondly, it has the interpretation
  • 6. MEAN SQUARED ERROR (MSE) The Mean Square Error (MSE) of an estimator for estimating  is 𝑀𝑆𝐸𝜃 መ 𝜃 = 𝐸( መ 𝜃 − 𝜃)2 = 𝑉𝑎𝑟 መ 𝜃 + (𝐵𝑖𝑎𝑠𝜃 መ 𝜃 )2 If 𝑀𝑆𝐸𝜃 መ 𝜃 is smaller, መ 𝜃 is a better estimator of 𝜃. For two estimators, መ 𝜃1 and መ 𝜃2 𝑜𝑓𝜃, If 𝑀𝑆𝐸𝜃 መ 𝜃1 < 𝑀𝑆𝐸𝜃 መ 𝜃2 , 𝜃 ∈ Ω መ 𝜃1 is better estimator of 𝜃 than መ 𝜃2.
  • 7. 𝑀𝑆𝐸෡ 𝜃 = E{෡ 𝜃 −}2 = Var(෡ 𝜃 ) + {𝐸(෡ 𝜃 )−}2 = Var(෡ 𝜃 ) + {𝐵𝑎𝑖𝑠 𝑜𝑓 ෡ 𝜃 }2 This is so because E{෡ 𝜃 −}2 = E(෡ 𝜃 2 ) + E(2) − 2θE(෡ 𝜃 ) = Var(෡ 𝜃 ) + {𝐸(෡ 𝜃 )}2 + −2 − 2θE(෡ 𝜃 ) = Var(෡ 𝜃 ) + {𝐸(෡ 𝜃 )−}2
  • 8. MEAN SQUARED ERROR CONSISTENCY ෡ 𝜃 is called mean squared error consistent (or consistent in quadratic mean) if, E{෡ 𝜃 - }2→ 0 as n → . Theorem: ෡ 𝜃 is consistent in MSE iff i) Var(෡ 𝜃 )→ 0 as n → .   =  → ) ˆ ( lim ) E ii n If E{෡ 𝜃 −}2→ 0 as n → , ෡ 𝜃 is also a CE of .
  • 9. CONSISTENT ESTIMATOR (CE): An estimator which converges in probability to an unknown parameter  for all    is called a CE of . ˆ . p   ⎯⎯ → For large n, a CE tends to be closer to the unknown population parameter. MLEs are generally CEs.
  • 10. UNBIASED ESTIMATOR (UE): We know that, the bias of an estimator ෡ 𝜃 of a parameter θ is the difference between the expected value of ෡ 𝜃 and θ; that is, Bias (෡ 𝜃 ) = E(෡ 𝜃 )−θ. And an estimator whose bias is identically equal to 0 is called unbiased estimator and satisfies E(෡ 𝜃 ) = θ for all θ. For an unbiased estimator ෡ 𝜃 , we have 𝑀𝑆𝐸෡ 𝜃 = E{෡ 𝜃 −}2 = Var(෡ 𝜃 ) + {𝐸(෡ 𝜃 )−}2 = Var(෡ 𝜃 ) and so, if an estimator is unbiased, its MSE is equal to its variance.
  • 11. TIPS MSE has two components, 1. One measures the variability of the estimator (precision) and 2. Other measures its bias (accuracy). An estimator that has good MSE properties has small combined variance and bias. To find an estimator with good MSE properties, we need to find estimators that control both variance and bias.