SlideShare a Scribd company logo
2
Most read
3
Most read
11
Most read
Alessandro Ortis
Università degli Studi di Catania
Dipartimento di Matematica e Informatica
Image Processing Lab - iplab.dmi.unict.it
Sufficient statistics
Sufficient statistics
Parameter estimation: given a sample X = (x1, x2, … xn)
from a population with pdf P(X|θ), we try to infer θ
from some information represented by X.
A. Ortis – Sufficient statistics
Sufficient statistics
Could be useful finding a reduced representation of X
by means a function F(X)?
Ex:
X T(X)= mean(X)
4 5 6 5
5 5 5 5
3 5 7 5
A. Ortis – Sufficient statistics
Sufficient statistics
[ 4, 5, 6]
T(X) = 5 [ 5, 5, 5]
[ 3, 5, 7]
...
Is there any loss of information ? Have we lost useful
data or the representation given by T(X) is enought to
infer the same information about θ conteined in X ?
A. Ortis – Sufficient statistics
Sufficient statistics
[ 4, 5, 6]
T(X) = 5 [ 5, 5, 5]
[ 3, 5, 7]
….
Is it sufficient to consider only the reduced data T(X)?
A. Ortis – Sufficient statistics
Sufficient statistics
Def .
A statistic T(X) is sufficient for θ if P(X|T(X)) is
not a function of θ.
A. Ortis – Sufficient statistics
Sufficient statistics
Example: Let (x1, x2, … xn) be a random sample of n Bernoulli(p)
trials
x =
1 𝑤𝑖𝑡ℎ 𝑝𝑟𝑜𝑏. 𝑝
0 𝑤𝑖𝑡ℎ 𝑝𝑟𝑜𝑏. 1 − 𝑝
Can we find a sufficient statistic for p?
Considering the definition of sufficiency, can we find a function
T(X) such that P(X|T(X)) is independent from p?
(solution in the next slide...)
A. Ortis – Sufficient statistics
Sufficient statistics
This conditional distribution does not depend on p!
Once the value of T(X) is known, no other functon of X
will provide any additiona information about p.
If T(X) = 𝑋𝑖 = t
we have P(X | T(X)) =
1
𝑛
𝑡
A. Ortis – Sufficient statistics
Sufficient statistics
A sufficient statistic T(X) reduces X in two senses:
1) We can reduce the dimensionality of data
2) The possible values assumed by T(X) are fewer
A. Ortis – Sufficient statistics
Sufficient statistics
A statistic T(X) induces a partition on the sample space.
Given a value t, we can define the subset
𝐴 𝑡 = 𝑋: 𝑇 𝑋 = 𝑡
A. Ortis – Sufficient statistics
Sufficient statistics
Bernoulli population with n=3, the sample space of X is
0,0,0 ; 0,0,1 ; 0,1,0 ; 0,1,1 ; 1,0,0 ;
1,0,1 ; 1,1,0 ; 1,1,1
A. Ortis – Sufficient statistics
Sufficient statistics
0,0,0 ; 0,0,1 ; 0,1,0 ; 0,1,1 ; 1,0,0 ;
1,0,1 ; 1,1,0 ; 1,1,1
t Induced subset
0 { 0,0,0 }
1 { 0,0,1 ; 0,1,0 ; 1,0,0 }
2 { 0,1,1 ; 1,1,0 ; 1,0,1 }
3 { 1,1,1 }
A. Ortis – Sufficient statistics
Sufficient statistics
Theorem:
T(X) is a sufficient statistic for θ sif the likelihood
factorizes into the following form
L(x1, x2, … xn | θ ) = g( θ, T(x1, x2, … xn))·h(x1, x2, … xn)
A. Ortis – Sufficient statistics
Sufficient statistics
Theorem:
T(X) is a sufficient statistic for θ sif the likelihood
factorizes into the following form
L(x1, x2, … xn | θ ) = g( θ, T(x1, x2, … xn))·h(x1, x2, … xn)
θ and X interact only via T(X)
A. Ortis – Sufficient statistics
Sufficient statistics
Def.
T is a minimal sufficient statistic if the following statements
are true:
1. T is sufficient
2. If S is any other sufficient statistic then T = g(U) for some
function g
A. Ortis – Sufficient statistics
Sufficient statistics
In other words, T generates the coarsest sufficient partition.
A minimal sufficient statistic is the smallest sufficient
statistic and therefore it represents the ultimate data
reduction with respect to estimating θ . In general, it may or
may not exists.
A. Ortis – Sufficient statistics
Sufficient statistics
Theorem:
T(X) is a minimal sufficient statistics if
P(𝑥1, 𝑥2, … 𝑥 𝑛 | 𝜃)
P(𝑦1, 𝑦2, … 𝑦𝑛 | 𝜃)
𝑖𝑠 𝑛𝑜𝑡 𝑎 𝑓𝑢𝑛𝑐𝑡𝑖𝑜𝑛 𝑜𝑓 𝜃
𝑇(𝑥1, 𝑥2, … 𝑥 𝑛) = 𝑇(𝑦1, 𝑦2, … 𝑦𝑛)
A. Ortis – Sufficient statistics
Sufficient statistics
• T(X) may not exist
• If so, is not unique
• Any 1-1 function of a sufficient statistic which does
not depends on 𝜃 is also a sufficient statistic
• All we considered so far on sufficiency can easily be
extended to accommodate two (or more)
parameters.
A. Ortis – Sufficient statistics
Sufficient statistics
Example: let (x1, x2, … xn) be N(μ,σ2) observations.
Let 𝜃1 = μ e 𝜃2 = σ2 we have that
T(X) = ( 𝑋𝑖, 𝑋𝑖
2
)
T(X) = ( 𝑋, 𝑆2)
Are both minimal sufficient statistics for N(μ,σ2)
A. Ortis – Sufficient statistics

More Related Content

PPT
Law of large numbers
PPT
Linear regression
PPTX
Theory of estimation
PPTX
Probability Density Function (PDF)
PPTX
Uniform Distribution
PPTX
Fuzzy Sets Introduction With Example
PPT
Chap07 interval estimation
PPTX
Introduction to Maximum Likelihood Estimator
Law of large numbers
Linear regression
Theory of estimation
Probability Density Function (PDF)
Uniform Distribution
Fuzzy Sets Introduction With Example
Chap07 interval estimation
Introduction to Maximum Likelihood Estimator

What's hot (20)

PPTX
Curve fitting
PPTX
Regression Analysis presentation by Al Arizmendez and Cathryn Lottier
PPTX
Probability distributions
PPTX
Bernoulli distribution
PPTX
Bays theorem of probability
PPT
Simplex method
PPTX
Binomial and Poission Probablity distribution
PPTX
A power point presentation on statistics
PPT
Regression
PDF
Linear regression theory
PPT
Probability By Ms Aarti
PPT
Ch08 1
PPTX
Heteroscedasticity | Eonomics
PPTX
Properties of estimators (blue)
PDF
TEXTBOOK ON MATHEMATICAL ECONOMICS FOR CU , BU CALCUTTA , SOLVED EXERCISES , ...
PPT
Probability And Probability Distributions
PPTX
Dimensions
PPTX
Big M method
DOC
Deterministic vs stochastic
PDF
Probability cheatsheet
Curve fitting
Regression Analysis presentation by Al Arizmendez and Cathryn Lottier
Probability distributions
Bernoulli distribution
Bays theorem of probability
Simplex method
Binomial and Poission Probablity distribution
A power point presentation on statistics
Regression
Linear regression theory
Probability By Ms Aarti
Ch08 1
Heteroscedasticity | Eonomics
Properties of estimators (blue)
TEXTBOOK ON MATHEMATICAL ECONOMICS FOR CU , BU CALCUTTA , SOLVED EXERCISES , ...
Probability And Probability Distributions
Dimensions
Big M method
Deterministic vs stochastic
Probability cheatsheet
Ad

Viewers also liked (13)

PDF
07 Analysis of Algorithms: Order Statistics
PDF
Estimation theory 1
PDF
Multivariate normal proof
PPT
Medians and order statistics
PDF
Augmenting Data Structures
PPTX
median and order statistics
PPT
Algorithm
PDF
Session 9 10
PPT
lecture 11
PDF
A Measure Of Independence For A Multifariate Normal Distribution And Some Con...
PPTX
Medians and Order Statistics
PPT
Next higher number with same number of binary bits set
PPT
Algorithm Design and Complexity - Course 8
07 Analysis of Algorithms: Order Statistics
Estimation theory 1
Multivariate normal proof
Medians and order statistics
Augmenting Data Structures
median and order statistics
Algorithm
Session 9 10
lecture 11
A Measure Of Independence For A Multifariate Normal Distribution And Some Con...
Medians and Order Statistics
Next higher number with same number of binary bits set
Algorithm Design and Complexity - Course 8
Ad

Similar to Sufficient statistics (20)

PDF
Sufficiency
PDF
Contribution of Fixed Point Theorem in Quasi Metric Spaces
PDF
Information Theory Mike Brookes E4.40, ISE4.51, SO20.pdf
PDF
Machine Learning With MapReduce, K-Means, MLE
PDF
Unique fixed point theorems for generalized weakly contractive condition in o...
PPT
Machine Learning
PDF
Insufficient Gibbs sampling (A. Luciano, C.P. Robert and R. Ryder)
PDF
Classification
PDF
Approximate Bayesian model choice via random forests
PDF
bayesian_statistics_introduction_uppsala_university
PDF
A Geometric Note on a Type of Multiple Testing-07-24-2015
PDF
COMMON FIXED POINT THEOREMS IN COMPATIBLE MAPPINGS OF TYPE (P*) OF GENERALIZE...
PDF
COMMON FIXED POINT THEOREMS IN COMPATIBLE MAPPINGS OF TYPE (P*) OF GENERALIZE...
PDF
COMMON FIXED POINT THEOREMS IN COMPATIBLE MAPPINGS OF TYPE (P*) OF GENERALIZE...
PDF
COMMON FIXED POINT THEOREMS IN COMPATIBLE MAPPINGS OF TYPE (P*) OF GENERALIZE...
PDF
Proba stats-r1-2017
PDF
Inequalities #2
PDF
Inequality, slides #2
PDF
Fisher_info_ppt and mathematical process to find time domain and frequency do...
PDF
11.[29 35]a unique common fixed point theorem under psi varphi contractive co...
Sufficiency
Contribution of Fixed Point Theorem in Quasi Metric Spaces
Information Theory Mike Brookes E4.40, ISE4.51, SO20.pdf
Machine Learning With MapReduce, K-Means, MLE
Unique fixed point theorems for generalized weakly contractive condition in o...
Machine Learning
Insufficient Gibbs sampling (A. Luciano, C.P. Robert and R. Ryder)
Classification
Approximate Bayesian model choice via random forests
bayesian_statistics_introduction_uppsala_university
A Geometric Note on a Type of Multiple Testing-07-24-2015
COMMON FIXED POINT THEOREMS IN COMPATIBLE MAPPINGS OF TYPE (P*) OF GENERALIZE...
COMMON FIXED POINT THEOREMS IN COMPATIBLE MAPPINGS OF TYPE (P*) OF GENERALIZE...
COMMON FIXED POINT THEOREMS IN COMPATIBLE MAPPINGS OF TYPE (P*) OF GENERALIZE...
COMMON FIXED POINT THEOREMS IN COMPATIBLE MAPPINGS OF TYPE (P*) OF GENERALIZE...
Proba stats-r1-2017
Inequalities #2
Inequality, slides #2
Fisher_info_ppt and mathematical process to find time domain and frequency do...
11.[29 35]a unique common fixed point theorem under psi varphi contractive co...

Recently uploaded (20)

PDF
BET Eukaryotic signal Transduction BET Eukaryotic signal Transduction.pdf
PDF
GROUP 2 ORIGINAL PPT. pdf Hhfiwhwifhww0ojuwoadwsfjofjwsofjw
PPTX
Microbes in human welfare class 12 .pptx
PDF
Placing the Near-Earth Object Impact Probability in Context
PPT
1. INTRODUCTION TO EPIDEMIOLOGY.pptx for community medicine
PPTX
BIOMOLECULES PPT........................
PDF
Worlds Next Door: A Candidate Giant Planet Imaged in the Habitable Zone of ↵ ...
PDF
The Land of Punt — A research by Dhani Irwanto
PPTX
Fluid dynamics vivavoce presentation of prakash
PPTX
POULTRY PRODUCTION AND MANAGEMENTNNN.pptx
PPTX
ap-psych-ch-1-introduction-to-psychology-presentation.pptx
PDF
CHAPTER 2 The Chemical Basis of Life Lecture Outline.pdf
PDF
Science Form five needed shit SCIENEce so
PPT
Animal tissues, epithelial, muscle, connective, nervous tissue
PPTX
PMR- PPT.pptx for students and doctors tt
PPT
THE CELL THEORY AND ITS FUNDAMENTALS AND USE
PPTX
perinatal infections 2-171220190027.pptx
PPT
veterinary parasitology ````````````.ppt
PPT
Mutation in dna of bacteria and repairss
PDF
Unit 5 Preparations, Reactions, Properties and Isomersim of Organic Compounds...
BET Eukaryotic signal Transduction BET Eukaryotic signal Transduction.pdf
GROUP 2 ORIGINAL PPT. pdf Hhfiwhwifhww0ojuwoadwsfjofjwsofjw
Microbes in human welfare class 12 .pptx
Placing the Near-Earth Object Impact Probability in Context
1. INTRODUCTION TO EPIDEMIOLOGY.pptx for community medicine
BIOMOLECULES PPT........................
Worlds Next Door: A Candidate Giant Planet Imaged in the Habitable Zone of ↵ ...
The Land of Punt — A research by Dhani Irwanto
Fluid dynamics vivavoce presentation of prakash
POULTRY PRODUCTION AND MANAGEMENTNNN.pptx
ap-psych-ch-1-introduction-to-psychology-presentation.pptx
CHAPTER 2 The Chemical Basis of Life Lecture Outline.pdf
Science Form five needed shit SCIENEce so
Animal tissues, epithelial, muscle, connective, nervous tissue
PMR- PPT.pptx for students and doctors tt
THE CELL THEORY AND ITS FUNDAMENTALS AND USE
perinatal infections 2-171220190027.pptx
veterinary parasitology ````````````.ppt
Mutation in dna of bacteria and repairss
Unit 5 Preparations, Reactions, Properties and Isomersim of Organic Compounds...

Sufficient statistics

  • 1. Alessandro Ortis Università degli Studi di Catania Dipartimento di Matematica e Informatica Image Processing Lab - iplab.dmi.unict.it Sufficient statistics
  • 2. Sufficient statistics Parameter estimation: given a sample X = (x1, x2, … xn) from a population with pdf P(X|θ), we try to infer θ from some information represented by X. A. Ortis – Sufficient statistics
  • 3. Sufficient statistics Could be useful finding a reduced representation of X by means a function F(X)? Ex: X T(X)= mean(X) 4 5 6 5 5 5 5 5 3 5 7 5 A. Ortis – Sufficient statistics
  • 4. Sufficient statistics [ 4, 5, 6] T(X) = 5 [ 5, 5, 5] [ 3, 5, 7] ... Is there any loss of information ? Have we lost useful data or the representation given by T(X) is enought to infer the same information about θ conteined in X ? A. Ortis – Sufficient statistics
  • 5. Sufficient statistics [ 4, 5, 6] T(X) = 5 [ 5, 5, 5] [ 3, 5, 7] …. Is it sufficient to consider only the reduced data T(X)? A. Ortis – Sufficient statistics
  • 6. Sufficient statistics Def . A statistic T(X) is sufficient for θ if P(X|T(X)) is not a function of θ. A. Ortis – Sufficient statistics
  • 7. Sufficient statistics Example: Let (x1, x2, … xn) be a random sample of n Bernoulli(p) trials x = 1 𝑤𝑖𝑡ℎ 𝑝𝑟𝑜𝑏. 𝑝 0 𝑤𝑖𝑡ℎ 𝑝𝑟𝑜𝑏. 1 − 𝑝 Can we find a sufficient statistic for p? Considering the definition of sufficiency, can we find a function T(X) such that P(X|T(X)) is independent from p? (solution in the next slide...) A. Ortis – Sufficient statistics
  • 8. Sufficient statistics This conditional distribution does not depend on p! Once the value of T(X) is known, no other functon of X will provide any additiona information about p. If T(X) = 𝑋𝑖 = t we have P(X | T(X)) = 1 𝑛 𝑡 A. Ortis – Sufficient statistics
  • 9. Sufficient statistics A sufficient statistic T(X) reduces X in two senses: 1) We can reduce the dimensionality of data 2) The possible values assumed by T(X) are fewer A. Ortis – Sufficient statistics
  • 10. Sufficient statistics A statistic T(X) induces a partition on the sample space. Given a value t, we can define the subset 𝐴 𝑡 = 𝑋: 𝑇 𝑋 = 𝑡 A. Ortis – Sufficient statistics
  • 11. Sufficient statistics Bernoulli population with n=3, the sample space of X is 0,0,0 ; 0,0,1 ; 0,1,0 ; 0,1,1 ; 1,0,0 ; 1,0,1 ; 1,1,0 ; 1,1,1 A. Ortis – Sufficient statistics
  • 12. Sufficient statistics 0,0,0 ; 0,0,1 ; 0,1,0 ; 0,1,1 ; 1,0,0 ; 1,0,1 ; 1,1,0 ; 1,1,1 t Induced subset 0 { 0,0,0 } 1 { 0,0,1 ; 0,1,0 ; 1,0,0 } 2 { 0,1,1 ; 1,1,0 ; 1,0,1 } 3 { 1,1,1 } A. Ortis – Sufficient statistics
  • 13. Sufficient statistics Theorem: T(X) is a sufficient statistic for θ sif the likelihood factorizes into the following form L(x1, x2, … xn | θ ) = g( θ, T(x1, x2, … xn))·h(x1, x2, … xn) A. Ortis – Sufficient statistics
  • 14. Sufficient statistics Theorem: T(X) is a sufficient statistic for θ sif the likelihood factorizes into the following form L(x1, x2, … xn | θ ) = g( θ, T(x1, x2, … xn))·h(x1, x2, … xn) θ and X interact only via T(X) A. Ortis – Sufficient statistics
  • 15. Sufficient statistics Def. T is a minimal sufficient statistic if the following statements are true: 1. T is sufficient 2. If S is any other sufficient statistic then T = g(U) for some function g A. Ortis – Sufficient statistics
  • 16. Sufficient statistics In other words, T generates the coarsest sufficient partition. A minimal sufficient statistic is the smallest sufficient statistic and therefore it represents the ultimate data reduction with respect to estimating θ . In general, it may or may not exists. A. Ortis – Sufficient statistics
  • 17. Sufficient statistics Theorem: T(X) is a minimal sufficient statistics if P(𝑥1, 𝑥2, … 𝑥 𝑛 | 𝜃) P(𝑦1, 𝑦2, … 𝑦𝑛 | 𝜃) 𝑖𝑠 𝑛𝑜𝑡 𝑎 𝑓𝑢𝑛𝑐𝑡𝑖𝑜𝑛 𝑜𝑓 𝜃 𝑇(𝑥1, 𝑥2, … 𝑥 𝑛) = 𝑇(𝑦1, 𝑦2, … 𝑦𝑛) A. Ortis – Sufficient statistics
  • 18. Sufficient statistics • T(X) may not exist • If so, is not unique • Any 1-1 function of a sufficient statistic which does not depends on 𝜃 is also a sufficient statistic • All we considered so far on sufficiency can easily be extended to accommodate two (or more) parameters. A. Ortis – Sufficient statistics
  • 19. Sufficient statistics Example: let (x1, x2, … xn) be N(μ,σ2) observations. Let 𝜃1 = μ e 𝜃2 = σ2 we have that T(X) = ( 𝑋𝑖, 𝑋𝑖 2 ) T(X) = ( 𝑋, 𝑆2) Are both minimal sufficient statistics for N(μ,σ2) A. Ortis – Sufficient statistics