Reliability of estimates in socio-demographic groups with small samples
D. Buono
Statistical Office of the European Union
19 August 2016, SAE, Maastricht
All opinions expressed are those of the author
Facts and figures about Eurostat
• About 800 people with 28 different nationalities
• Small central methodology team
• TS, Econometrics, SDC, research & EA
• Plus domain methodologists networking
• Statistical office, but not an independent authority: a Directorate-General of the European Commission
• Subsidiarity principle!
Eurostat core business
• Euro-zone (19) & EU (28) aggregates
• Harmonization, best practices, guidelines, training & international cooperation
Why interested in SAE?
• European regional policies
• Different sizes of Member States, primary data providers
• According to the EU 2011 Population Census there are
79,652,380 residents in DE and 512,353 in LU!!!
• Some dilemmas:
• How big is a small area?
• Can SAE help with data breakdown demand by users?
Outline
• Reliability of indicators
• At-risk-of-poverty indicators
• SAE techniques for Official Statistics
• Application for 2 EU countries
• Learnings and open questions
• Ads and EU research funds for SAE expertise
Some Notation
• U – finite population of size N
• D – number of socio-demographic groups in the target population
• s – sample of size n
• s_d – sub-sample from domain d, of size n_d
• r – non-sampled elements, of size N - n
• r_d – non-sampled elements from domain d, of size N_d - n_d
• y – target variable
• X – vector of auxiliary information
Indicator of interest: ARPT
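The formulas on this slide did not survive as text. As a hedged reminder only: under the usual Eurostat convention, the at-risk-of-poverty threshold is 60% of the national median equivalised disposable income, and the rate in a domain is the share of persons below it. Assuming y is that income variable, a sketch in the notation of the previous slide follows. The symbols z, U_d, w_i and F_d are introduced here for illustration and are not taken from the slides.

```latex
z = 0.6 \cdot \operatorname{median}_{i \in U}(y_i),
\qquad
F_d = \frac{1}{N_d} \sum_{i \in U_d} \mathbf{1}(y_i < z),
\qquad
\hat{F}_d^{\mathrm{dir}} =
  \frac{\sum_{i \in s_d} w_i \, \mathbf{1}(y_i < z)}{\sum_{i \in s_d} w_i}
```

Here U_d is the set of population units in domain d and w_i is the survey weight of unit i. The last expression is a weighted (Hájek-type) direct estimator, which uses only the n_d sampled units of the domain.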
Estimation methods
Empirical Bayes (EB) method
Hierarchical Bayes (HB) method
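These slides are title-only in this transcript. As a sketch under stated assumptions: the function names ebBHF and pbmseBHF point to the unit-level nested-error model of Battese, Harter and Fuller (BHF), usually fitted to a transformed income variable. The symbols below (β, u_d, e_di, the variance components, L and the draw index) are illustrative and not taken from the slides.

```latex
y_{di} = x_{di}^{\top} \beta + u_d + e_{di},
\qquad
u_d \sim N(0, \sigma_u^2), \quad e_{di} \sim N(0, \sigma_e^2)

\hat{F}_d^{\mathrm{EB}} = E\left[ F_d \mid y_s \right]
  \approx \frac{1}{L} \sum_{\ell=1}^{L} F_d^{(\ell)}
```

In the EB approach the model parameters are replaced by estimates and the expectation is approximated by Monte Carlo draws of the non-sampled values. In the HB approach, priors are placed on the model parameters and the estimate is a posterior summary such as the posterior mean.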
Packages and functions used
• sae.R
• Functions: direct, ebBHF, pbmseBHF
• hbsae.R
• Functions: fSAE, fSAE.Area
Application: Target and data
• Target: Calculate direct and indirect at-risk-of-poverty rate estimates by socio-demographic breakdowns
• Data sources: Survey on Income and Living Conditions (EU-SILC) and Census data of some EU countries in 2011
• Sample: divided into 18 disjoint socio-demographic groups of small and large sizes
• Auxiliary variables: unit-level information on economic activity status and highest level of education attained
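A minimal Python sketch of the direct part of this target, assuming each sample record is a (domain, income, weight) tuple. The function names, the record layout and the 60% threshold share are illustrative assumptions, not the author's code (the deck itself names R functions). The coefficient of variation uses a crude simple-random-sampling approximation, not a design-based variance.

```python
from collections import defaultdict
from math import sqrt


def weighted_median(values, weights):
    """Smallest value at which the cumulative weight reaches half the total."""
    pairs = sorted(zip(values, weights))
    half = sum(weights) / 2.0
    running = 0.0
    for value, weight in pairs:
        running += weight
        if running >= half:
            return value
    raise ValueError("empty input")


def direct_arpr(records, threshold_share=0.6):
    """Direct at-risk-of-poverty rate per domain.

    records: iterable of (domain, income, weight) tuples (hypothetical layout).
    Returns (threshold, {domain: (n_d, rate, cv)}).
    """
    records = list(records)
    incomes = [income for _, income, _ in records]
    weights = [weight for _, _, weight in records]
    # Threshold: a share of the weighted median income over the whole sample.
    z = threshold_share * weighted_median(incomes, weights)

    by_domain = defaultdict(list)
    for domain, income, weight in records:
        by_domain[domain].append((income, weight))

    estimates = {}
    for domain, units in by_domain.items():
        n_d = len(units)
        weight_sum = sum(w for _, w in units)
        rate = sum(w for y, w in units if y < z) / weight_sum
        # Crude SRS-style standard error; it ignores the sampling design.
        se = sqrt(rate * (1.0 - rate) / n_d)
        cv = se / rate if rate > 0 else float("inf")
        estimates[domain] = (n_d, rate, cv)
    return z, estimates
```

Because the standard error shrinks only with the square root of n_d, domains with few sampled units get unstable direct estimates, which is what motivates the indirect (model-based) estimators.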
Application 1: Results
Application 2: Results
Learnings and future work
• Applying model-based SAE techniques could increase the reliability of estimates
• Enlarging the set of auxiliary variables
• Further investigation is needed to assess the most appropriate estimator (call for harmonization?)
• Extension to additional countries and socio-demographic groups
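One way to see why model-based estimates can be more reliable in small groups is the shrinkage weight of the nested-error model for a domain mean. The sketch below is illustrative only: it covers a linear mean rather than the poverty rate, ignores the sampling fraction, and all function names, argument names and variance values are hypothetical.

```python
def shrinkage_weight(n_d, sigma2_u, sigma2_e):
    """Weight given to the domain's own data under a nested-error model.

    sigma2_u: between-domain variance, sigma2_e: unit-level variance
    (both assumed known here; in practice they are estimated).
    """
    return sigma2_u / (sigma2_u + sigma2_e / n_d)


def composite_mean(direct_mean, sample_pred, pop_pred, n_d, sigma2_u, sigma2_e):
    """Model-based domain mean: the regression prediction at the population
    covariate means, corrected by a shrunken sample residual.

    sample_pred: regression prediction at the sample covariate means.
    pop_pred: regression prediction at the population covariate means.
    """
    gamma = shrinkage_weight(n_d, sigma2_u, sigma2_e)
    return pop_pred + gamma * (direct_mean - sample_pred)


if __name__ == "__main__":
    # With few sampled units the estimate leans on the model; with many
    # units it approaches the direct estimate.
    for n_d in (1, 4, 36):
        print(n_d, round(shrinkage_weight(n_d, 1.0, 4.0), 3))
```

The weight tends to 1 as n_d grows, so large groups keep essentially their direct estimate while small groups borrow strength from the model.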
Open questions on SAE
• EB vs. HB dichotomy calls for harmonised practices in
Official Statistics?
• Design based to model based to algorithm based:
maybe there is a possible link between SAE and
statistical learning?
• Reversing the approach: starting from the data rather
than from the goal?
• How about the use of SAE for data protection?
Advertisement
CESS2016, Conference of European Statistics Stakeholders
Budapest, 20–21 Oct 16 (by ESTAT, ECB & HCSO), free!
• Session B3: Official statistics on cross-border phenomena
• Session C9: Small area estimation and weighting
NTTS2017, New Techniques and Technologies for Statistics
Brussels, 14–16 March 17 (by ESTAT), free!
• abstract by 28 Oct 16, track C includes SAE
Research funds under Horizon 2020
TOPIC : Towards a new growth strategy in Europe - Improved
economic and social measurement, data and official statistics
Opening: 4 October 2016
Closing: 2 February 2017
For more info: here. To submit a proposal: here.
"Disaggregation of statistics - geographically, or by other domains
(e.g. identifying vulnerable population groups) - to provide greater
insights and providing evidence allowing more focused policy
decisions should be covered. At the same time data protection
concerns should be addressed. Small Area Estimation expertise
could cover the geographical/domain disaggregation aspect"
Thank you!
dario.buono@ec.europa.eu
http://ec.europa.eu/eurostat
