SlideShare a Scribd company logo
What is Selective Inference?
• When you use a feature selection method (e.g. Lasso), there
exists selection bias.
• There are features : 𝑥1, 𝑥2, … 𝑥100, and 𝑥2, 𝑥21, 𝑥30 are chosen.
• Then you make the model : y = 𝑤2 𝑥2 + 𝑤21 𝑥21 + 𝑤30 𝑥30
• Fit it, and get the parameters : 𝑤2 = +3.8, 𝑤21 = −4.5, 𝑤30 = +6.2
• p-values of 𝑤2, w21, w30 are beyond 𝛼 = 0.05.
• However, you cannot directly say that :
“𝑤2 and w30 have positive effects and 𝑤21 has negative effect.”
• Hypothesis selected by some algorithm has selection bias and we
have to correct the bias. To do that, we get following 𝑙 𝑎𝑛𝑑 𝑢.
Ρ 𝑤𝑖 ∉ 𝑙, 𝑢 𝑖 ∈ 𝑆, 𝑆 ← 𝐿𝑎𝑠𝑠𝑜) < 𝛼
• このような条件付分布に基づく統計的推論の枠組をSelective Inference と呼ぶようです

More Related Content

PPT
Chap11 chie square & non parametrics
PPT
Chap10 anova
PDF
Regression analysis in excel
PPTX
Chi square Test
PPTX
Statr session 19 and 20
PPT
Chap12 simple regression
PPT
Simple Linier Regression
PPTX
Student’s t test
Chap11 chie square & non parametrics
Chap10 anova
Regression analysis in excel
Chi square Test
Statr session 19 and 20
Chap12 simple regression
Simple Linier Regression
Student’s t test

What's hot (20)

PPTX
PPT
11.1 anova1
PPTX
ERF Training Workshop Panel Data 4
PPTX
T test for one sample mean
PPTX
ERF Training Workshop Panel Data 5
PPTX
ERF Training Workshop Panel Data 3
PPTX
Image stitching
PPT
Student t t est
PPTX
Chi square test
PPTX
PPTX
Static Models of Continuous Variables
PPT
Mann Whitney U test
PPTX
Linear regression by Kodebay
PDF
Simple linear regression
PPT
Mann Whitney U Test
PPTX
Unit 3 random number generation, random-variate generation
PPTX
Hypothesis testing part i
PPTX
Hypothesis testing part iii for difference of means
PPTX
Wilcoxon Rank-Sum Test
PPTX
Curve fitting
11.1 anova1
ERF Training Workshop Panel Data 4
T test for one sample mean
ERF Training Workshop Panel Data 5
ERF Training Workshop Panel Data 3
Image stitching
Student t t est
Chi square test
Static Models of Continuous Variables
Mann Whitney U test
Linear regression by Kodebay
Simple linear regression
Mann Whitney U Test
Unit 3 random number generation, random-variate generation
Hypothesis testing part i
Hypothesis testing part iii for difference of means
Wilcoxon Rank-Sum Test
Curve fitting
Ad

More from Daiki Tanaka (13)

PDF
[Paper Reading] Theoretical Analysis of Self-Training with Deep Networks on U...
PDF
カーネル法:正定値カーネルの理論
PDF
[Paper Reading] Causal Bandits: Learning Good Interventions via Causal Inference
PDF
[Paper reading] L-SHAPLEY AND C-SHAPLEY: EFFICIENT MODEL INTERPRETATION FOR S...
PDF
Anomaly Detection with VAEGAN and Attention [JSAI2019 report]
PDF
オンライン学習 : Online learning
PPTX
[Paper Reading] Attention is All You Need
PDF
Local Outlier Detection with Interpretation
PDF
Interpretability of machine learning
PDF
The Million Domain Challenge: Broadcast Email Prioritization by Cross-domain ...
PDF
The Limits of Popularity-Based Recommendations, and the Role of Social Ties
PDF
Learning Deep Representation from Big and Heterogeneous Data for Traffic Acci...
PDF
Toeplitz Inverse Covariance-Based Clustering of Multivariate Time Series Data
[Paper Reading] Theoretical Analysis of Self-Training with Deep Networks on U...
カーネル法:正定値カーネルの理論
[Paper Reading] Causal Bandits: Learning Good Interventions via Causal Inference
[Paper reading] L-SHAPLEY AND C-SHAPLEY: EFFICIENT MODEL INTERPRETATION FOR S...
Anomaly Detection with VAEGAN and Attention [JSAI2019 report]
オンライン学習 : Online learning
[Paper Reading] Attention is All You Need
Local Outlier Detection with Interpretation
Interpretability of machine learning
The Million Domain Challenge: Broadcast Email Prioritization by Cross-domain ...
The Limits of Popularity-Based Recommendations, and the Role of Social Ties
Learning Deep Representation from Big and Heterogeneous Data for Traffic Acci...
Toeplitz Inverse Covariance-Based Clustering of Multivariate Time Series Data
Ad

Recently uploaded (20)

PPTX
Database Infoormation System (DBIS).pptx
PPTX
oil_refinery_comprehensive_20250804084928 (1).pptx
PPTX
Acceptance and paychological effects of mandatory extra coach I classes.pptx
PDF
Mega Projects Data Mega Projects Data
PDF
Clinical guidelines as a resource for EBP(1).pdf
PDF
TRAFFIC-MANAGEMENT-AND-ACCIDENT-INVESTIGATION-WITH-DRIVING-PDF-FILE.pdf
PPTX
Introduction to Knowledge Engineering Part 1
PPTX
05. PRACTICAL GUIDE TO MICROSOFT EXCEL.pptx
PDF
Recruitment and Placement PPT.pdfbjfibjdfbjfobj
PPTX
DISORDERS OF THE LIVER, GALLBLADDER AND PANCREASE (1).pptx
PDF
Galatica Smart Energy Infrastructure Startup Pitch Deck
PPTX
Major-Components-ofNKJNNKNKNKNKronment.pptx
PDF
“Getting Started with Data Analytics Using R – Concepts, Tools & Case Studies”
PPTX
advance b rammar.pptxfdgdfgdfsgdfgsdgfdfgdfgsdfgdfgdfg
PPTX
Data_Analytics_and_PowerBI_Presentation.pptx
PDF
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
PPTX
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
PPTX
MODULE 8 - DISASTER risk PREPAREDNESS.pptx
PPT
Quality review (1)_presentation of this 21
PPTX
Computer network topology notes for revision
Database Infoormation System (DBIS).pptx
oil_refinery_comprehensive_20250804084928 (1).pptx
Acceptance and paychological effects of mandatory extra coach I classes.pptx
Mega Projects Data Mega Projects Data
Clinical guidelines as a resource for EBP(1).pdf
TRAFFIC-MANAGEMENT-AND-ACCIDENT-INVESTIGATION-WITH-DRIVING-PDF-FILE.pdf
Introduction to Knowledge Engineering Part 1
05. PRACTICAL GUIDE TO MICROSOFT EXCEL.pptx
Recruitment and Placement PPT.pdfbjfibjdfbjfobj
DISORDERS OF THE LIVER, GALLBLADDER AND PANCREASE (1).pptx
Galatica Smart Energy Infrastructure Startup Pitch Deck
Major-Components-ofNKJNNKNKNKNKronment.pptx
“Getting Started with Data Analytics Using R – Concepts, Tools & Case Studies”
advance b rammar.pptxfdgdfgdfsgdfgsdgfdfgdfgsdfgdfgdfg
Data_Analytics_and_PowerBI_Presentation.pptx
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
MODULE 8 - DISASTER risk PREPAREDNESS.pptx
Quality review (1)_presentation of this 21
Computer network topology notes for revision

Selective inference

  • 1. What is Selective Inference? • When you use a feature selection method (e.g. Lasso), there exists selection bias. • There are features : 𝑥1, 𝑥2, … 𝑥100, and 𝑥2, 𝑥21, 𝑥30 are chosen. • Then you make the model : y = 𝑤2 𝑥2 + 𝑤21 𝑥21 + 𝑤30 𝑥30 • Fit it, and get the parameters : 𝑤2 = +3.8, 𝑤21 = −4.5, 𝑤30 = +6.2 • p-values of 𝑤2, w21, w30 are beyond 𝛼 = 0.05. • However, you cannot directly say that : “𝑤2 and w30 have positive effects and 𝑤21 has negative effect.” • Hypothesis selected by some algorithm has selection bias and we have to correct the bias. To do that, we get following 𝑙 𝑎𝑛𝑑 𝑢. Ρ 𝑤𝑖 ∉ 𝑙, 𝑢 𝑖 ∈ 𝑆, 𝑆 ← 𝐿𝑎𝑠𝑠𝑜) < 𝛼 • このような条件付分布に基づく統計的推論の枠組をSelective Inference と呼ぶようです