SlideShare a Scribd company logo
1
http://www.amitsharma.in
http://www.github.com/amit-sharma/causal-inference-
tutorial
2
3
4
5
Use these correlations to make a predictive model.
Future Activity ->
f(number of friends, logins in past month)

6
7
8
9
10
11
12
13
14
15
16
17
18
19
Algorithm A Algorithm B
50/1000 (5%) 54/1000 (5.4%)
20
Algorithm A Algorithm B
10/400 (2.5%) 4/200 (2%)
Algorithm A Algorithm B
40/600 (6.6%) 50/800 (6.2%)
Is Algorithm A better?
Algorithm A Algorithm B
Success Rate for
Low-Activity users
10/400 (2.5%) 4/200 (2%)
Success Rate for
High-Activity users
40/600 (6.6%) 50/800 (6.2%)
Total Success Rate 50/1000 (5%) 54/1000 (5.4%)
21
22
Average comment length decreases over time.
23
But for each yearly cohort of users, comment length
increases over time.
24
25
26
27http://plato.stanford.edu/entries/causation-mani/
28http://plato.stanford.edu/entries/causation-counterfactual/
29
30
31
32
33
34
35
36
37
38Dunning (2002), Rosenzweig-Wolpin (2000)
39
40
41
42
43
44
45
46
47http://tylervigen.com/spurious-correlations
http://www.github.com/amit-sharma/causal-inference-tutorial
amshar@microsoft.com
48

More Related Content

PPTX
Big Data Analytics
PPT
1.6.data preprocessing
PPTX
Logistic Regression | Logistic Regression In Python | Machine Learning Algori...
PDF
PhD Thesis Defense Presentation: Robust Low-rank and Sparse Decomposition for...
PPTX
Hive and HiveQL - Module6
PDF
Data preprocessing using Machine Learning
PPTX
Big data analytics in healthcare
PPTX
boosting algorithm
Big Data Analytics
1.6.data preprocessing
Logistic Regression | Logistic Regression In Python | Machine Learning Algori...
PhD Thesis Defense Presentation: Robust Low-rank and Sparse Decomposition for...
Hive and HiveQL - Module6
Data preprocessing using Machine Learning
Big data analytics in healthcare
boosting algorithm

What's hot (20)

PPT
Slide3.ppt
PPTX
Lecture 18: Gaussian Mixture Models and Expectation Maximization
PPTX
Machine Learning and Causal Inference
PDF
Dimensionality Reduction
PPTX
Data analytics
PPTX
Causal Inference in Marketing
PDF
Lessons learned from building practical deep learning systems
PDF
Data Science Use cases in Banking
PPTX
Machine learning & Time Series Analysis
PDF
Feature Engineering
PPT
5.1 mining data streams
PPT
Introduction To Predictive Analytics Part I
PDF
Customer Clustering For Retail Marketing
PPTX
Belief Networks & Bayesian Classification
PPT
Data Science in the Real World: Making a Difference
PPTX
Big data analytics
PDF
Introduction to text classification using naive bayes
PPTX
Explainable AI in Industry (KDD 2019 Tutorial)
PPTX
Exploratory data analysis
PPTX
Machine Learning Algorithms | Machine Learning Tutorial | Data Science Algori...
Slide3.ppt
Lecture 18: Gaussian Mixture Models and Expectation Maximization
Machine Learning and Causal Inference
Dimensionality Reduction
Data analytics
Causal Inference in Marketing
Lessons learned from building practical deep learning systems
Data Science Use cases in Banking
Machine learning & Time Series Analysis
Feature Engineering
5.1 mining data streams
Introduction To Predictive Analytics Part I
Customer Clustering For Retail Marketing
Belief Networks & Bayesian Classification
Data Science in the Real World: Making a Difference
Big data analytics
Introduction to text classification using naive bayes
Explainable AI in Industry (KDD 2019 Tutorial)
Exploratory data analysis
Machine Learning Algorithms | Machine Learning Tutorial | Data Science Algori...
Ad

Viewers also liked (20)

PPTX
Causal inference in online systems: Methods, pitfalls and best practices
PPTX
Data mining for causal inference: Effect of recommendations on Amazon.com
PPTX
From prediction to causation: Causal inference in online systems
PPTX
Causal data mining: Identifying causal effects at scale
PPTX
5.3.5 causal inference in research
 
PPTX
Causal inference in practice
PPTX
Parallel Recurrent Neural Network Architectures for Feature-rich Session-base...
PDF
Plotcon 2016 Visualization Talk by Alexandra Johnson
PPTX
Data Eng Conf NY Nov 2016 Parquet Arrow
PDF
Using social media to communicate research: Experiences of the International ...
PPT
241109 rm-j.p.-non experimental design
PDF
PPTX
Reducing the dimensionality of data with neural networks
PPT
Identifying Problems from Observations
PDF
Reference And Inference By Dr.Shadia.Pptx
PPT
Observation, Inference, and Prediction Review
PPTX
UTILIZATION OF NURSING RESEARCH
PDF
Kdd 2014 Tutorial - the recommender problem revisited
PPT
Observations vs Inferences
PDF
Agile Data Science 2.0
Causal inference in online systems: Methods, pitfalls and best practices
Data mining for causal inference: Effect of recommendations on Amazon.com
From prediction to causation: Causal inference in online systems
Causal data mining: Identifying causal effects at scale
5.3.5 causal inference in research
 
Causal inference in practice
Parallel Recurrent Neural Network Architectures for Feature-rich Session-base...
Plotcon 2016 Visualization Talk by Alexandra Johnson
Data Eng Conf NY Nov 2016 Parquet Arrow
Using social media to communicate research: Experiences of the International ...
241109 rm-j.p.-non experimental design
Reducing the dimensionality of data with neural networks
Identifying Problems from Observations
Reference And Inference By Dr.Shadia.Pptx
Observation, Inference, and Prediction Review
UTILIZATION OF NURSING RESEARCH
Kdd 2014 Tutorial - the recommender problem revisited
Observations vs Inferences
Agile Data Science 2.0
Ad

Similar to Causal inference in data science (14)

PPTX
causal_inference_extended_tutorial.pptx
PPTX
The Impact of Computing Systems | Causal inference in practice
PPTX
Modeling Social Data, Lecture 11: Causality and Experiments, Part 1
PPTX
DoWhy Python library for causal inference: An End-to-End tool
PDF
Causal Inference in Data Science and Machine Learning
PDF
Business Optimization via Causal Inference
PDF
Clinical studies & observational trials in the age of AI
PDF
Supercharge your AB testing with automated causal inference - Community Works...
PPTX
Machine learning session7(nb classifier k-nn)
PDF
Why big data didn’t end causal inference by Totte Harinen at Big Data Spain 2017
PDF
Predictive Analytics with UX Research Data: Yes We Can!
PPTX
What is A/B-testing? An Introduction
PDF
BDW17 London - Totte Harinen, Uber - Why Big Data Didn’t End Causal Inference
PDF
[SOLVED 2025] Correlation is a common statistic to measure a general linear r...
causal_inference_extended_tutorial.pptx
The Impact of Computing Systems | Causal inference in practice
Modeling Social Data, Lecture 11: Causality and Experiments, Part 1
DoWhy Python library for causal inference: An End-to-End tool
Causal Inference in Data Science and Machine Learning
Business Optimization via Causal Inference
Clinical studies & observational trials in the age of AI
Supercharge your AB testing with automated causal inference - Community Works...
Machine learning session7(nb classifier k-nn)
Why big data didn’t end causal inference by Totte Harinen at Big Data Spain 2017
Predictive Analytics with UX Research Data: Yes We Can!
What is A/B-testing? An Introduction
BDW17 London - Totte Harinen, Uber - Why Big Data Didn’t End Causal Inference
[SOLVED 2025] Correlation is a common statistic to measure a general linear r...

More from Amit Sharma (14)

PPTX
Dowhy: An end-to-end library for causal inference
PPTX
Alleviating Privacy Attacks Using Causal Models
PPTX
Artificial Intelligence for Societal Impact
PPTX
Measuring effectiveness of machine learning systems
PPTX
Auditing search engines for differential satisfaction across demographics
PPTX
Equivalence causal frameworks: SEMs, Graphical models and Potential Outcomes
PPTX
Estimating the causal impact of recommender systems
PPTX
Predictability of popularity on online social media: Gaps between prediction ...
PPTX
Estimating influence of online activity feeds on people's actions
PPTX
Causal inference in practice: Here, there, causality is everywhere
PPTX
The interplay of personal preference and social influence in sharing networks...
PDF
The role of social connections in shaping our preferences
PDF
[RecSys '13]Pairwise Learning: Experiments with Community Recommendation on L...
PDF
RSWEB 2013: A research platform for social recommendation
Dowhy: An end-to-end library for causal inference
Alleviating Privacy Attacks Using Causal Models
Artificial Intelligence for Societal Impact
Measuring effectiveness of machine learning systems
Auditing search engines for differential satisfaction across demographics
Equivalence causal frameworks: SEMs, Graphical models and Potential Outcomes
Estimating the causal impact of recommender systems
Predictability of popularity on online social media: Gaps between prediction ...
Estimating influence of online activity feeds on people's actions
Causal inference in practice: Here, there, causality is everywhere
The interplay of personal preference and social influence in sharing networks...
The role of social connections in shaping our preferences
[RecSys '13]Pairwise Learning: Experiments with Community Recommendation on L...
RSWEB 2013: A research platform for social recommendation

Recently uploaded (20)

PPTX
AI Strategy room jwfjksfksfjsjsjsjsjfsjfsj
PPTX
Introduction to machine learning and Linear Models
PPTX
Data_Analytics_and_PowerBI_Presentation.pptx
PDF
Mega Projects Data Mega Projects Data
PDF
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
PPTX
MODULE 8 - DISASTER risk PREPAREDNESS.pptx
PPTX
Introduction to Knowledge Engineering Part 1
PDF
Lecture1 pattern recognition............
PPTX
Introduction-to-Cloud-ComputingFinal.pptx
PPTX
ALIMENTARY AND BILIARY CONDITIONS 3-1.pptx
PDF
Fluorescence-microscope_Botany_detailed content
PPTX
Computer network topology notes for revision
PDF
Galatica Smart Energy Infrastructure Startup Pitch Deck
PPT
Quality review (1)_presentation of this 21
PPT
Reliability_Chapter_ presentation 1221.5784
PPTX
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
PPTX
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
PPT
Miokarditis (Inflamasi pada Otot Jantung)
PPTX
climate analysis of Dhaka ,Banglades.pptx
AI Strategy room jwfjksfksfjsjsjsjsjfsjfsj
Introduction to machine learning and Linear Models
Data_Analytics_and_PowerBI_Presentation.pptx
Mega Projects Data Mega Projects Data
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
MODULE 8 - DISASTER risk PREPAREDNESS.pptx
Introduction to Knowledge Engineering Part 1
Lecture1 pattern recognition............
Introduction-to-Cloud-ComputingFinal.pptx
ALIMENTARY AND BILIARY CONDITIONS 3-1.pptx
Fluorescence-microscope_Botany_detailed content
Computer network topology notes for revision
Galatica Smart Energy Infrastructure Startup Pitch Deck
Quality review (1)_presentation of this 21
Reliability_Chapter_ presentation 1221.5784
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
Miokarditis (Inflamasi pada Otot Jantung)
climate analysis of Dhaka ,Banglades.pptx

Causal inference in data science