SlideShare a Scribd company logo
CODE@MIT – OCTOBER 16 2015
Lessons learned in display advertising
Natural experiments at scale
Robert Moakler – (rmoakler@stern.nyu.edu)
Ekaterina Eliseeva – (keliseeva@integralads.com)
Kiril Tsemekhman – (kiril@integralads.com)
CODE@MIT 2015
CODE@MIT – OCTOBER 16 2015
The $100+ billion question!

Does online advertising really work? 
$104.57
$120.05
$140.15
$160.18
$178.45
$196.05
$213.89Digital ad spending!
% change!
2012 2013 2014 2015 2016 2017 2018!
Source: www.emarketer.com, “Global Ad Spending Growth to Double This Year”
20.4%
14.8% 16.7% 14.3% 11.4% 9.9% 9.1%
CODE@MIT – OCTOBER 16 2015
The $100+ billion question!

Does online advertising really work? 

Do online ads cause you to take some action?
CODE@MIT – OCTOBER 16 2015
The usual approach!
Randomized experiments and A/B tests are great!
Campaign AdPSA
CODE@MIT – OCTOBER 16 2015
The usual approach!
But sometimes …
RIGHT
WRONG
Randomized experiments and A/B tests are great!
Campaign AdPSA
CODE@MIT – OCTOBER 16 2015
Natural experiments!
•  Consider the typical setup for the ad serving process
Confounding!
W
User
features
A
Served
ads
Y
Convert
CODE@MIT – OCTOBER 16 2015
Natural experiments!
•  Consider the typical setup for the ad serving process
•  Introduce a mediating variable
W
User
features
A
Served
ads
Y
Convert
M
Mediator
W’
Residual
Confounders
CODE@MIT – OCTOBER 16 2015
Natural experiments!
•  Consider the typical setup for the ad serving process
•  Introduce a mediating variable
–  Viewability
W
User
features
A
Served
ads
Y
Convert
V
Viewable
ad
W’
Residual
Confounders
CODE@MIT – OCTOBER 16 2015
Ad viewability!
Horizontal location (px) Proportion of ads
Verticallocation(px)
Addensity
CODE@MIT – OCTOBER 16 2015
Running in the wild!
•  Natural experiments aren’t always clean or easy
•  We will discuss five problems that we have run into and some
solutions for dealing with them
CODE@MIT – OCTOBER 16 2015
An online advertising campaign!
•  Our data structure
Analysis window
Viewable ad
Unviewable ad
Conversion
Web activity
Our users
CODE@MIT – OCTOBER 16 2015
Longitudinal data!
•  Monitoring
–  Most online advertising campaigns run continually
–  We are constantly monitoring many campaigns at the event level
•  Running an intermediary analysis
–  Data is subject to left truncation and right censoring
–  We need to account for our residual confounders, W’
•  Use survival analysis
–  Cox Proportional Hazards (CPH) model
CODE@MIT – OCTOBER 16 2015
User fragmentation and study period!
•  In reality, our users are defined by cookies. 
–  However, people do not just have one cookie!
Viewable ad
Unviewable ad
Conversion
Web activity
Sarah
Cookie 1 Cookie 2 Cookie 3
Bob
Cookie 1 Cookie 2
Analysis window
Cookie 3
CODE@MIT – OCTOBER 16 2015
User fragmentation and study period!
•  In reality, our users are defined by cookies. 
–  However, people do not just have one cookie!
•  Some methods we use to account for this
–  We define an effect period of 1 week
•  Seasonality has a major impact
•  Users are selected through iterative simulation and research
•  Incremental causal estimates level off after a single week
CODE@MIT – OCTOBER 16 2015
Validation!
•  How do we know our causal models give reasonable estimates?
•  Use an array of negative control tests
–  Use the impressions of one campaign to predict an unrelated conversion 
W
User
features
A
Served
ads
Y
Convert
W’
Residual
Confounders
Y
Unrelated
Event
-
V
Viewable
ad
CODE@MIT – OCTOBER 16 2015
Running at scale!
•  Converting our data into something analyzable is a challenge
…
Raw daily logs
Billions of events
HDFS scalable
cluster storage
Hadoop
People browse
the web.
Advertising events
turn into billions of
daily events.
Raw data is moved
to scalable storage
optimized for our
experimental setup.
Users are subsampled
and negative controls
are chosen in parallel.
Reports are run in
parallel using
stripped down R
libraries.
Iterative process of
simulation and
research.
CODE@MIT – OCTOBER 16 2015
Summary!
•  Mediators and natural experiments may already exist in your data
•  Running a natural experiment at scale is not straight forward, because
1.  The longitudinal nature of the data
2.  Users can become highly fragmented
3.  No predetermined start and end dates
4.  Validation of causal models
5.  Billions of events and terabytes of raw data
•  Equal parts engineering and modeling
•  We explored online advertising, but this setup can apply to a wide
variety of industries
CODE@MIT – OCTOBER 16 2015
Thanks!
Robert Moakler – (rmoakler@stern.nyu.edu)
Ekaterina Eliseeva – (keliseeva@integralads.com)
Kiril Tsemekhman – (kiril@integralads.com)

Grab this deck @ bit.ly/natural-experiments-at-scale
CODE@MIT – OCTOBER 16 2015
Acknowledgments!
Integral Ad Science
Ekaterina Eliseeva
Kiril Tsemekhman
Ana Calabrese
Gijs Joost Brouwer
Sergei Izrailev 

NYU Stern
Foster Provost

Amazon, Inc.
Daniel Hill
CODE@MIT – OCTOBER 16 2015
References!
Chan, D., Ge, R., Gershony, O., Hesterberg, T., & Lambert, D. (2010, July). Evaluating online ad
campaigns in a pipeline: causal models at scale. In Proceedings of the 16th ACM SIGKDD
international conference on Knowledge discovery and data mining (pp. 7-16). ACM.
Dalessandro, B., Perlich, C., Stitelman, O., & Provost, F. (2012, August). Causally motivated attribution
for online advertising. In Proceedings of the Sixth International Workshop on Data Mining for Online
Advertising and Internet Economy (p. 7). ACM.
Hill, D. N., Moakler, R., Hubbard, A. E., Tsemekhman, V., Provost, F., & Tsemekhman, K. (2015, August).
Measuring Causal Impact of Online Actions via Natural Experiments: Application to Display
Advertising. In Proceedings of the 21th ACM SIGKDD International Conference on Knowledge
Discovery and Data Mining (pp. 1839-1847). ACM.
Johnson, G. A., Lewis, R. A., Nubbemeyer, E. I. (2015, October). Ghost Ads: Improving the Economics
of Measuring Ad Effectiveness. Available on SSRN: ssrn.com/abstract=2620078
Klein, J. P., & Moeschberger, M. L. (2003). Survival analysis: techniques for censored and truncated
data. Springer Science & Business Media.
Pearl, J. (2009). Causality. Cambridge university press.

More Related Content

PPTX
Machine Learning and Causal Inference
PDF
Agent-based modelling, the London riots, and Python
PDF
PyCon Colombia Keynote 2018
PDF
WisOwners PowerPoint - Proof 1
PDF
Robert Moakler, Data Science Intern, Integral Ad Science at MLconf SEA - 5/01/15
PPTX
The Impact of Computing Systems | Causal inference in practice
PPTX
Experimentation at Scale
PDF
Supporting decisions with ML
Machine Learning and Causal Inference
Agent-based modelling, the London riots, and Python
PyCon Colombia Keynote 2018
WisOwners PowerPoint - Proof 1
Robert Moakler, Data Science Intern, Integral Ad Science at MLconf SEA - 5/01/15
The Impact of Computing Systems | Causal inference in practice
Experimentation at Scale
Supporting decisions with ML

Similar to Natural Experiments at Scale (16)

PDF
Applications of Big Data
PDF
GalvanizeU Seattle: Eleven Almost-Truisms About Data
PDF
data science @NYT ; inaugural Data Science Initiative Lecture
PDF
Data Economy: Lessons learned and the Road ahead!
PPT
How Digital & Big Data Revolution Will Transform Primary Care Medicine
PDF
Ronny Kohavi, Microsoft (USA) - Conversion Hotel 2017 - keynote
PDF
Insights: July 26 big data workshop for causes
PDF
data science history / data science @ NYT
PDF
Computational Social Science, Lecture 10: Online Experiments
PPTX
Data science in the noc and beyond
PPTX
Data Models And Details About Open Data
PDF
Implicit Human-Computer Interaction - Lecture 11 - Next Generation User Inter...
PDF
Newsletter_April_2016
PPTX
The Quantified self
PDF
Introduction to data sciceeeeeeeeeeeeeeeeeeee
PDF
Accretive Health - Quality Management in Health Care
Applications of Big Data
GalvanizeU Seattle: Eleven Almost-Truisms About Data
data science @NYT ; inaugural Data Science Initiative Lecture
Data Economy: Lessons learned and the Road ahead!
How Digital & Big Data Revolution Will Transform Primary Care Medicine
Ronny Kohavi, Microsoft (USA) - Conversion Hotel 2017 - keynote
Insights: July 26 big data workshop for causes
data science history / data science @ NYT
Computational Social Science, Lecture 10: Online Experiments
Data science in the noc and beyond
Data Models And Details About Open Data
Implicit Human-Computer Interaction - Lecture 11 - Next Generation User Inter...
Newsletter_April_2016
The Quantified self
Introduction to data sciceeeeeeeeeeeeeeeeeeee
Accretive Health - Quality Management in Health Care
Ad

More from Integral Ad Science (8)

PDF
2015 Q4 Media Quality Report
PDF
2015 Q3 Media Quality Report
PDF
Year-end Survey Results
PDF
2015 Q2 Media Quality Report
PDF
2015 Q1 Media Quality Report
PDF
2014 Q4 Media Quality Report
PPTX
Integral Ad Science Viewability Presentation
PDF
Integral Ad Science Digital Ad Fraud Presentation
2015 Q4 Media Quality Report
2015 Q3 Media Quality Report
Year-end Survey Results
2015 Q2 Media Quality Report
2015 Q1 Media Quality Report
2014 Q4 Media Quality Report
Integral Ad Science Viewability Presentation
Integral Ad Science Digital Ad Fraud Presentation
Ad

Recently uploaded (20)

PDF
. Radiology Case Scenariosssssssssssssss
PPTX
EPIDURAL ANESTHESIA ANATOMY AND PHYSIOLOGY.pptx
PDF
Phytochemical Investigation of Miliusa longipes.pdf
PPTX
famous lake in india and its disturibution and importance
PDF
Biophysics 2.pdffffffffffffffffffffffffff
PPT
protein biochemistry.ppt for university classes
PPTX
Introduction to Cardiovascular system_structure and functions-1
PPTX
The KM-GBF monitoring framework – status & key messages.pptx
PPTX
2. Earth - The Living Planet Module 2ELS
PPTX
neck nodes and dissection types and lymph nodes levels
PPTX
Protein & Amino Acid Structures Levels of protein structure (primary, seconda...
DOCX
Q1_LE_Mathematics 8_Lesson 5_Week 5.docx
PPTX
INTRODUCTION TO EVS | Concept of sustainability
PPTX
BIOMOLECULES PPT........................
PDF
Mastering Bioreactors and Media Sterilization: A Complete Guide to Sterile Fe...
PPTX
Microbiology with diagram medical studies .pptx
PDF
Cosmic Outliers: Low-spin Halos Explain the Abundance, Compactness, and Redsh...
PPTX
2. Earth - The Living Planet earth and life
PDF
The scientific heritage No 166 (166) (2025)
PPTX
cpcsea ppt.pptxssssssssssssssjjdjdndndddd
. Radiology Case Scenariosssssssssssssss
EPIDURAL ANESTHESIA ANATOMY AND PHYSIOLOGY.pptx
Phytochemical Investigation of Miliusa longipes.pdf
famous lake in india and its disturibution and importance
Biophysics 2.pdffffffffffffffffffffffffff
protein biochemistry.ppt for university classes
Introduction to Cardiovascular system_structure and functions-1
The KM-GBF monitoring framework – status & key messages.pptx
2. Earth - The Living Planet Module 2ELS
neck nodes and dissection types and lymph nodes levels
Protein & Amino Acid Structures Levels of protein structure (primary, seconda...
Q1_LE_Mathematics 8_Lesson 5_Week 5.docx
INTRODUCTION TO EVS | Concept of sustainability
BIOMOLECULES PPT........................
Mastering Bioreactors and Media Sterilization: A Complete Guide to Sterile Fe...
Microbiology with diagram medical studies .pptx
Cosmic Outliers: Low-spin Halos Explain the Abundance, Compactness, and Redsh...
2. Earth - The Living Planet earth and life
The scientific heritage No 166 (166) (2025)
cpcsea ppt.pptxssssssssssssssjjdjdndndddd

Natural Experiments at Scale

  • 1. CODE@MIT – OCTOBER 16 2015 Lessons learned in display advertising Natural experiments at scale Robert Moakler – (rmoakler@stern.nyu.edu) Ekaterina Eliseeva – (keliseeva@integralads.com) Kiril Tsemekhman – (kiril@integralads.com) CODE@MIT 2015
  • 2. CODE@MIT – OCTOBER 16 2015 The $100+ billion question! Does online advertising really work? $104.57 $120.05 $140.15 $160.18 $178.45 $196.05 $213.89Digital ad spending! % change! 2012 2013 2014 2015 2016 2017 2018! Source: www.emarketer.com, “Global Ad Spending Growth to Double This Year” 20.4% 14.8% 16.7% 14.3% 11.4% 9.9% 9.1%
  • 3. CODE@MIT – OCTOBER 16 2015 The $100+ billion question! Does online advertising really work? Do online ads cause you to take some action?
  • 4. CODE@MIT – OCTOBER 16 2015 The usual approach! Randomized experiments and A/B tests are great! Campaign AdPSA
  • 5. CODE@MIT – OCTOBER 16 2015 The usual approach! But sometimes … RIGHT WRONG Randomized experiments and A/B tests are great! Campaign AdPSA
  • 6. CODE@MIT – OCTOBER 16 2015 Natural experiments! •  Consider the typical setup for the ad serving process Confounding! W User features A Served ads Y Convert
  • 7. CODE@MIT – OCTOBER 16 2015 Natural experiments! •  Consider the typical setup for the ad serving process •  Introduce a mediating variable W User features A Served ads Y Convert M Mediator W’ Residual Confounders
  • 8. CODE@MIT – OCTOBER 16 2015 Natural experiments! •  Consider the typical setup for the ad serving process •  Introduce a mediating variable –  Viewability W User features A Served ads Y Convert V Viewable ad W’ Residual Confounders
  • 9. CODE@MIT – OCTOBER 16 2015 Ad viewability! Horizontal location (px) Proportion of ads Verticallocation(px) Addensity
  • 10. CODE@MIT – OCTOBER 16 2015 Running in the wild! •  Natural experiments aren’t always clean or easy •  We will discuss five problems that we have run into and some solutions for dealing with them
  • 11. CODE@MIT – OCTOBER 16 2015 An online advertising campaign! •  Our data structure Analysis window Viewable ad Unviewable ad Conversion Web activity Our users
  • 12. CODE@MIT – OCTOBER 16 2015 Longitudinal data! •  Monitoring –  Most online advertising campaigns run continually –  We are constantly monitoring many campaigns at the event level •  Running an intermediary analysis –  Data is subject to left truncation and right censoring –  We need to account for our residual confounders, W’ •  Use survival analysis –  Cox Proportional Hazards (CPH) model
  • 13. CODE@MIT – OCTOBER 16 2015 User fragmentation and study period! •  In reality, our users are defined by cookies. –  However, people do not just have one cookie! Viewable ad Unviewable ad Conversion Web activity Sarah Cookie 1 Cookie 2 Cookie 3 Bob Cookie 1 Cookie 2 Analysis window Cookie 3
  • 14. CODE@MIT – OCTOBER 16 2015 User fragmentation and study period! •  In reality, our users are defined by cookies. –  However, people do not just have one cookie! •  Some methods we use to account for this –  We define an effect period of 1 week •  Seasonality has a major impact •  Users are selected through iterative simulation and research •  Incremental causal estimates level off after a single week
  • 15. CODE@MIT – OCTOBER 16 2015 Validation! •  How do we know our causal models give reasonable estimates? •  Use an array of negative control tests –  Use the impressions of one campaign to predict an unrelated conversion W User features A Served ads Y Convert W’ Residual Confounders Y Unrelated Event - V Viewable ad
  • 16. CODE@MIT – OCTOBER 16 2015 Running at scale! •  Converting our data into something analyzable is a challenge … Raw daily logs Billions of events HDFS scalable cluster storage Hadoop People browse the web. Advertising events turn into billions of daily events. Raw data is moved to scalable storage optimized for our experimental setup. Users are subsampled and negative controls are chosen in parallel. Reports are run in parallel using stripped down R libraries. Iterative process of simulation and research.
  • 17. CODE@MIT – OCTOBER 16 2015 Summary! •  Mediators and natural experiments may already exist in your data •  Running a natural experiment at scale is not straight forward, because 1.  The longitudinal nature of the data 2.  Users can become highly fragmented 3.  No predetermined start and end dates 4.  Validation of causal models 5.  Billions of events and terabytes of raw data •  Equal parts engineering and modeling •  We explored online advertising, but this setup can apply to a wide variety of industries
  • 18. CODE@MIT – OCTOBER 16 2015 Thanks! Robert Moakler – (rmoakler@stern.nyu.edu) Ekaterina Eliseeva – (keliseeva@integralads.com) Kiril Tsemekhman – (kiril@integralads.com) Grab this deck @ bit.ly/natural-experiments-at-scale
  • 19. CODE@MIT – OCTOBER 16 2015 Acknowledgments! Integral Ad Science Ekaterina Eliseeva Kiril Tsemekhman Ana Calabrese Gijs Joost Brouwer Sergei Izrailev NYU Stern Foster Provost Amazon, Inc. Daniel Hill
  • 20. CODE@MIT – OCTOBER 16 2015 References! Chan, D., Ge, R., Gershony, O., Hesterberg, T., & Lambert, D. (2010, July). Evaluating online ad campaigns in a pipeline: causal models at scale. In Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining (pp. 7-16). ACM. Dalessandro, B., Perlich, C., Stitelman, O., & Provost, F. (2012, August). Causally motivated attribution for online advertising. In Proceedings of the Sixth International Workshop on Data Mining for Online Advertising and Internet Economy (p. 7). ACM. Hill, D. N., Moakler, R., Hubbard, A. E., Tsemekhman, V., Provost, F., & Tsemekhman, K. (2015, August). Measuring Causal Impact of Online Actions via Natural Experiments: Application to Display Advertising. In Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (pp. 1839-1847). ACM. Johnson, G. A., Lewis, R. A., Nubbemeyer, E. I. (2015, October). Ghost Ads: Improving the Economics of Measuring Ad Effectiveness. Available on SSRN: ssrn.com/abstract=2620078 Klein, J. P., & Moeschberger, M. L. (2003). Survival analysis: techniques for censored and truncated data. Springer Science & Business Media. Pearl, J. (2009). Causality. Cambridge university press.