SlideShare a Scribd company logo
Consistent Transformation of Ratio Metrics for Efficient Online Controlled Experiments
Consistent Transformation of Ratio Metrics
for Efficient Online Controlled Experiments
Roman Budylin, Alexey Drutsa,
Ilya Katsev, Valeriya Tsoy
About Yandex
• The largest internet company in Europe
• ”Google of Russia”, “Amazon of Russia”, “Uber of Russia” and so on
• More than 50M DAU
A/B testing methodology
Group
A
Group
B
Split
them
randomly
trafficofusers
Group
A
Group
B
Split
them
randomly
trafficofusers
Variant for A
Variant for B
Expose to one of
two variants of the service
e.g., the current production version
e.g., an evaluated update
Group
A
Group
B
Split
them
randomly
trafficofusers
Variant for A
Variant for B
Expose to one of
two variants of the service
e.g., the current production version
e.g., an evaluated update
Calculate a key
measure for each user
X(uA1)
…
Calculate the OEC for each
group as the mean value
e.g., X(u) is the number
of sessions of the user u
X(uA2)
X(uA3)
X(uA4)
X(uA5)
X(uB1)
…
X(uB2)
X(uB3)
X(uB4)
X(uB5)
µA(X)=avgu in AX(u)
µB(X)=avgu in BX(u)
Overall Evaluation Criterion
(OEC) for the group B
Overall Evaluation Criterion
(OEC) for the group A
µA(X)=avgu in AX(u)
µB(X)=avgu in BX(u)
Overall Evaluation Criterion
(OEC) for the group B
Overall Evaluation Criterion
(OEC) for the group A
Calculate the OEC for each
group as the mean value
Δ(x) VS 0
Δ(X) = µB(X) – µA(X)
the evaluated update is
positive or negative
Statistical
significance test
the difference is caused by
a noise or
the treatment effect
(e.g., Student’s t-test)
Overall Evaluation
Criterion (OEC)
[Kohavi et al., DMKD’2009]
Overall Acceptance
Criterion (OAC)
[Drutsa et al., CIKM’2015]
Sensitivity
Directionality
OEC levels
1. User level metrics
2. Non-user level metric
For example ratio OEC:
Example – average length of session
Example – average length of session
Example – average length of session
> 30m
Example – average length of session
Example – average length of session
Let user u has sessions with lengths
Then the average of lengths of all sessions is equal to
Problem
For ratio OEC
t-test
sensitivity improvement techniques
Problem
For ratio OEC
t-test
sensitivity improvement techniques
Problem
For ratio OEC
t-test
sensitivity improvement techniques
Existing approaches
• Bootstrap test
Problem: computational expensiveness
• Delta method
Problem: does not allow to apply directly regression adjustment
techniques
Linearization transformation
Our paper, WSDM 2018
21
Consistent Transformation of Ratio Metrics for Efficient Online
Controlled Experiments
Roman Budylin, Alexey Drutsa, Ilya Katsev, Valeriya Tsoy
Data Quality metrics • OEC metrics • Guard rail metrics • Local feature/Diagnosticmetrics
Our paper, WSDM 2018
22
Consistent Transformation of Ratio Metrics for Efficient Online
Controlled Experiments
Roman Budylin, Alexey Drutsa, Ilya Katsev, Valeriya Tsoy
The best comment: «This is too good to be true!» (someone from
Facebook)
Data Quality metrics • OEC metrics • Guard rail metrics • Local feature/Diagnosticmetrics
We found a transformation such that
Our contribution
We found a transformation such that
Ratio OEC User level OEC
Our contribution
We found a transformation such that
Ratio OEC User level OEC
NB: Preserve directionality and significance level!
Our contribution
Let we have a ratio OEC:
Consider the next expression:
And let us use its average
as a metric:
Now we got a linearization OEC:
Linearization
Let A and B be the control and the experiment. Let us denote
Theorem 1: Let be positive. Then for any
the next is true:
Theorems: directionality
Theorem 2: Let be positive and . Let
be the t-statistic applied for the OEC and let be the be the asymt.
standard normal statistic of obtained via the Delta method.
If then then under the null hypothesis that
1. the t-statistics is asymptotically normal
2. converges to 1 by probability
Theorems: significance level
Approaches comparision
Experiments
• Internet search, 2013-2016, 390 experiments
• The transformation + regression adjustment
• +34% of sensitivity
Experiments
The code

More Related Content

PPTX
Decoding the Chinese Internet
PDF
Massive Simulations In Spark: Distributed Monte Carlo For Global Health Forec...
PPTX
Activity Ranking in LinkedIn Feed
PDF
The Hidden Value of Hadoop Migration
PPTX
Drive Successful ESG Initiatives with Trusted Data
PDF
2017 china internet report bcg
PDF
Data Quality Strategies
PDF
Data Monetization
Decoding the Chinese Internet
Massive Simulations In Spark: Distributed Monte Carlo For Global Health Forec...
Activity Ranking in LinkedIn Feed
The Hidden Value of Hadoop Migration
Drive Successful ESG Initiatives with Trusted Data
2017 china internet report bcg
Data Quality Strategies
Data Monetization

What's hot (20)

PDF
COVID-19 Auto & Mobility Consumer Insights
PPTX
Personalized Job Recommendation System at LinkedIn: Practical Challenges and ...
PDF
Building a Data Strategy – Practical Steps for Aligning with Business Goals
PPTX
Power BI Create lightning fast dashboard with power bi & Its Components
PDF
BCG-Executive-Perspectives-CEOs-Dilemma-Supply-Chain-Resilience.pdf
PPTX
The Customer Journey Is a Graph
PDF
Win The Fight: Crush and Contain for Safer Reopening
PDF
[Giovanni Galloro] How to use machine learning on Google Cloud Platform
PPTX
Gatling and Page Object: a way to performance testing
PPTX
Digital Consumers, Emerging Markets, and the $4 Trillion Future
PDF
Data Quality Best Practices
PPTX
Simple and Effective Enterprise Architecture with Tools you Already Own
PDF
FY2021-lenovo-sustainability-report.pdf
PDF
Wie Sie Data Governance Prozesse erfolgreich modellieren und optimieren
PPT
Data Quality Rules introduction
PPT
The data quality challenge
PDF
Big Data Architecture Workshop - Vahid Amiri
PDF
[QCon.ai 2019] People You May Know: Fast Recommendations Over Massive Data
PPTX
Personalizing LinkedIn Feed
PDF
Top 15 Business Intelligence (BI) Software
COVID-19 Auto & Mobility Consumer Insights
Personalized Job Recommendation System at LinkedIn: Practical Challenges and ...
Building a Data Strategy – Practical Steps for Aligning with Business Goals
Power BI Create lightning fast dashboard with power bi & Its Components
BCG-Executive-Perspectives-CEOs-Dilemma-Supply-Chain-Resilience.pdf
The Customer Journey Is a Graph
Win The Fight: Crush and Contain for Safer Reopening
[Giovanni Galloro] How to use machine learning on Google Cloud Platform
Gatling and Page Object: a way to performance testing
Digital Consumers, Emerging Markets, and the $4 Trillion Future
Data Quality Best Practices
Simple and Effective Enterprise Architecture with Tools you Already Own
FY2021-lenovo-sustainability-report.pdf
Wie Sie Data Governance Prozesse erfolgreich modellieren und optimieren
Data Quality Rules introduction
The data quality challenge
Big Data Architecture Workshop - Vahid Amiri
[QCon.ai 2019] People You May Know: Fast Recommendations Over Massive Data
Personalizing LinkedIn Feed
Top 15 Business Intelligence (BI) Software
Ad

Similar to Consistent Transformation of Ratio Metrics for Efficient Online Controlled Experiments (20)

PDF
Additional Descriptive Statistics methods for Data Analysis / Maxim Neronov (...
PDF
Business statistics homework help
PPTX
Machine Learning Powered A/B Testing
PDF
Joseph Jay Williams - WESST - Bridging Research and Practice via MOOClets & C...
PPTX
Statistical techniques used in measurement
PDF
Bivariate Regression
PDF
Introductory Online Controlled Experiments
PPT
Simple (and Simplistic) Introduction to Econometrics and Linear Regression
PDF
Mth201 COMPLETE BOOK
PDF
TUTORIAL _5_Control charts for variables.pptx.pdf
PPT
Hypothesis Test Selection Guide
PPT
Hypothesis Test Selection Guide
PPTX
SURE Model_Panel data.pptx
PPTX
Conversion Optimization: Drive More Sales from Existing Traffic and Increase ...
PPTX
Control Charts in Lab and Trend Analysis
PPT
A05 Continuous One Variable Stat Tests
PPT
A05 Continuous One Variable Stat Tests
PDF
Joseph Jay Williams - WESST - Bridging Research via MOOClets and Collaborativ...
PPTX
Topic 2b .pptx
PPT
Econometrics chapter three and four finance.ppt
Additional Descriptive Statistics methods for Data Analysis / Maxim Neronov (...
Business statistics homework help
Machine Learning Powered A/B Testing
Joseph Jay Williams - WESST - Bridging Research and Practice via MOOClets & C...
Statistical techniques used in measurement
Bivariate Regression
Introductory Online Controlled Experiments
Simple (and Simplistic) Introduction to Econometrics and Linear Regression
Mth201 COMPLETE BOOK
TUTORIAL _5_Control charts for variables.pptx.pdf
Hypothesis Test Selection Guide
Hypothesis Test Selection Guide
SURE Model_Panel data.pptx
Conversion Optimization: Drive More Sales from Existing Traffic and Increase ...
Control Charts in Lab and Trend Analysis
A05 Continuous One Variable Stat Tests
A05 Continuous One Variable Stat Tests
Joseph Jay Williams - WESST - Bridging Research via MOOClets and Collaborativ...
Topic 2b .pptx
Econometrics chapter three and four finance.ppt
Ad

Recently uploaded (20)

PPTX
Machine Learning_overview_presentation.pptx
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PDF
Machine learning based COVID-19 study performance prediction
PPTX
1. Introduction to Computer Programming.pptx
PPT
Teaching material agriculture food technology
PPTX
Group 1 Presentation -Planning and Decision Making .pptx
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PPTX
Tartificialntelligence_presentation.pptx
PDF
Getting Started with Data Integration: FME Form 101
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PDF
Approach and Philosophy of On baking technology
PDF
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PDF
MIND Revenue Release Quarter 2 2025 Press Release
PPTX
Big Data Technologies - Introduction.pptx
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PDF
A comparative analysis of optical character recognition models for extracting...
PDF
Empathic Computing: Creating Shared Understanding
PDF
Assigned Numbers - 2025 - Bluetooth® Document
Machine Learning_overview_presentation.pptx
Mobile App Security Testing_ A Comprehensive Guide.pdf
Machine learning based COVID-19 study performance prediction
1. Introduction to Computer Programming.pptx
Teaching material agriculture food technology
Group 1 Presentation -Planning and Decision Making .pptx
The Rise and Fall of 3GPP – Time for a Sabbatical?
Tartificialntelligence_presentation.pptx
Getting Started with Data Integration: FME Form 101
Advanced methodologies resolving dimensionality complications for autism neur...
“AI and Expert System Decision Support & Business Intelligence Systems”
Approach and Philosophy of On baking technology
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
20250228 LYD VKU AI Blended-Learning.pptx
MIND Revenue Release Quarter 2 2025 Press Release
Big Data Technologies - Introduction.pptx
Diabetes mellitus diagnosis method based random forest with bat algorithm
A comparative analysis of optical character recognition models for extracting...
Empathic Computing: Creating Shared Understanding
Assigned Numbers - 2025 - Bluetooth® Document

Consistent Transformation of Ratio Metrics for Efficient Online Controlled Experiments

  • 2. Consistent Transformation of Ratio Metrics for Efficient Online Controlled Experiments Roman Budylin, Alexey Drutsa, Ilya Katsev, Valeriya Tsoy
  • 3. About Yandex • The largest internet company in Europe • ”Google of Russia”, “Amazon of Russia”, “Uber of Russia” and so on • More than 50M DAU
  • 6. Group A Group B Split them randomly trafficofusers Variant for A Variant for B Expose to one of two variants of the service e.g., the current production version e.g., an evaluated update
  • 7. Group A Group B Split them randomly trafficofusers Variant for A Variant for B Expose to one of two variants of the service e.g., the current production version e.g., an evaluated update Calculate a key measure for each user X(uA1) … Calculate the OEC for each group as the mean value e.g., X(u) is the number of sessions of the user u X(uA2) X(uA3) X(uA4) X(uA5) X(uB1) … X(uB2) X(uB3) X(uB4) X(uB5) µA(X)=avgu in AX(u) µB(X)=avgu in BX(u) Overall Evaluation Criterion (OEC) for the group B Overall Evaluation Criterion (OEC) for the group A
  • 8. µA(X)=avgu in AX(u) µB(X)=avgu in BX(u) Overall Evaluation Criterion (OEC) for the group B Overall Evaluation Criterion (OEC) for the group A Calculate the OEC for each group as the mean value Δ(x) VS 0 Δ(X) = µB(X) – µA(X) the evaluated update is positive or negative Statistical significance test the difference is caused by a noise or the treatment effect (e.g., Student’s t-test) Overall Evaluation Criterion (OEC) [Kohavi et al., DMKD’2009] Overall Acceptance Criterion (OAC) [Drutsa et al., CIKM’2015] Sensitivity Directionality
  • 9. OEC levels 1. User level metrics 2. Non-user level metric For example ratio OEC:
  • 10. Example – average length of session
  • 11. Example – average length of session
  • 12. Example – average length of session > 30m
  • 13. Example – average length of session
  • 14. Example – average length of session Let user u has sessions with lengths Then the average of lengths of all sessions is equal to
  • 18. Existing approaches • Bootstrap test Problem: computational expensiveness • Delta method Problem: does not allow to apply directly regression adjustment techniques
  • 20. Our paper, WSDM 2018 21 Consistent Transformation of Ratio Metrics for Efficient Online Controlled Experiments Roman Budylin, Alexey Drutsa, Ilya Katsev, Valeriya Tsoy Data Quality metrics • OEC metrics • Guard rail metrics • Local feature/Diagnosticmetrics
  • 21. Our paper, WSDM 2018 22 Consistent Transformation of Ratio Metrics for Efficient Online Controlled Experiments Roman Budylin, Alexey Drutsa, Ilya Katsev, Valeriya Tsoy The best comment: «This is too good to be true!» (someone from Facebook) Data Quality metrics • OEC metrics • Guard rail metrics • Local feature/Diagnosticmetrics
  • 22. We found a transformation such that Our contribution
  • 23. We found a transformation such that Ratio OEC User level OEC Our contribution
  • 24. We found a transformation such that Ratio OEC User level OEC NB: Preserve directionality and significance level! Our contribution
  • 25. Let we have a ratio OEC: Consider the next expression: And let us use its average as a metric: Now we got a linearization OEC: Linearization
  • 26. Let A and B be the control and the experiment. Let us denote Theorem 1: Let be positive. Then for any the next is true: Theorems: directionality
  • 27. Theorem 2: Let be positive and . Let be the t-statistic applied for the OEC and let be the be the asymt. standard normal statistic of obtained via the Delta method. If then then under the null hypothesis that 1. the t-statistics is asymptotically normal 2. converges to 1 by probability Theorems: significance level
  • 29. Experiments • Internet search, 2013-2016, 390 experiments • The transformation + regression adjustment • +34% of sensitivity