SlideShare a Scribd company logo
A/B Testing: from key concepts to
advanced techniques
By Anatoly Vuets
Head of Analytics Letyshops
Agenda
A/B test as part of statistical learning
Basic ideas: H0, significance, power, etc.
Common mistakes and misunderstandings
Typical problems and solutions
Bayesian approach to A/B testing
Questions and discussion
A/B testing as part of statistical learning
Prediction Inference
Question What would be our best guess for a new data
point yn+1
?
Question about the population
Y: y1..n
∊ Y stated in the form of binary hypothesis
Answer Estimated value for yn+1
Accepting or rejecting the Null Hypothesis H0
Known answers Observable Unobservable
Way to find the
answer
Model which is trained to minimize chosen loss
function
Binary decision based on comparing test statistics
value and a threshold. The threshold is chosen to
meet desired error tolerance
A/B testing as part of statistical learning
Inference
Basic ideas: H0/H1, power, significance level etc.
Basic ideas: H0/H1, power, significance level etc.
Basic ideas: H0/H1, power, significance level etc.
H0 H1
H0 1 - sign.level
Error T1
sign.level
H1
Error T2
1 - power
power
Test T(s)
Truth
Experiment design
Experiment design
Common mistakes and misunderstandings
1 Starting A-B test without design 2 Early stopping
3 P-value interpretation P.value is not a “probability that B is better then A”
How to interpret results in a ‘simpler way’
Prior beliefs: Likelihoods: Posteriors:
How to combine results from different experiments
What is the probability that the repeated experiment wont
reproduce results?
About 72%! Can be increased
to about 90%
Sample size reduction
Stratification CUPED
Bayesian testing
1. Set you prior beliefs about test statistics as a prior distribution
2. Collect some data
3. Get posterior distribution of the test statistics using Bayes’ theorem
4. Compute expected error
5. Repeat steps 3-5 till expected error drops below a threshold
Bayesian approach to the
inference problem
Different approaches: Bayesian testing
Different approaches: Bayesian testing
Summary and questions
Frequentist framework for A/B testing is more mature overall as it
provides control of errors and well-defined stopping rule.
Bayesian framework helps a lot to alleviate practical issues which
occur when using frequentist A/B testing

More Related Content

PDF
Statistical hypothesis testing in e commerce
PPTX
The Finishing Line
DOCX
Assignment 1 applications of the scientific method
PPTX
Test framing(sqa)
PPTX
Environmental Testing
PDF
Creating an in-house computerized adaptive testing (CAT) program with Concerto
PPTX
Computer Adaptive Test (cat)
PPTX
Introduction to Computerized Adaptive Testing (CAT)
Statistical hypothesis testing in e commerce
The Finishing Line
Assignment 1 applications of the scientific method
Test framing(sqa)
Environmental Testing
Creating an in-house computerized adaptive testing (CAT) program with Concerto
Computer Adaptive Test (cat)
Introduction to Computerized Adaptive Testing (CAT)

What's hot (20)

PPTX
ALT Approaches for Reliability
PPTX
The Nature Of Patterns
PPTX
Expanding our Testing Horizons
PPTX
What makes a good adaptive testing program
PPTX
Computer adaptive testing
PPT
Science Fair Project
PPT
Science Fair Project
PPTX
SBAC What is a CAT
PPT
The Scientific Method
PPT
Software testing one
PDF
Supervised and Unsupervised Machine Learning
PPTX
Promise 2011: Panel - "Practical Software Project Improvements using Actionab...
PPTX
2014 what-is-a-cat-narrated
PPTX
Science Fair Project Presentation
 
PPTX
Progression by Regression: How to increase your A/B Test Velocity
PDF
Bad Metric, Bad!
PPTX
Quantitative Method Assignment Help
PPT
Blackboxtesting 02 An Example Test Series
PDF
Unsupervised Machine Learning Ml And How It Works
PPT
What is the scientific method?
ALT Approaches for Reliability
The Nature Of Patterns
Expanding our Testing Horizons
What makes a good adaptive testing program
Computer adaptive testing
Science Fair Project
Science Fair Project
SBAC What is a CAT
The Scientific Method
Software testing one
Supervised and Unsupervised Machine Learning
Promise 2011: Panel - "Practical Software Project Improvements using Actionab...
2014 what-is-a-cat-narrated
Science Fair Project Presentation
 
Progression by Regression: How to increase your A/B Test Velocity
Bad Metric, Bad!
Quantitative Method Assignment Help
Blackboxtesting 02 An Example Test Series
Unsupervised Machine Learning Ml And How It Works
What is the scientific method?
Ad

Similar to A/B testing from basic concepts to advanced techniques (20)

PDF
Andrii Belas: A/B testing overview: use-cases, theory and tools
PDF
Columbus Data & Analytics Wednesdays - June 2024
PDF
Chris Stuccio - Data science - Conversion Hotel 2015
PDF
Principles Before Practices: Transform Your Testing by Understanding Key Conc...
PPTX
Crash Course in A/B testing
PDF
Data Insights Talk
 
PPTX
Test for business growth - analyzing A/B-test with a Bayesian approach
PPTX
Test for business growth
PPTX
10 Guidelines for A/B Testing
PDF
Optimizely Workshop: Take Action on Results with Statistics
PDF
SurveyDesignTutorial_Session3.pdf
PDF
Conversion Conference Berlin
PPTX
Ab test
PPTX
How can A/B testing go wrong?
PDF
A/B Testing and Experimentation in Data Science
PPTX
ABTest-20231020.pptx
PDF
When in doubt, go live
PDF
Faster and cheaper, smart ab experiments - public ver.
PDF
Statistics for CRO - Conversion Conference London
PPTX
A/B testing problems
Andrii Belas: A/B testing overview: use-cases, theory and tools
Columbus Data & Analytics Wednesdays - June 2024
Chris Stuccio - Data science - Conversion Hotel 2015
Principles Before Practices: Transform Your Testing by Understanding Key Conc...
Crash Course in A/B testing
Data Insights Talk
 
Test for business growth - analyzing A/B-test with a Bayesian approach
Test for business growth
10 Guidelines for A/B Testing
Optimizely Workshop: Take Action on Results with Statistics
SurveyDesignTutorial_Session3.pdf
Conversion Conference Berlin
Ab test
How can A/B testing go wrong?
A/B Testing and Experimentation in Data Science
ABTest-20231020.pptx
When in doubt, go live
Faster and cheaper, smart ab experiments - public ver.
Statistics for CRO - Conversion Conference London
A/B testing problems
Ad

Recently uploaded (20)

PDF
Galatica Smart Energy Infrastructure Startup Pitch Deck
PDF
Fluorescence-microscope_Botany_detailed content
PPTX
Introduction to Knowledge Engineering Part 1
PDF
Introduction to the R Programming Language
PPTX
Computer network topology notes for revision
PPT
Reliability_Chapter_ presentation 1221.5784
PDF
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
PPTX
Data_Analytics_and_PowerBI_Presentation.pptx
PPTX
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
PPT
ISS -ESG Data flows What is ESG and HowHow
PPTX
Introduction to machine learning and Linear Models
PPTX
Supervised vs unsupervised machine learning algorithms
PPTX
STUDY DESIGN details- Lt Col Maksud (21).pptx
PPTX
MODULE 8 - DISASTER risk PREPAREDNESS.pptx
PPTX
Market Analysis -202507- Wind-Solar+Hybrid+Street+Lights+for+the+North+Amer...
PPTX
mbdjdhjjodule 5-1 rhfhhfjtjjhafbrhfnfbbfnb
PPTX
Database Infoormation System (DBIS).pptx
PPTX
STERILIZATION AND DISINFECTION-1.ppthhhbx
PDF
Clinical guidelines as a resource for EBP(1).pdf
Galatica Smart Energy Infrastructure Startup Pitch Deck
Fluorescence-microscope_Botany_detailed content
Introduction to Knowledge Engineering Part 1
Introduction to the R Programming Language
Computer network topology notes for revision
Reliability_Chapter_ presentation 1221.5784
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
Data_Analytics_and_PowerBI_Presentation.pptx
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
ISS -ESG Data flows What is ESG and HowHow
Introduction to machine learning and Linear Models
Supervised vs unsupervised machine learning algorithms
STUDY DESIGN details- Lt Col Maksud (21).pptx
MODULE 8 - DISASTER risk PREPAREDNESS.pptx
Market Analysis -202507- Wind-Solar+Hybrid+Street+Lights+for+the+North+Amer...
mbdjdhjjodule 5-1 rhfhhfjtjjhafbrhfnfbbfnb
Database Infoormation System (DBIS).pptx
STERILIZATION AND DISINFECTION-1.ppthhhbx
Clinical guidelines as a resource for EBP(1).pdf

A/B testing from basic concepts to advanced techniques

  • 1. A/B Testing: from key concepts to advanced techniques By Anatoly Vuets Head of Analytics Letyshops
  • 2. Agenda A/B test as part of statistical learning Basic ideas: H0, significance, power, etc. Common mistakes and misunderstandings Typical problems and solutions Bayesian approach to A/B testing Questions and discussion
  • 3. A/B testing as part of statistical learning Prediction Inference Question What would be our best guess for a new data point yn+1 ? Question about the population Y: y1..n ∊ Y stated in the form of binary hypothesis Answer Estimated value for yn+1 Accepting or rejecting the Null Hypothesis H0 Known answers Observable Unobservable Way to find the answer Model which is trained to minimize chosen loss function Binary decision based on comparing test statistics value and a threshold. The threshold is chosen to meet desired error tolerance
  • 4. A/B testing as part of statistical learning Inference
  • 5. Basic ideas: H0/H1, power, significance level etc.
  • 6. Basic ideas: H0/H1, power, significance level etc.
  • 7. Basic ideas: H0/H1, power, significance level etc. H0 H1 H0 1 - sign.level Error T1 sign.level H1 Error T2 1 - power power Test T(s) Truth
  • 10. Common mistakes and misunderstandings 1 Starting A-B test without design 2 Early stopping 3 P-value interpretation P.value is not a “probability that B is better then A”
  • 11. How to interpret results in a ‘simpler way’ Prior beliefs: Likelihoods: Posteriors:
  • 12. How to combine results from different experiments
  • 13. What is the probability that the repeated experiment wont reproduce results? About 72%! Can be increased to about 90%
  • 15. Bayesian testing 1. Set you prior beliefs about test statistics as a prior distribution 2. Collect some data 3. Get posterior distribution of the test statistics using Bayes’ theorem 4. Compute expected error 5. Repeat steps 3-5 till expected error drops below a threshold Bayesian approach to the inference problem
  • 18. Summary and questions Frequentist framework for A/B testing is more mature overall as it provides control of errors and well-defined stopping rule. Bayesian framework helps a lot to alleviate practical issues which occur when using frequentist A/B testing