SlideShare a Scribd company logo
How is Bayesian Statistics 
Different? 
by Wayne Tai Lee
Goal 
● Clarify the difference between “classical and 
Bayesian Statistics 
● Lay out the pro/con with this “attitude”
One sentence definition 
Bayesian statistics is a mathematical 
framework to update beliefs as you observe 
more data.
Bayesian Update in Movies 
Recall movies where a female character 
realizes her period is late?
Movie cliché: Am I pregnant? 
● What did I do in the past month?
Movie cliché: Am I pregnant? 
● What did I do in the past month? 
– Forms a prior belief of whether I am pregnant
Movie cliché: Am I pregnant? 
● What did I do in the past month? 
– Forms a prior belief of whether I am pregnant 
● The missing period 
– Data!
Movie cliché: Am I pregnant? 
● What did I do in the past month? 
– Forms a prior belief of whether I am pregnant 
● The missing period 
– Data! 
● Belief is updated as more data is observed!
Bayesian terminology 
● Prior: your belief about pregnancy before 
seeing new data 
● Data: missing period 
● Posterior: your belief that is updated after 
seeing the data
How do we formalize this update? 
● Pregnant is a uncertain event with two 
outcomes: Yes or No
How do we formalize this update? 
● Pregnant is a uncertain event with two 
outcomes: Yes or No 
● “Days delayed of period” is a data point 
– If (Pregnant = Yes), delayed ~ 30*9 days 
– If (Pregnant = No), it might come sooner
Mathematical framework 
● “Pregnant” is a random variable: 
– P(Pregnant = Yes) = X 
– P(Pregnant = No) = (1 - X)
Mathematical framework 
● “Pregnant” is a random variable: 
– P(Pregnant = Yes) = X 
– P(Pregnant = No) = (1 - X) 
● “Days delayed of period” is another random 
variable! 
– P(days delay >= 7 days | Pregnant) = 1 
– P(days delay >= 7 days | Not Pregnant) = Y
Simplify 
● Start with the objective: 
Am I pregnant? 
i.e. P(Pregnant | Data)?
Simplify 
● Start with the objective: 
Am I pregnant? 
i.e. P(Pregnant | Data)? 
● Note all the numbers we know are the form of 
P( **** | Pregnant)
Conditional Probability! 
P(Pregnant | Data) 
= P(Data | Pregnant) P(Pregnant) / P(Data)
Conditional Probability! 
P(Pregnant | Data) 
= P(Data | Pregnant) P(Pregnant) / P(Data) 
Immediate implication: 
● If your prior says you cannot be pregnant, 
your belief cannot be changed!
“Bayes Rule” 
P(Pregnant | Data) 
= P(Data | Pregnant) P(Pregnant) / P(Data) 
= P(Data | Pregnant) P(Pregnant) / 
[ P(Data | Pregnant) P(Pregnant) + 
P(Data | Not Pregnant) P(Not Pregnant) ]
“Bayes Rule” 
P(Pregnant | Data) 
= P(Data | Pregnant) P(Pregnant) / P(Data) 
= P(Data | Pregnant) P(Pregnant) / 
[ P(Data | Pregnant) P(Pregnant) + 
P(Data | Not Pregnant) P(Not Pregnant) ] 
Why add more numbers? 
P(Data) was hard to compute, so chop it into 
pieces we know!
P(Data): Big Issue for Bayesians 
● Pregnant is binary which made this realllllly 
easy 
● In general, a lot of “tricks” are trying to 
– solve for P(Data) 
● Belief propagation in graphical models 
– getting around it 
● Sampling: MCMC 
● Approximation: Variational Bayes
Back to the key question: 
P(Pregnant | Data) 
= P(Data | Pregnant) P(Pregnant) / 
[ P(Data | Pregnant) P(Pregnant) + 
P(Data | Not Pregnant) P(Not Pregnant) ] 
= 1 * X / [ 1 * X + Y * (1 - X) ]
Back to the key question:
Can add more data 
….....almost for free! 
● Notice “Data” is quite general: 
– Can add pregnancy strips data to further 
update beliefs! 
– Treat previous outputs as priors then update 
similarly!
So.....what's the big deal? 
● Your belief matters a lot! 
– Your prior changes the outcome 
● Your prior and my prior may be different
What “could” a bad Frequentist 
Do? 
● Calculate the p-value for you, i.e. 
P(Late period | Not Pregnant) 
● Declare that you're Pregnant if this is <= 5%
What “could” a bad Frequentist 
Do? 
● Calculate the p-value for you, i.e. 
P(Late period | Not Pregnant) 
● Declare that you're Pregnant if this is <= 5% 
● Declaration has 5% false positive and a 
certain false negative rates
What “could” a bad Frequentist 
Do? 
● Calculate the p-value for you, i.e. 
P(Late period | Not Pregnant) 
● Declare that you're Pregnant if this is <= 5% 
● Declaration has 5% false positive and a 
certain false negative rates 
● Issue: Not as relevant to you! Rates are for all 
the people using this procedure...not specific 
to your case!
“not as relevant”? 
● There's no consideration of your specific case 
– There was no P(Pregnant) in the p-value 
calculation 
– You could be really sure that you're not 
pregnant....doesn't change the calculation!
What would a Frequentist say? 
● P(Pregnant) = 100% or 0% 
– Fixed but unknown 
– NOT uncertain 
● …Not actually interested in a single event 
– Probabilities are defined for repeated events 
– Will not write down P(Pregnant | Data) 
– For your one case, anything could be true
What would a Frequentist say? 
● P(Pregnant) = 100% or 0% 
– Fixed but unknown 
– NOT uncertain 
● …Not actually interested in a single event 
– Probabilities are defined for repeated events 
– Will not write down P(Pregnant | Data) 
– For your one case, anything could be true 
● Would say “Go talk to a doctor”
Key difference 
● “Attitude” 
– What can be a random variable? 
● Bayesian: Uncertain events 
● Frequentist: Repeatable events
Implications of this attitude 
● Bayesian: 
– Can incorporate prior knowledge easily 
– Can update beliefs easily 
– Can tackle a wider class of problems since 
probabilities are “beliefs”
Implications of this attitude 
● Bayesian: 
– Can incorporate prior knowledge easily 
– Can update beliefs easily 
– Can tackle a wider class of problems since 
probabilities are “beliefs” 
– Must specify a model 
– Your belief can be different from mine 
● Our answers will be different!
Implications of this attitude 
● Frequentist: 
– Probabilities are more objective 
– Harder to cheat 
– Has non-parametric methods
Implications of this attitude 
● Frequentist: 
– Probabilities are more objective 
– Harder to cheat 
– Has non-parametric methods 
– Focused on repeatable events 
– Prior knowledge is introduced in an ad hoc 
format 
– Usually need lots of data
In the end... 
● Frequentist and Bayesian use the same rules 
of probabilities 
● Difference exists in set-up: “What is random?” 
– Bayesians: uncertainty in knowledge 
– Frequentist: intrinsic randomness
Take Home 
● Different problems should use different 
approaches! 
– Both schools are awesome!~ 
● Be aware of what you're using and be 
consistent!

More Related Content

PPTX
Bayesian statistics
PPTX
Bayesian statistics
PDF
Introduction to Bayesian Methods
PPTX
Multinomial Logistic Regression Analysis
PPTX
An introduction to bayesian statistics
PDF
Bayesian intro
PPTX
Bayesian statistics
PPTX
Network meta-analysis & models for inconsistency
Bayesian statistics
Bayesian statistics
Introduction to Bayesian Methods
Multinomial Logistic Regression Analysis
An introduction to bayesian statistics
Bayesian intro
Bayesian statistics
Network meta-analysis & models for inconsistency

What's hot (20)

PPT
Logistic regression
PPTX
Correlation and Regression
ODP
Introduction to Bayesian Statistics
ODP
Multiple Linear Regression II and ANOVA I
PDF
Bayesian inference
PPTX
Lect4 principal component analysis-I
PDF
Multinomial Logistic Regression
PPTX
Probability and Samples: The Distribution of Sample Means
PPTX
Introduction to Maximum Likelihood Estimator
PPT
Metanalysis Lecture
PPTX
Tutorial repeated measures ANOVA
PPT
The Method Of Maximum Likelihood
PDF
Bayesian networks
PPT
bio statistics for clinical research
PPTX
Introduction to principal component analysis (pca)
PDF
Ordinal logistic regression
PDF
Uncertainty Quantification in AI
PPTX
Lecture 6. univariate and bivariate analysis
PPTX
Lect5 principal component analysis
PPTX
Statistical distributions
Logistic regression
Correlation and Regression
Introduction to Bayesian Statistics
Multiple Linear Regression II and ANOVA I
Bayesian inference
Lect4 principal component analysis-I
Multinomial Logistic Regression
Probability and Samples: The Distribution of Sample Means
Introduction to Maximum Likelihood Estimator
Metanalysis Lecture
Tutorial repeated measures ANOVA
The Method Of Maximum Likelihood
Bayesian networks
bio statistics for clinical research
Introduction to principal component analysis (pca)
Ordinal logistic regression
Uncertainty Quantification in AI
Lecture 6. univariate and bivariate analysis
Lect5 principal component analysis
Statistical distributions
Ad

Viewers also liked (18)

PDF
Bayesian statistical concepts
PPT
Bayesian statistics using r intro
PDF
An introduction to Bayesian Statistics using Python
PPTX
Bayes theorem explained
PDF
Mardi 15h05 stat et securite routiere_saint_pierre
PDF
ma12012id536
PPTX
Crash Course in A/B testing
PPTX
Feature selection can hurt model inference
PPTX
Chapter 3 maximum likelihood and bayesian estimation-fix
PPT
3 recursive bayesian estimation
PDF
Understanding your data with Bayesian networks (in Python) by Bartek Wilczyns...
PPTX
Contingency approach
PDF
陳宜欣/大數據下的情緒分析
PDF
Logistic Regression Analysis
PDF
Tout sur les p values
PDF
Evaluation du risque routier 2013
PPTX
Slideshare ppt
Bayesian statistical concepts
Bayesian statistics using r intro
An introduction to Bayesian Statistics using Python
Bayes theorem explained
Mardi 15h05 stat et securite routiere_saint_pierre
ma12012id536
Crash Course in A/B testing
Feature selection can hurt model inference
Chapter 3 maximum likelihood and bayesian estimation-fix
3 recursive bayesian estimation
Understanding your data with Bayesian networks (in Python) by Bartek Wilczyns...
Contingency approach
陳宜欣/大數據下的情緒分析
Logistic Regression Analysis
Tout sur les p values
Evaluation du risque routier 2013
Slideshare ppt
Ad

Similar to What is bayesian statistics and how is it different? (20)

PDF
Bayesian inference and the pest of premature interpretation.
PPT
Introduction to Bayesian Statistics.ppt
PPT
PDF
Causal Inference for Everyone
PDF
Bayesian data analysis1
PDF
1a_Bayes_rule.pdf
PDF
2013.03.26 An Introduction to Modern Statistical Analysis using Bayesian Methods
PDF
2013.03.26 Bayesian Methods for Modern Statistical Analysis
PDF
Slide2 bme adat_2016
PPTX
Bayes Rules _ Bayes' theorem _ Bayes.pptx
PPTX
presentation about Bayes Rules with examples.pptx
PDF
Slide3 bme adat_2015
PPT
Bayesian decision making in clinical research
PDF
Crisis of confidence, p-hacking and the future of psychology
PPTX
2.statistical DEcision makig.pptx
PDF
presentation4.pdf Intro to mcmc methodss
PDF
Bayesian Statistics.pdf
PDF
Research Methodology Module-05
PPTX
Lec13_Bayes.pptx
PDF
An introduction to Bayesian Statistics and its application
Bayesian inference and the pest of premature interpretation.
Introduction to Bayesian Statistics.ppt
Causal Inference for Everyone
Bayesian data analysis1
1a_Bayes_rule.pdf
2013.03.26 An Introduction to Modern Statistical Analysis using Bayesian Methods
2013.03.26 Bayesian Methods for Modern Statistical Analysis
Slide2 bme adat_2016
Bayes Rules _ Bayes' theorem _ Bayes.pptx
presentation about Bayes Rules with examples.pptx
Slide3 bme adat_2015
Bayesian decision making in clinical research
Crisis of confidence, p-hacking and the future of psychology
2.statistical DEcision makig.pptx
presentation4.pdf Intro to mcmc methodss
Bayesian Statistics.pdf
Research Methodology Module-05
Lec13_Bayes.pptx
An introduction to Bayesian Statistics and its application

Recently uploaded (20)

PPTX
1st Inaugural Professorial Lecture held on 19th February 2020 (Governance and...
PDF
Sports Quiz easy sports quiz sports quiz
PDF
FourierSeries-QuestionsWithAnswers(Part-A).pdf
PPTX
Cell Types and Its function , kingdom of life
PDF
Microbial disease of the cardiovascular and lymphatic systems
PDF
Black Hat USA 2025 - Micro ICS Summit - ICS/OT Threat Landscape
PDF
Chapter 2 Heredity, Prenatal Development, and Birth.pdf
PPTX
PPH.pptx obstetrics and gynecology in nursing
PPTX
Lesson notes of climatology university.
PPTX
Pharmacology of Heart Failure /Pharmacotherapy of CHF
PPTX
school management -TNTEU- B.Ed., Semester II Unit 1.pptx
PDF
Saundersa Comprehensive Review for the NCLEX-RN Examination.pdf
PPTX
Microbial diseases, their pathogenesis and prophylaxis
PPTX
Final Presentation General Medicine 03-08-2024.pptx
PDF
Module 4: Burden of Disease Tutorial Slides S2 2025
PDF
Basic Mud Logging Guide for educational purpose
PPTX
Introduction_to_Human_Anatomy_and_Physiology_for_B.Pharm.pptx
PDF
Anesthesia in Laparoscopic Surgery in India
PDF
01-Introduction-to-Information-Management.pdf
PPTX
master seminar digital applications in india
1st Inaugural Professorial Lecture held on 19th February 2020 (Governance and...
Sports Quiz easy sports quiz sports quiz
FourierSeries-QuestionsWithAnswers(Part-A).pdf
Cell Types and Its function , kingdom of life
Microbial disease of the cardiovascular and lymphatic systems
Black Hat USA 2025 - Micro ICS Summit - ICS/OT Threat Landscape
Chapter 2 Heredity, Prenatal Development, and Birth.pdf
PPH.pptx obstetrics and gynecology in nursing
Lesson notes of climatology university.
Pharmacology of Heart Failure /Pharmacotherapy of CHF
school management -TNTEU- B.Ed., Semester II Unit 1.pptx
Saundersa Comprehensive Review for the NCLEX-RN Examination.pdf
Microbial diseases, their pathogenesis and prophylaxis
Final Presentation General Medicine 03-08-2024.pptx
Module 4: Burden of Disease Tutorial Slides S2 2025
Basic Mud Logging Guide for educational purpose
Introduction_to_Human_Anatomy_and_Physiology_for_B.Pharm.pptx
Anesthesia in Laparoscopic Surgery in India
01-Introduction-to-Information-Management.pdf
master seminar digital applications in india

What is bayesian statistics and how is it different?

  • 1. How is Bayesian Statistics Different? by Wayne Tai Lee
  • 2. Goal ● Clarify the difference between “classical and Bayesian Statistics ● Lay out the pro/con with this “attitude”
  • 3. One sentence definition Bayesian statistics is a mathematical framework to update beliefs as you observe more data.
  • 4. Bayesian Update in Movies Recall movies where a female character realizes her period is late?
  • 5. Movie cliché: Am I pregnant? ● What did I do in the past month?
  • 6. Movie cliché: Am I pregnant? ● What did I do in the past month? – Forms a prior belief of whether I am pregnant
  • 7. Movie cliché: Am I pregnant? ● What did I do in the past month? – Forms a prior belief of whether I am pregnant ● The missing period – Data!
  • 8. Movie cliché: Am I pregnant? ● What did I do in the past month? – Forms a prior belief of whether I am pregnant ● The missing period – Data! ● Belief is updated as more data is observed!
  • 9. Bayesian terminology ● Prior: your belief about pregnancy before seeing new data ● Data: missing period ● Posterior: your belief that is updated after seeing the data
  • 10. How do we formalize this update? ● Pregnant is a uncertain event with two outcomes: Yes or No
  • 11. How do we formalize this update? ● Pregnant is a uncertain event with two outcomes: Yes or No ● “Days delayed of period” is a data point – If (Pregnant = Yes), delayed ~ 30*9 days – If (Pregnant = No), it might come sooner
  • 12. Mathematical framework ● “Pregnant” is a random variable: – P(Pregnant = Yes) = X – P(Pregnant = No) = (1 - X)
  • 13. Mathematical framework ● “Pregnant” is a random variable: – P(Pregnant = Yes) = X – P(Pregnant = No) = (1 - X) ● “Days delayed of period” is another random variable! – P(days delay >= 7 days | Pregnant) = 1 – P(days delay >= 7 days | Not Pregnant) = Y
  • 14. Simplify ● Start with the objective: Am I pregnant? i.e. P(Pregnant | Data)?
  • 15. Simplify ● Start with the objective: Am I pregnant? i.e. P(Pregnant | Data)? ● Note all the numbers we know are the form of P( **** | Pregnant)
  • 16. Conditional Probability! P(Pregnant | Data) = P(Data | Pregnant) P(Pregnant) / P(Data)
  • 17. Conditional Probability! P(Pregnant | Data) = P(Data | Pregnant) P(Pregnant) / P(Data) Immediate implication: ● If your prior says you cannot be pregnant, your belief cannot be changed!
  • 18. “Bayes Rule” P(Pregnant | Data) = P(Data | Pregnant) P(Pregnant) / P(Data) = P(Data | Pregnant) P(Pregnant) / [ P(Data | Pregnant) P(Pregnant) + P(Data | Not Pregnant) P(Not Pregnant) ]
  • 19. “Bayes Rule” P(Pregnant | Data) = P(Data | Pregnant) P(Pregnant) / P(Data) = P(Data | Pregnant) P(Pregnant) / [ P(Data | Pregnant) P(Pregnant) + P(Data | Not Pregnant) P(Not Pregnant) ] Why add more numbers? P(Data) was hard to compute, so chop it into pieces we know!
  • 20. P(Data): Big Issue for Bayesians ● Pregnant is binary which made this realllllly easy ● In general, a lot of “tricks” are trying to – solve for P(Data) ● Belief propagation in graphical models – getting around it ● Sampling: MCMC ● Approximation: Variational Bayes
  • 21. Back to the key question: P(Pregnant | Data) = P(Data | Pregnant) P(Pregnant) / [ P(Data | Pregnant) P(Pregnant) + P(Data | Not Pregnant) P(Not Pregnant) ] = 1 * X / [ 1 * X + Y * (1 - X) ]
  • 22. Back to the key question:
  • 23. Can add more data ….....almost for free! ● Notice “Data” is quite general: – Can add pregnancy strips data to further update beliefs! – Treat previous outputs as priors then update similarly!
  • 24. So.....what's the big deal? ● Your belief matters a lot! – Your prior changes the outcome ● Your prior and my prior may be different
  • 25. What “could” a bad Frequentist Do? ● Calculate the p-value for you, i.e. P(Late period | Not Pregnant) ● Declare that you're Pregnant if this is <= 5%
  • 26. What “could” a bad Frequentist Do? ● Calculate the p-value for you, i.e. P(Late period | Not Pregnant) ● Declare that you're Pregnant if this is <= 5% ● Declaration has 5% false positive and a certain false negative rates
  • 27. What “could” a bad Frequentist Do? ● Calculate the p-value for you, i.e. P(Late period | Not Pregnant) ● Declare that you're Pregnant if this is <= 5% ● Declaration has 5% false positive and a certain false negative rates ● Issue: Not as relevant to you! Rates are for all the people using this procedure...not specific to your case!
  • 28. “not as relevant”? ● There's no consideration of your specific case – There was no P(Pregnant) in the p-value calculation – You could be really sure that you're not pregnant....doesn't change the calculation!
  • 29. What would a Frequentist say? ● P(Pregnant) = 100% or 0% – Fixed but unknown – NOT uncertain ● …Not actually interested in a single event – Probabilities are defined for repeated events – Will not write down P(Pregnant | Data) – For your one case, anything could be true
  • 30. What would a Frequentist say? ● P(Pregnant) = 100% or 0% – Fixed but unknown – NOT uncertain ● …Not actually interested in a single event – Probabilities are defined for repeated events – Will not write down P(Pregnant | Data) – For your one case, anything could be true ● Would say “Go talk to a doctor”
  • 31. Key difference ● “Attitude” – What can be a random variable? ● Bayesian: Uncertain events ● Frequentist: Repeatable events
  • 32. Implications of this attitude ● Bayesian: – Can incorporate prior knowledge easily – Can update beliefs easily – Can tackle a wider class of problems since probabilities are “beliefs”
  • 33. Implications of this attitude ● Bayesian: – Can incorporate prior knowledge easily – Can update beliefs easily – Can tackle a wider class of problems since probabilities are “beliefs” – Must specify a model – Your belief can be different from mine ● Our answers will be different!
  • 34. Implications of this attitude ● Frequentist: – Probabilities are more objective – Harder to cheat – Has non-parametric methods
  • 35. Implications of this attitude ● Frequentist: – Probabilities are more objective – Harder to cheat – Has non-parametric methods – Focused on repeatable events – Prior knowledge is introduced in an ad hoc format – Usually need lots of data
  • 36. In the end... ● Frequentist and Bayesian use the same rules of probabilities ● Difference exists in set-up: “What is random?” – Bayesians: uncertainty in knowledge – Frequentist: intrinsic randomness
  • 37. Take Home ● Different problems should use different approaches! – Both schools are awesome!~ ● Be aware of what you're using and be consistent!