What is probability?
In 2015 there were about 4 million babies born in the US, and 48.8% of the newborns
were girls.
We write: P(newborn is a girl)= 48.8%.
The probability of an event is defined as the proportion of times this event occurs in
many repetitions.
This is the standard definition of probility. It requires that it is possible to repeat this
chance experiment many times.
While interned during the Second World War, John Kerrich tossed a coin 10,000 times
and observed 5,067 tosses resulting in heads.
1 / 2
What is probability?
The long-run interpretation of probability can make it difficult to interpret it for single
event:
‘What I was wrong about this year’ by David Leonhardt (12/24/2017)
Sometimes people use a different interpretation:
‘The probability that my best friend calls today is 30%’.
Such a ‘subjective probability’ is not based on experiments, and different people may
assign different subjective probabilities to the same event.
2 / 2
Four basic rules
Probabilities are always between 0 and 1. Computing probabilities comes down to
applying a few basic rules repeatedly:
We saw that P(newborn is a girl)= 48.8%.
Therefore P(newborn is a boy)= 51.2%.
More formally, write A for an event, such as A=’newborn is a girl’.
Complement rule: P(A does not occur) = 1−P(A)
Now let’s look at a different chance experiment: Rolling a die.
Since each of its six faces is equally likely to come up, each has probability 1/6.
Rule for equally likely outcomes: If there are n possible outcomes and they are equally
likely, then P(A) = number of outcomes in A
n
1 / 3
Four basic rules
The next two rules make it possible to express probabilities of multiple events as those
of the individual events.
A and B are mutually exclusive if they cannot occur at the same time.
As an example, we roll a die twice.
A=  on first roll, B= on first roll, C=  on second roll
A and B are mutually exclusive, but A and C are not.
Addition rule: If A and B are mutually exclusive, then
P(A or B) = P(A) + P(B)
Two events are independent if knowing that one occurs does not change the probability
that the other occurs.
B and C are independent, but A and B are not.
Multiplication rule: If A and B are independent, then
P(A and B) = P(A) P(B)
2 / 3
Four basic rules
Roll a die three times. What is P(at least one ) ?
We could write ‘at least one ’ as follows:
on the first roll or on the second roll or on the third roll.
But these three events are not mutually exclusive, so we can’t use the addition rule.
Better way: Start with the complement rule.
P(at least one in three rolls)
= 1−P(no in three rolls)
= 1−P (no in first roll) and (no in second roll) and (no in third roll)

= 1−P(no in first roll)× P(no in second roll)×P(no in third roll)
= 1 − 5
6 × 5
6 × 5
6
= 41.1%
3 / 3
Conditional probability
Spam e-mail has a higher chance to contain the word ‘money’ than ham e-mail:
P(‘money’ in e-mail | spam)= 8%, P(‘money’ in e-mail | ham)= 1%.
The conditional probability of B given A is
P(B|A) =
P(A and B)
P(A)
General multiplication rule: P(A and B) = P(A) P(B|A)
In the special case where A and B are independent: P(A and B) = P(A) P(B)
1 / 4
Conditional probability
Computing probabilities by total enumeration:
P(spam) = 20%. What is the probability that ‘money’ appears in an e-mail?
From data we know P(money | spam)= 8%, P(money | ham)= 1%.
Idea is to artificially introduce the event ‘spam/ham’.
The event ‘money appears in the e-mail’ can be written as:
money appears and e-mail is spam or money appears and e-mail is ham
P(money appears)
=P(money and spam) + P(money and ham)
=P(money | spam) P(spam) + P(money | ham) P(ham)
= 0.08 × 0.2 + 0.01 × 0.8
= 2.4%
2 / 4
Bayes’ rule
From data we know P(money appears in e-mail | e-mail is spam)= 8%, but what we
need to build a spam filter is P(e-mail is spam | money appears in e-mail).
P(B|A)=
P(A and B)
P(A)
=
P(B and A)
P(A)
=
P(A|B) P(B)
P(A)
P(spam | money)=
P(money | spam) P(spam)
P(money)
= 0.08×0.2
0.024 = 67%
Bayes’ rule:
P(B|A)=
P(A|B) P(B)
P(A)
=
P(A|B) P(B)
P(A|B)P(B) + P(A|not B) P(not B)
3 / 4
Bayesian analysis
The spam filter classifies e-mail as spam via a Bayesian analysis:
I Before examining the e-mail, there is a prior probability of 20% that it is spam.
I After examining the e-mail for certain keywords such as ‘money’, the filter updates
this prior probability using Bayes’ rule to arrive at the posterior probability that the
e-mail is spam.
4 / 4
Examples and case studies: False positives
1% of the population has a certain disease. If an infected person is tested, then there is
a 95% chance that the test is positive. If the person is not infected, then there is a 2%
chance that the test gives an erroneous positive result (‘false positive’).
Given that a person tests positive, what are the chances that he has the disease?
Know P(D)= 1% , P(+|D)= 95% , P(+|no D)= 2%.
Want P(D|+) =
P(+|D) P(D)
P(+)
=
P(+|D) P(D)
P(+|D) P(D) + P(+|no D) P(no D)
= 0.95×0.01
0.95×0.01+0.02×0.99 = 32.4%
1 / 3
Case study: Warner’s randomized response model
What percentage of students have cheated during an exam in college?
Problem: Students may be too embarrassed to answer truthfully.
Randomization comes to the rescue:
We do a survey that first instructs students to toss a coin twice. If the student gets
‘tails’ on the first toss, then the student has to answer question 1, otherwise the
student answers question 2.
Q1: Have you ever cheated on an exam in college?
Q2: Did you get ‘tails’ on the second toss?
So the answer will be partly random: We don’t know whether a ‘yes’ answer is due to
the student cheating or to getting tails on the second toss. This should put the student
at ease to answer truthfully.
2 / 3
Key point: While we don’t know what an individual ‘yes’ means, we can estimate the
proportion of cheaters using all the answers collectively:
P(yes) = P(yes and Q1) + P(yes and Q2)
= P(yes | Q1) P(Q1) + P(yes | Q2) P(Q2)
Solve for P(yes | Q1) =
P(yes)−P(yes | Q2) P(Q2)
P(Q1)
In one survey, 27 students answered ‘yes’ and 30 answered ‘no’.
So we estimate P(yes) = 27
27+30 = 47% and get P(yes | Q1)= 0.47−0.5×0.5
0.5 = 44%
3 / 3

More Related Content

PPTX
Probability basics and bayes' theorem
PPTX
conditional probability from probability.pptx
PPTX
loi de distribution 2024preventive medcine introduction in biostatistics
PPTX
Complements and Conditional Probability, and Bayes' Theorem
PPT
PPTX
Probability Theory
PPT
Probability theory graphical model seee.ppt
PPTX
Introduction to probability
Probability basics and bayes' theorem
conditional probability from probability.pptx
loi de distribution 2024preventive medcine introduction in biostatistics
Complements and Conditional Probability, and Bayes' Theorem
Probability Theory
Probability theory graphical model seee.ppt
Introduction to probability

Similar to 03 Probability.pdf (20)

PPTX
Addition rule and multiplication rule
PDF
Basic concepts of probability
PPTX
Basic probability concept
PPTX
Join-Probability-Distribution-Estuita-Marjorie-M.pptx
PPT
Chapter 4 260110 044531
PPTX
Complements conditional probability bayes theorem
PDF
Probability Theory MSc BA Sem 2.pdf
PPT
Statistics: Probability
PPTX
Probability definitions and properties
PPTX
Statistical Analysis with R -II
PPTX
basic probability Lecture 9.pptx
PPTX
Laws of probability
PPTX
MATHS CA1.pptxivugviviyiygiygiygiygiuygiygigi
PPTX
Probabilty1.pptx
ODP
QT1 - 04 - Probability
PPTX
ch4 probablity and probablity destrubition
PPTX
603-probability mj.pptx
PPTX
Probability and Randomness
PPTX
CHAPTER 6 - PROBABILITY, BUSINESS STATISTICS.pptx
Addition rule and multiplication rule
Basic concepts of probability
Basic probability concept
Join-Probability-Distribution-Estuita-Marjorie-M.pptx
Chapter 4 260110 044531
Complements conditional probability bayes theorem
Probability Theory MSc BA Sem 2.pdf
Statistics: Probability
Probability definitions and properties
Statistical Analysis with R -II
basic probability Lecture 9.pptx
Laws of probability
MATHS CA1.pptxivugviviyiygiygiygiygiuygiygigi
Probabilty1.pptx
QT1 - 04 - Probability
ch4 probablity and probablity destrubition
603-probability mj.pptx
Probability and Randomness
CHAPTER 6 - PROBABILITY, BUSINESS STATISTICS.pptx
Ad

Recently uploaded (20)

PPTX
Module 8- Technological and Communication Skills.pptx
PPTX
CyberSecurity Mobile and Wireless Devices
PDF
BIO-INSPIRED ARCHITECTURE FOR PARSIMONIOUS CONVERSATIONAL INTELLIGENCE : THE ...
PPTX
Fundamentals of Mechanical Engineering.pptx
PDF
737-MAX_SRG.pdf student reference guides
PDF
distributed database system" (DDBS) is often used to refer to both the distri...
PPTX
Information Storage and Retrieval Techniques Unit III
PDF
August 2025 - Top 10 Read Articles in Network Security & Its Applications
PPTX
tack Data Structure with Array and Linked List Implementation, Push and Pop O...
PDF
ChapteR012372321DFGDSFGDFGDFSGDFGDFGDFGSDFGDFGFD
PPTX
communication and presentation skills 01
PDF
August -2025_Top10 Read_Articles_ijait.pdf
PPTX
Management Information system : MIS-e-Business Systems.pptx
PPT
Total quality management ppt for engineering students
PPTX
Graph Data Structures with Types, Traversals, Connectivity, and Real-Life App...
PDF
EXPLORING LEARNING ENGAGEMENT FACTORS INFLUENCING BEHAVIORAL, COGNITIVE, AND ...
PDF
Improvement effect of pyrolyzed agro-food biochar on the properties of.pdf
PPTX
Amdahl’s law is explained in the above power point presentations
PDF
UNIT no 1 INTRODUCTION TO DBMS NOTES.pdf
PDF
Level 2 – IBM Data and AI Fundamentals (1)_v1.1.PDF
Module 8- Technological and Communication Skills.pptx
CyberSecurity Mobile and Wireless Devices
BIO-INSPIRED ARCHITECTURE FOR PARSIMONIOUS CONVERSATIONAL INTELLIGENCE : THE ...
Fundamentals of Mechanical Engineering.pptx
737-MAX_SRG.pdf student reference guides
distributed database system" (DDBS) is often used to refer to both the distri...
Information Storage and Retrieval Techniques Unit III
August 2025 - Top 10 Read Articles in Network Security & Its Applications
tack Data Structure with Array and Linked List Implementation, Push and Pop O...
ChapteR012372321DFGDSFGDFGDFSGDFGDFGDFGSDFGDFGFD
communication and presentation skills 01
August -2025_Top10 Read_Articles_ijait.pdf
Management Information system : MIS-e-Business Systems.pptx
Total quality management ppt for engineering students
Graph Data Structures with Types, Traversals, Connectivity, and Real-Life App...
EXPLORING LEARNING ENGAGEMENT FACTORS INFLUENCING BEHAVIORAL, COGNITIVE, AND ...
Improvement effect of pyrolyzed agro-food biochar on the properties of.pdf
Amdahl’s law is explained in the above power point presentations
UNIT no 1 INTRODUCTION TO DBMS NOTES.pdf
Level 2 – IBM Data and AI Fundamentals (1)_v1.1.PDF
Ad

03 Probability.pdf

  • 1. What is probability? In 2015 there were about 4 million babies born in the US, and 48.8% of the newborns were girls. We write: P(newborn is a girl)= 48.8%. The probability of an event is defined as the proportion of times this event occurs in many repetitions. This is the standard definition of probility. It requires that it is possible to repeat this chance experiment many times. While interned during the Second World War, John Kerrich tossed a coin 10,000 times and observed 5,067 tosses resulting in heads. 1 / 2
  • 2. What is probability? The long-run interpretation of probability can make it difficult to interpret it for single event: ‘What I was wrong about this year’ by David Leonhardt (12/24/2017) Sometimes people use a different interpretation: ‘The probability that my best friend calls today is 30%’. Such a ‘subjective probability’ is not based on experiments, and different people may assign different subjective probabilities to the same event. 2 / 2
  • 3. Four basic rules Probabilities are always between 0 and 1. Computing probabilities comes down to applying a few basic rules repeatedly: We saw that P(newborn is a girl)= 48.8%. Therefore P(newborn is a boy)= 51.2%. More formally, write A for an event, such as A=’newborn is a girl’. Complement rule: P(A does not occur) = 1−P(A) Now let’s look at a different chance experiment: Rolling a die. Since each of its six faces is equally likely to come up, each has probability 1/6. Rule for equally likely outcomes: If there are n possible outcomes and they are equally likely, then P(A) = number of outcomes in A n 1 / 3
  • 4. Four basic rules The next two rules make it possible to express probabilities of multiple events as those of the individual events. A and B are mutually exclusive if they cannot occur at the same time. As an example, we roll a die twice. A= on first roll, B= on first roll, C= on second roll A and B are mutually exclusive, but A and C are not. Addition rule: If A and B are mutually exclusive, then P(A or B) = P(A) + P(B) Two events are independent if knowing that one occurs does not change the probability that the other occurs. B and C are independent, but A and B are not. Multiplication rule: If A and B are independent, then P(A and B) = P(A) P(B) 2 / 3
  • 5. Four basic rules Roll a die three times. What is P(at least one ) ? We could write ‘at least one ’ as follows: on the first roll or on the second roll or on the third roll. But these three events are not mutually exclusive, so we can’t use the addition rule. Better way: Start with the complement rule. P(at least one in three rolls) = 1−P(no in three rolls) = 1−P (no in first roll) and (no in second roll) and (no in third roll) = 1−P(no in first roll)× P(no in second roll)×P(no in third roll) = 1 − 5 6 × 5 6 × 5 6 = 41.1% 3 / 3
  • 6. Conditional probability Spam e-mail has a higher chance to contain the word ‘money’ than ham e-mail: P(‘money’ in e-mail | spam)= 8%, P(‘money’ in e-mail | ham)= 1%. The conditional probability of B given A is P(B|A) = P(A and B) P(A) General multiplication rule: P(A and B) = P(A) P(B|A) In the special case where A and B are independent: P(A and B) = P(A) P(B) 1 / 4
  • 7. Conditional probability Computing probabilities by total enumeration: P(spam) = 20%. What is the probability that ‘money’ appears in an e-mail? From data we know P(money | spam)= 8%, P(money | ham)= 1%. Idea is to artificially introduce the event ‘spam/ham’. The event ‘money appears in the e-mail’ can be written as: money appears and e-mail is spam or money appears and e-mail is ham P(money appears) =P(money and spam) + P(money and ham) =P(money | spam) P(spam) + P(money | ham) P(ham) = 0.08 × 0.2 + 0.01 × 0.8 = 2.4% 2 / 4
  • 8. Bayes’ rule From data we know P(money appears in e-mail | e-mail is spam)= 8%, but what we need to build a spam filter is P(e-mail is spam | money appears in e-mail). P(B|A)= P(A and B) P(A) = P(B and A) P(A) = P(A|B) P(B) P(A) P(spam | money)= P(money | spam) P(spam) P(money) = 0.08×0.2 0.024 = 67% Bayes’ rule: P(B|A)= P(A|B) P(B) P(A) = P(A|B) P(B) P(A|B)P(B) + P(A|not B) P(not B) 3 / 4
  • 9. Bayesian analysis The spam filter classifies e-mail as spam via a Bayesian analysis: I Before examining the e-mail, there is a prior probability of 20% that it is spam. I After examining the e-mail for certain keywords such as ‘money’, the filter updates this prior probability using Bayes’ rule to arrive at the posterior probability that the e-mail is spam. 4 / 4
  • 10. Examples and case studies: False positives 1% of the population has a certain disease. If an infected person is tested, then there is a 95% chance that the test is positive. If the person is not infected, then there is a 2% chance that the test gives an erroneous positive result (‘false positive’). Given that a person tests positive, what are the chances that he has the disease? Know P(D)= 1% , P(+|D)= 95% , P(+|no D)= 2%. Want P(D|+) = P(+|D) P(D) P(+) = P(+|D) P(D) P(+|D) P(D) + P(+|no D) P(no D) = 0.95×0.01 0.95×0.01+0.02×0.99 = 32.4% 1 / 3
  • 11. Case study: Warner’s randomized response model What percentage of students have cheated during an exam in college? Problem: Students may be too embarrassed to answer truthfully. Randomization comes to the rescue: We do a survey that first instructs students to toss a coin twice. If the student gets ‘tails’ on the first toss, then the student has to answer question 1, otherwise the student answers question 2. Q1: Have you ever cheated on an exam in college? Q2: Did you get ‘tails’ on the second toss? So the answer will be partly random: We don’t know whether a ‘yes’ answer is due to the student cheating or to getting tails on the second toss. This should put the student at ease to answer truthfully. 2 / 3
  • 12. Key point: While we don’t know what an individual ‘yes’ means, we can estimate the proportion of cheaters using all the answers collectively: P(yes) = P(yes and Q1) + P(yes and Q2) = P(yes | Q1) P(Q1) + P(yes | Q2) P(Q2) Solve for P(yes | Q1) = P(yes)−P(yes | Q2) P(Q2) P(Q1) In one survey, 27 students answered ‘yes’ and 30 answered ‘no’. So we estimate P(yes) = 27 27+30 = 47% and get P(yes | Q1)= 0.47−0.5×0.5 0.5 = 44% 3 / 3