Basic Probability Theory and
Statistics
These are some very fundamental terms and concepts
from probability and statistics that come up often
in the literature on Machine Learning and AI.
Random Experiment
Sample Space
Random Variables
Probability
Conditional Probability
Variance
Probability Distribution
Joint Probability Distribution
Conditional Probability Distribution (CPD)
Factor
A random experiment is a physical situation
whose outcome cannot be predicted until it is
observed.
Random Experiment
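As a small illustration of a random experiment, the sketch below simulates tossing a fair coin. The coin, the helper name `toss_coin` and the seed are assumptions made for the example; the point is only that each result is unknown until the toss is actually run.

```python
import random

def toss_coin(rng):
    """Run one random experiment: toss a fair coin and observe the result."""
    return rng.choice(["H", "T"])

# The seed is fixed only so that the sketch is repeatable.
rng = random.Random(0)
outcomes = [toss_coin(rng) for _ in range(10)]
print(outcomes)
```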
A sample space is the set of all possible outcomes of
a random experiment.
Sample Space
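A sample space can be written down explicitly for simple experiments. The sketch below does so for a coin toss, a die roll and a roll of two dice; these experiments and the variable names are chosen for illustration only.

```python
from itertools import product

# Sample spaces for three simple experiments (illustrative choices).
coin_space = {"H", "T"}
die_space = set(range(1, 7))
two_dice_space = set(product(range(1, 7), repeat=2))  # ordered pairs (first, second)

print(len(coin_space), len(die_space), len(two_dice_space))
```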
A random variable is a variable whose possible
values are numerical outcomes of a random
experiment. There are two types of random
variables.
A Discrete Random Variable is one which may
take on only a countable number of distinct
values, such as 0, 1, 2, 3, 4, … Discrete random
variables are usually (but not necessarily)
counts.
A Continuous Random Variable is one which can
take any value in an interval, so its possible
values are uncountably infinite. Continuous
random variables are usually measurements.
Random Variables
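The sketch below contrasts the two kinds of random variable. The discrete one is the total shown by two dice (a count-like quantity), and the continuous one is a simulated measurement drawn uniformly from the interval 0 to 1. Both examples, and the names `total` and `measurement`, are assumptions made for illustration.

```python
import random
from itertools import product

# Discrete random variable: the total shown by two dice.
sample_space = list(product(range(1, 7), repeat=2))

def total(outcome):
    """Map an outcome (first, second) to a number."""
    return outcome[0] + outcome[1]

discrete_values = sorted({total(o) for o in sample_space})
print(discrete_values)  # a finite list of integers

# Continuous random variable: a measurement that can fall anywhere in [0, 1].
rng = random.Random(1)  # seeded only for repeatability
measurement = rng.uniform(0.0, 1.0)
print(measurement)
```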
Probability is the measure of the likelihood that
an event will occur in a Random Experiment.
Probability is quantified as a number between 0
and 1, where, loosely speaking, 0 indicates
impossibility and 1 indicates certainty.
The higher the probability of an event, the more
likely it is that the event will occur.
Probability
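When every outcome in the sample space is equally likely, the probability of an event is the number of outcomes in the event divided by the number of outcomes in the sample space. The sketch below assumes a fair six-sided die; the helper `probability` is a name introduced here, and the equally-likely assumption does not hold for every experiment.

```python
from fractions import Fraction

sample_space = set(range(1, 7))  # a fair six-sided die (assumed)

def probability(event, space):
    """P(event) when every outcome in `space` is equally likely."""
    return Fraction(len(event & space), len(space))

even = {2, 4, 6}
print(probability(even, sample_space))          # 1/2
print(probability(set(), sample_space))         # 0, an impossible event
print(probability(sample_space, sample_space))  # 1, a certain event
```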
Conditional Probability is a measure of the
probability of an event given that (by assumption,
presumption, assertion or evidence) another
event has already occurred.
If the event of interest is A and the event B is
known or assumed to have occurred, “the
conditional probability of A given B” is usually
written as P(A|B).
Conditional Probability
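Conditional probability is commonly computed as P(A|B) = P(A and B) / P(B), which is defined only when P(B) is greater than 0. The sketch below applies this to a fair die; the die, the events A and B, and the helper names `prob` and `conditional` are assumptions made for the example.

```python
from fractions import Fraction

space = set(range(1, 7))  # a fair six-sided die (assumed)

def prob(event):
    """P(event) under equally likely outcomes."""
    return Fraction(len(event & space), len(space))

def conditional(a, b):
    """P(a | b) = P(a and b) / P(b); undefined when P(b) is zero."""
    if prob(b) == 0:
        raise ValueError("P(B) must be positive")
    return prob(a & b) / prob(b)

A = {6}        # the roll is a six
B = {2, 4, 6}  # the roll is even
print(prob(A))            # 1/6
print(conditional(A, B))  # 1/3
```

Knowing that the roll is even raises the probability of a six from 1/6 to 1/3.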
The variance of a random variable X is a measure
of how spread out the distribution of X is around
its mean: the smaller the variance, the more
concentrated the distribution.
Variance
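For a discrete random variable, the variance is usually defined as Var(X) = E[(X - mu)^2], where mu = E[X] is the mean. The sketch below computes both for a fair die; the die and the helper names `mean` and `variance` are assumptions made for illustration.

```python
from fractions import Fraction

# PMF of a fair six-sided die (assumed example).
pmf = {x: Fraction(1, 6) for x in range(1, 7)}

def mean(pmf):
    """E[X]: each value weighted by its probability."""
    return sum(x * p for x, p in pmf.items())

def variance(pmf):
    """Var(X) = E[(X - mu)^2]."""
    mu = mean(pmf)
    return sum((x - mu) ** 2 * p for x, p in pmf.items())

print(mean(pmf))      # 7/2
print(variance(pmf))  # 35/12
```

A variable that always takes the same value has variance 0, the most concentrated case.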
A probability distribution is a mathematical
function that maps each possible outcome of a
random experiment to its associated probability.
Its form depends on whether the random variable X
is discrete or continuous.
Discrete Probability Distribution: A discrete
probability function, p(x), is a function that
satisfies the following properties: p(x) is
non-negative for every x, and the values of p(x)
sum to 1 over all possible x. It is referred to
as the Probability Mass Function.
Continuous Probability Distribution: A continuous
probability function, f(x), is a function that
satisfies the following properties: f(x) is
non-negative for every x, the integral of f(x)
over all x is 1, and the probability that X lies
between two points a and b is the integral of
f(x) from a to b. It is referred to as the
Probability Density Function.
Probability Distribution
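The standard requirements on a probability mass function (non-negative values that sum to 1) and on a probability density function (non-negative values that integrate to 1) can be checked numerically. The sketch below does this for a fair die and for a uniform density on the interval 0 to 2. Both distributions, the helper names and the midpoint-rule integrator are assumptions chosen for illustration.

```python
from fractions import Fraction

def is_valid_pmf(pmf):
    """A PMF must be non-negative and sum to exactly 1."""
    return all(p >= 0 for p in pmf.values()) and sum(pmf.values()) == 1

die_pmf = {x: Fraction(1, 6) for x in range(1, 7)}
print(is_valid_pmf(die_pmf))  # True

def uniform_pdf(x, low=0.0, high=2.0):
    """Density of a uniform distribution on [low, high]."""
    return 1.0 / (high - low) if low <= x <= high else 0.0

def integrate(f, a, b, steps=10_000):
    """Midpoint-rule approximation of the integral of f from a to b."""
    width = (b - a) / steps
    return sum(f(a + (i + 0.5) * width) for i in range(steps)) * width

total_area = integrate(uniform_pdf, 0.0, 2.0)  # close to 1
p_interval = integrate(uniform_pdf, 0.5, 1.0)  # P(0.5 <= X <= 1), close to 0.25
print(total_area, p_interval)
```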
If X and Y are two random variables, the
probability distribution that defines their
simultaneous behaviour during outcomes of a
random experiment is called a joint probability
distribution.
Joint Probability Distribution
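A joint distribution can be tabulated by enumerating the outcomes of an experiment and recording the values of both variables for each outcome. The sketch below assumes two fair coin tosses, with X the result of the first toss (1 for heads) and Y the total number of heads; this choice of experiment and variables is illustrative.

```python
from collections import Counter
from fractions import Fraction
from itertools import product

# All outcomes of two fair coin tosses, with 1 = heads and 0 = tails.
tosses = list(product([0, 1], repeat=2))

# X = result of the first toss, Y = total number of heads.
counts = Counter((first, first + second) for first, second in tosses)
joint = {xy: Fraction(n, len(tosses)) for xy, n in counts.items()}

for (x, y), p in sorted(joint.items()):
    print(f"P(X={x}, Y={y}) = {p}")
```

Each row gives the probability that X and Y take a particular pair of values at the same time, and the rows sum to 1.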
If Z is a random variable that depends on other
variables X and Y, then the distribution P(Z|X,Y)
is called the CPD of Z with respect to X and Y.
This means that for every possible combination of
values of X and Y we specify a probability
distribution over Z.
There are a number of operations that one can
perform on any probability distribution to get
interesting results. Two important
operations are:
Conditioning/Reduction
Marginalisation
Conditional Probability Distribution (CPD)
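One simple way to store a CPD is as a table with one row per combination of parent values, where each row is itself a distribution over Z. The sketch below assumes binary X, Y and Z; the probabilities are made-up illustrative numbers, and the names `cpd_z`, `is_valid_cpd` and `p_z_given` are hypothetical.

```python
from fractions import Fraction

# Hypothetical CPD P(Z | X, Y) for binary X, Y and Z.
# The numbers are illustrative only, not taken from any data.
cpd_z = {
    (0, 0): {0: Fraction(9, 10), 1: Fraction(1, 10)},
    (0, 1): {0: Fraction(3, 5), 1: Fraction(2, 5)},
    (1, 0): {0: Fraction(1, 2), 1: Fraction(1, 2)},
    (1, 1): {0: Fraction(1, 5), 1: Fraction(4, 5)},
}

def is_valid_cpd(cpd):
    """Every row (one per (x, y) combination) must be a distribution over Z."""
    return all(
        all(p >= 0 for p in row.values()) and sum(row.values()) == 1
        for row in cpd.values()
    )

def p_z_given(z, x, y, cpd=cpd_z):
    """Look up P(Z=z | X=x, Y=y)."""
    return cpd[(x, y)][z]

print(is_valid_cpd(cpd_z))  # True
print(p_z_given(1, 1, 1))   # 4/5
```

Unlike a joint distribution, the whole table does not sum to 1; each row does.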
Suppose we have a joint probability distribution
(JD) over n random variables X1, X2, …, Xn, and we
observe that k of them took the values
a1, a2, …, ak, so their assignment is already known.
The rows of the JD that are not consistent with
the observation can simply be removed, which
leaves us with fewer rows.
This operation is known as Reduction. Renormalising
the remaining rows so that they sum to 1 gives the
distribution conditioned on the observation.
Conditioning/Reduction
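Reduction can be sketched as filtering the rows of a joint table. The example below assumes a small joint distribution over X and Y (first toss and total heads in two fair coin tosses) and observes Y = 1. The renormalisation step, which turns the surviving rows into a conditional distribution, is the usual follow-up to reduction; the function names are hypothetical.

```python
from fractions import Fraction

variables = ("X", "Y")
# Assumed joint distribution: X = first of two fair coin tosses (1 = heads),
# Y = total number of heads.
joint = {
    (0, 0): Fraction(1, 4),
    (0, 1): Fraction(1, 4),
    (1, 1): Fraction(1, 4),
    (1, 2): Fraction(1, 4),
}

def reduce_joint(joint, variables, observed):
    """Keep only the rows consistent with `observed`, e.g. {"Y": 1}."""
    index = {name: i for i, name in enumerate(variables)}
    return {
        row: p
        for row, p in joint.items()
        if all(row[index[name]] == value for name, value in observed.items())
    }

def renormalise(rows):
    """Rescale the surviving rows so that they sum to 1."""
    total = sum(rows.values())
    if total == 0:
        raise ValueError("the observation has probability zero")
    return {row: p / total for row, p in rows.items()}

reduced = reduce_joint(joint, variables, {"Y": 1})
conditional = renormalise(reduced)
print(reduced)      # two rows remain, each with probability 1/4
print(conditional)  # after renormalising, each has probability 1/2
```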
This operation takes a probability distribution
over a large set of random variables and produces a
probability distribution over a smaller subset of
the variables.
This operation is known as marginalisation; the
variables outside the subset are summed out.
It is very useful when we have a large set of
random variables as features and we are
interested in a smaller set of variables and how
they affect the output.
Marginalisation
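For a discrete joint table, marginalisation amounts to adding up the probabilities of all rows that agree on the variables being kept. The sketch below assumes a small joint distribution over X and Y (first toss and total heads in two fair coin tosses); the function name `marginalise` and its signature are hypothetical.

```python
from collections import defaultdict
from fractions import Fraction

variables = ("X", "Y")
# Assumed joint distribution: X = first of two fair coin tosses (1 = heads),
# Y = total number of heads.
joint = {
    (0, 0): Fraction(1, 4),
    (0, 1): Fraction(1, 4),
    (1, 1): Fraction(1, 4),
    (1, 2): Fraction(1, 4),
}

def marginalise(joint, variables, keep):
    """Sum out every variable not listed in `keep`."""
    positions = [variables.index(name) for name in keep]
    result = defaultdict(Fraction)  # missing keys start at 0
    for row, p in joint.items():
        result[tuple(row[i] for i in positions)] += p
    return dict(result)

marginal_x = marginalise(joint, variables, ["X"])
marginal_y = marginalise(joint, variables, ["Y"])
print(marginal_x)  # X=0 and X=1 each have probability 1/2
print(marginal_y)  # Y=0: 1/4, Y=1: 1/2, Y=2: 1/4
```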
R-programming
Data security
Business analytics
Stay Tuned with
Topics for next Post
