SlideShare a Scribd company logo
3
Most read
5
Most read
9
Most read
Free Lessons on Artificial Intelligence and Machine Learning
kindsonthegenius.blogspot.com Click on:
For updates:
Free Lessons on Artificial Intelligence and Machine Learning
What we are going to cover:
 What is decision theory?
 Application of Decision Theory – Cancer Diagnosis
 Formal definition
 False positives/False negatives
 Minimizing misclassification
 Reducing Expected Loss
 Introduction to ROC
Click on:
For updates:
Free Lessons on Artificial Intelligence and Machine Learning
What is decision theory? What is the goal?
This is mathematical theory in the field of Machine Learning that allows us to
make optimal decisions in situations involving uncertainty.
So from the figure we can see that the goal of the physician would be to
get the highest score possible which is 100% and that is the objective of
Decision Theory, to make the most optima decision.
Free Lessons on Artificial Intelligence and Machine Learning
Application of Decision Theory – Cancer Diagnosis
Scenario 1: There is presence of cancer and the physician
decides to perform a surgery. That is 100% because its the
best decision to take.
Scenario 2: There is presence of cancer but the
physician decides not to perform a surgery. That is a
score of 0 as it is the worst case scenario and of course
the consequences would be very serious.
Scenario 3: Cancer is absent but the physician decides to perform a surgery anyway.
This is a low score but does not result in any serious consequence
Scenario 4: Cancer is absent and the physician decides not to perform a surgery.
This is a good decision as well.
Free Lessons on Artificial Intelligence and Machine Learning
Formal definition of Decision Theory
A corresponding vector t of the target variables ( which could be 1 or 0)
And the two classes C1 and C2 (C1 = presence of cancer, C2 = absence of cancer)
Consider that we have an input vector x
Let t = 1 correspond to class C1 and
t = 0 correspond to class C2
We need to determine the joint distribution p(x, Ck). Here k = 1,2. This is the same as p(x,t).
Decision theory is concerned with how to make optimal decisions given the appropriate
probabilities.
Click on:
For updates:
Free Lessons on Artificial Intelligence and Machine Learning
What are False positives/False negatives?
False Negative: There is presence of cancer but the physician decides not to perform a surgery.
That is a score of 0 as it is the worst case scenario and of course the consequences would be very
serious.
False Positive: Cancer is absent but the physician decides to perform a surgery
anyway. This is a low score but does not result in any serious consequence
Free Lessons on Artificial Intelligence and Machine Learning
Minimizing misclassification
The approach is to divide the input space into regions Rk called decision regions, one region for
each class.
Rk is assigned to class Ck
𝑝 𝑚𝑖𝑠𝑐𝑙𝑎𝑠𝑠𝑖𝑓𝑖𝑐𝑎𝑡𝑖𝑜𝑛 = 𝑝 𝐱 ∈ 𝑅2, 𝐶1 + 𝑝(𝐱 ∈ 𝑅1, 𝐶2)
𝑅1
𝑝 𝐱, 𝐶2 𝑑𝐱 +
𝑅2
𝑝 𝐱, 𝐶1 𝑑𝐱
To minimize misclassification, we must choose to assign x to which of the classes
has the smaller value of the integrand.
Consider the case of two classes C1 and C2. A mistake occurs when an input vector belonging
to R1 is assigned to C2 or vector x belonging to R2 is assigned to C1.
Free Lessons on Artificial Intelligence and Machine Learning
Reducing Expected Loss
The Loss Matrix
The Loss Matrix is a table showing the decision that was taken relative to
the true class
Assuming that for a new value of x, we assign it to class Cj whereas the real correct
class is Ck. It means we have incurred a loss Lkj, which is the k, j element of the loss
matrix.
𝐸[𝐿 ] =
𝑘 𝑗 𝑅 𝑗
𝐿 𝑘𝑗 𝑝(𝐱, 𝐶 𝑘)𝑑𝑥
The average loss function is given by the equation:
The best solution is one that minimizes the average loss function. For a given input
vector x, our uncertainty in the correct class is expressed through the joint
probability distribution p(x, Ck)
Free Lessons on Artificial Intelligence and Machine Learning
Introduction to ROC Curve
An ROC in Statistics is a graphical plot that illustrates the diagnostic ability of a
binary classifier system as its discrimination threshold is varied.
It is created by plotting the true positive rate against the false positive rate at
various threshold setting
Details of this in another presentation…
For updates:
Subscribe now for more videos
on
Artificial Intelligence and Machine Learning
Click on:

More Related Content

PDF
K - Nearest neighbor ( KNN )
PPTX
Issues in knowledge representation
PPTX
AI_Session 20 Horn clause.pptx
PPTX
Inference in First-Order Logic
PPTX
Problem solving agents
PDF
AI PPT-ALR_Unit-3-1.pdf
PPTX
Probabilistic Reasoning
PPTX
ID3 ALGORITHM
K - Nearest neighbor ( KNN )
Issues in knowledge representation
AI_Session 20 Horn clause.pptx
Inference in First-Order Logic
Problem solving agents
AI PPT-ALR_Unit-3-1.pdf
Probabilistic Reasoning
ID3 ALGORITHM

What's hot (20)

PDF
I. AO* SEARCH ALGORITHM
PPTX
knowledge representation using rules
PPTX
Dendral
PPTX
Random forest algorithm
PPTX
Dynamic Programming
PPTX
sutherland- Hodgeman Polygon clipping
PDF
L2. Evaluating Machine Learning Algorithms I
PPT
Artificial Intelligence 1 Planning In The Real World
PPT
Bayes Classification
PPTX
Directed Acyclic Graph Representation of basic blocks
PDF
I. Hill climbing algorithm II. Steepest hill climbing algorithm
PPTX
Classification Algorithm.
PPT
Heuristic Search Techniques Unit -II.ppt
PPTX
Analytical learning
PPTX
Evaluating hypothesis
PDF
Artificial Intelligence - Hill climbing.
PDF
Confusion Matrix Explained
PPTX
Using prior knowledge to initialize the hypothesis,kbann
PPTX
AI_Session 10 Local search in continious space.pptx
PPTX
Naïve Bayes Classifier Algorithm.pptx
I. AO* SEARCH ALGORITHM
knowledge representation using rules
Dendral
Random forest algorithm
Dynamic Programming
sutherland- Hodgeman Polygon clipping
L2. Evaluating Machine Learning Algorithms I
Artificial Intelligence 1 Planning In The Real World
Bayes Classification
Directed Acyclic Graph Representation of basic blocks
I. Hill climbing algorithm II. Steepest hill climbing algorithm
Classification Algorithm.
Heuristic Search Techniques Unit -II.ppt
Analytical learning
Evaluating hypothesis
Artificial Intelligence - Hill climbing.
Confusion Matrix Explained
Using prior knowledge to initialize the hypothesis,kbann
AI_Session 10 Local search in continious space.pptx
Naïve Bayes Classifier Algorithm.pptx
Ad

Similar to Basics of Statistical decision theory (20)

PPTX
lec+5+_part+1 cloud .pptx
PDF
Machine learning in credit risk modeling : a James white paper
PPT
Data Analysison Regression
PDF
Optimistic decision making using an
PPT
case control and cohort Sample estimation-DESKTOP-NGRKQF9.ppt
PDF
Faster and cheaper, smart ab experiments - public ver.
PPTX
Lec. 10: Making Assumptions of Missing data
PPTX
Machine Learning (Classification Models)
PPTX
Advanced Econometrics L5-6.pptx
PPTX
6._lect_Expected_utility__exceptions_(2).pptx
PPT
GraphDiseaseSpreadModels-Threshold&FirefighterCapeTown6-10-07
PDF
An Efficient System for Cancer Detection using Digital Mammograms
PPT
AI Lecture 7 (uncertainty)
PDF
V. pacáková, d. brebera
PPTX
Finding-the-Mean-of-Discrete-Random-Variable-1.pptx
PDF
Adaptive Machine Learning for Credit Card Fraud Detection
PPTX
Calibration of risk prediction models: decision making with the lights on or ...
PDF
Classification methods and assessment
PPTX
Machine learning ( Part 1 )
PPTX
QNT 561 Week 4 Weekly Learning Assessments
lec+5+_part+1 cloud .pptx
Machine learning in credit risk modeling : a James white paper
Data Analysison Regression
Optimistic decision making using an
case control and cohort Sample estimation-DESKTOP-NGRKQF9.ppt
Faster and cheaper, smart ab experiments - public ver.
Lec. 10: Making Assumptions of Missing data
Machine Learning (Classification Models)
Advanced Econometrics L5-6.pptx
6._lect_Expected_utility__exceptions_(2).pptx
GraphDiseaseSpreadModels-Threshold&FirefighterCapeTown6-10-07
An Efficient System for Cancer Detection using Digital Mammograms
AI Lecture 7 (uncertainty)
V. pacáková, d. brebera
Finding-the-Mean-of-Discrete-Random-Variable-1.pptx
Adaptive Machine Learning for Credit Card Fraud Detection
Calibration of risk prediction models: decision making with the lights on or ...
Classification methods and assessment
Machine learning ( Part 1 )
QNT 561 Week 4 Weekly Learning Assessments
Ad

Recently uploaded (20)

PDF
Cosmic Outliers: Low-spin Halos Explain the Abundance, Compactness, and Redsh...
PPTX
Taita Taveta Laboratory Technician Workshop Presentation.pptx
PDF
SEHH2274 Organic Chemistry Notes 1 Structure and Bonding.pdf
PPTX
neck nodes and dissection types and lymph nodes levels
PDF
lecture 2026 of Sjogren's syndrome l .pdf
PDF
. Radiology Case Scenariosssssssssssssss
PPTX
TOTAL hIP ARTHROPLASTY Presentation.pptx
PPTX
2Systematics of Living Organisms t-.pptx
PPTX
ANEMIA WITH LEUKOPENIA MDS 07_25.pptx htggtftgt fredrctvg
PDF
The scientific heritage No 166 (166) (2025)
PDF
HPLC-PPT.docx high performance liquid chromatography
PPTX
BIOMOLECULES PPT........................
PDF
Placing the Near-Earth Object Impact Probability in Context
PDF
CAPERS-LRD-z9:AGas-enshroudedLittleRedDotHostingaBroad-lineActive GalacticNuc...
PDF
An interstellar mission to test astrophysical black holes
PDF
ELS_Q1_Module-11_Formation-of-Rock-Layers_v2.pdf
PDF
Biophysics 2.pdffffffffffffffffffffffffff
PPTX
G5Q1W8 PPT SCIENCE.pptx 2025-2026 GRADE 5
DOCX
Q1_LE_Mathematics 8_Lesson 5_Week 5.docx
PPTX
7. General Toxicologyfor clinical phrmacy.pptx
Cosmic Outliers: Low-spin Halos Explain the Abundance, Compactness, and Redsh...
Taita Taveta Laboratory Technician Workshop Presentation.pptx
SEHH2274 Organic Chemistry Notes 1 Structure and Bonding.pdf
neck nodes and dissection types and lymph nodes levels
lecture 2026 of Sjogren's syndrome l .pdf
. Radiology Case Scenariosssssssssssssss
TOTAL hIP ARTHROPLASTY Presentation.pptx
2Systematics of Living Organisms t-.pptx
ANEMIA WITH LEUKOPENIA MDS 07_25.pptx htggtftgt fredrctvg
The scientific heritage No 166 (166) (2025)
HPLC-PPT.docx high performance liquid chromatography
BIOMOLECULES PPT........................
Placing the Near-Earth Object Impact Probability in Context
CAPERS-LRD-z9:AGas-enshroudedLittleRedDotHostingaBroad-lineActive GalacticNuc...
An interstellar mission to test astrophysical black holes
ELS_Q1_Module-11_Formation-of-Rock-Layers_v2.pdf
Biophysics 2.pdffffffffffffffffffffffffff
G5Q1W8 PPT SCIENCE.pptx 2025-2026 GRADE 5
Q1_LE_Mathematics 8_Lesson 5_Week 5.docx
7. General Toxicologyfor clinical phrmacy.pptx

Basics of Statistical decision theory

  • 1. Free Lessons on Artificial Intelligence and Machine Learning kindsonthegenius.blogspot.com Click on: For updates:
  • 2. Free Lessons on Artificial Intelligence and Machine Learning What we are going to cover:  What is decision theory?  Application of Decision Theory – Cancer Diagnosis  Formal definition  False positives/False negatives  Minimizing misclassification  Reducing Expected Loss  Introduction to ROC Click on: For updates:
  • 3. Free Lessons on Artificial Intelligence and Machine Learning What is decision theory? What is the goal? This is mathematical theory in the field of Machine Learning that allows us to make optimal decisions in situations involving uncertainty. So from the figure we can see that the goal of the physician would be to get the highest score possible which is 100% and that is the objective of Decision Theory, to make the most optima decision.
  • 4. Free Lessons on Artificial Intelligence and Machine Learning Application of Decision Theory – Cancer Diagnosis Scenario 1: There is presence of cancer and the physician decides to perform a surgery. That is 100% because its the best decision to take. Scenario 2: There is presence of cancer but the physician decides not to perform a surgery. That is a score of 0 as it is the worst case scenario and of course the consequences would be very serious. Scenario 3: Cancer is absent but the physician decides to perform a surgery anyway. This is a low score but does not result in any serious consequence Scenario 4: Cancer is absent and the physician decides not to perform a surgery. This is a good decision as well.
  • 5. Free Lessons on Artificial Intelligence and Machine Learning Formal definition of Decision Theory A corresponding vector t of the target variables ( which could be 1 or 0) And the two classes C1 and C2 (C1 = presence of cancer, C2 = absence of cancer) Consider that we have an input vector x Let t = 1 correspond to class C1 and t = 0 correspond to class C2 We need to determine the joint distribution p(x, Ck). Here k = 1,2. This is the same as p(x,t). Decision theory is concerned with how to make optimal decisions given the appropriate probabilities. Click on: For updates:
  • 6. Free Lessons on Artificial Intelligence and Machine Learning What are False positives/False negatives? False Negative: There is presence of cancer but the physician decides not to perform a surgery. That is a score of 0 as it is the worst case scenario and of course the consequences would be very serious. False Positive: Cancer is absent but the physician decides to perform a surgery anyway. This is a low score but does not result in any serious consequence
  • 7. Free Lessons on Artificial Intelligence and Machine Learning Minimizing misclassification The approach is to divide the input space into regions Rk called decision regions, one region for each class. Rk is assigned to class Ck 𝑝 𝑚𝑖𝑠𝑐𝑙𝑎𝑠𝑠𝑖𝑓𝑖𝑐𝑎𝑡𝑖𝑜𝑛 = 𝑝 𝐱 ∈ 𝑅2, 𝐶1 + 𝑝(𝐱 ∈ 𝑅1, 𝐶2) 𝑅1 𝑝 𝐱, 𝐶2 𝑑𝐱 + 𝑅2 𝑝 𝐱, 𝐶1 𝑑𝐱 To minimize misclassification, we must choose to assign x to which of the classes has the smaller value of the integrand. Consider the case of two classes C1 and C2. A mistake occurs when an input vector belonging to R1 is assigned to C2 or vector x belonging to R2 is assigned to C1.
  • 8. Free Lessons on Artificial Intelligence and Machine Learning Reducing Expected Loss The Loss Matrix The Loss Matrix is a table showing the decision that was taken relative to the true class Assuming that for a new value of x, we assign it to class Cj whereas the real correct class is Ck. It means we have incurred a loss Lkj, which is the k, j element of the loss matrix. 𝐸[𝐿 ] = 𝑘 𝑗 𝑅 𝑗 𝐿 𝑘𝑗 𝑝(𝐱, 𝐶 𝑘)𝑑𝑥 The average loss function is given by the equation: The best solution is one that minimizes the average loss function. For a given input vector x, our uncertainty in the correct class is expressed through the joint probability distribution p(x, Ck)
  • 9. Free Lessons on Artificial Intelligence and Machine Learning Introduction to ROC Curve An ROC in Statistics is a graphical plot that illustrates the diagnostic ability of a binary classifier system as its discrimination threshold is varied. It is created by plotting the true positive rate against the false positive rate at various threshold setting Details of this in another presentation… For updates:
  • 10. Subscribe now for more videos on Artificial Intelligence and Machine Learning Click on: