SlideShare a Scribd company logo
Method to Improve Breast
Cancer Diagnosis
Anoop Vasant Kumar
Problem
Preventable medical error is a big killer.
In the US alone, 400,000 people die every year due to
avoidable medical error in hospitals - this is equivalent
to TWO JUMBO JETS crashing every single day!
--
NHS sets aside 26.1 Billion Dollars to settle outstanding
negligences and liabilities in clinical safety.
Causes of Avoidable Medical Errors
Procedures and training methods not reformed,
so mistakes happen again and again.
Features of the Dataset - Labelled 699 clinical cases
Nine real-valued features are chosen for each cell nucleus:
a) radius (mean of distances from center to points on the perimeter)
b) texture (standard deviation of gray-scale values)
c) perimeter
d) area
e) smoothness (local variation in radius lengths)
f) compactness (perimeter^2 / area - 1.0)
g) concavity (severity of concave portions of the contour)
h) concave points (number of concave portions of the contour)
i) symmetry
Some approaches to solving anomaly detection problem
683 cases of labelled data - benign/malignant
Imbalanced dataset
Spot check feature histograms
Visualizing Classification
Classification using Logistic Regression achieved an F1 score
of 0.95 on the anomaly class.
Classification by model on unseen data
Actual data
KNN - non parametric model to verify classification
Scaled feature vector
Identified precise k value using elbow method
For k = 3
We had an F1 score
0.95 for the anomaly
class
Unsupervised Anomaly Detection using SVM - Gaussian Kernel Trick
1)Objective is to train a one class svm gaussian
hypersphere that quarantines the benign cells.
2)Dropped labels from dataset and is split into benign and
malignant datasets.
3)Benign dataset is used to train the model.
4)Malignant dataset, the dataset that contains the outliers
is used to test.
5)A single class SVM is trained with a low gamma value,
that captures the influence of training examples on
classification.
Gaussian Distribution for benign and malignant cells
Benign multivariate gaussian Malignant multivariate
gaussian
Anomaly Detection using SIngle Class SVM with Gaussian Kernel
Single class SVM with gaussian trick - 100% Accuracy

More Related Content

PDF
Ki-67 for further classification of triple negative breast cancer
DOCX
Computer aided diagnosis of mammographic masses using scalable image retrieval
PPTX
Breastcancer_lbp_poster
PPT
Combined Raman spectroscopy and autofluorescence method for tumors research
PDF
GLIIFCA 22 final (1)
PPS
Recent Advances in Pathologic Evaluation of Melanoma Sentinel Lymph Nodes. Sl...
PDF
Lecture10 outilier l0_svdd
PDF
Lecture6 svdd
Ki-67 for further classification of triple negative breast cancer
Computer aided diagnosis of mammographic masses using scalable image retrieval
Breastcancer_lbp_poster
Combined Raman spectroscopy and autofluorescence method for tumors research
GLIIFCA 22 final (1)
Recent Advances in Pathologic Evaluation of Melanoma Sentinel Lymph Nodes. Sl...
Lecture10 outilier l0_svdd
Lecture6 svdd

Similar to Anomaly Detection using SIngle Class SVM with Gaussian Kernel (20)

PPTX
Machine Learning - Breast Cancer Diagnosis
PPTX
Cancer detection using data mining
PPTX
Breast Cancer Risk Prediction: Leveraging Data for Early Detection and Preven...
PPTX
Wisconsin Breast Cancer dataset.pptx
PDF
My own Machine Learning project - Breast Cancer Prediction
PDF
Impact.Tech "Statistical Literacy for Deep Tech"
PDF
Breast cancer diagnosis and recurrence prediction using machine learning tech...
PPTX
Breast cancer diagnosis machine learning ppt
PPTX
Predicting Breast Cancer
PPTX
wisconsinbreastcancerdataset-230822052318-0641b91a.pptx
PDF
breastcancerdiagnosismachinelearningppt-181123031720.pdf
PDF
"Statistical Literacy for Deep Tech" by Noel Jee
PPT
2010 Spring, Bioinformatics II Presentation
PDF
Breast Cancer Classification.pdf
PDF
Ensemble Classifier Approach in Breast Cancer Detection and Malignancy Gradin...
PDF
IRJET- Breast Cancer Prediction using Support Vector Machine
PPTX
Breast cancer classification
PDF
Breast Tumor Detection Using Efficient Machine Learning and Deep Learning Tec...
PDF
BREAST TUMOR DETECTION USING EFFICIENT MACHINE LEARNING AND DEEP LEARNING TEC...
PDF
Breast Tumor Detection Using Efficient Machine Learning and Deep Learning Tec...
Machine Learning - Breast Cancer Diagnosis
Cancer detection using data mining
Breast Cancer Risk Prediction: Leveraging Data for Early Detection and Preven...
Wisconsin Breast Cancer dataset.pptx
My own Machine Learning project - Breast Cancer Prediction
Impact.Tech "Statistical Literacy for Deep Tech"
Breast cancer diagnosis and recurrence prediction using machine learning tech...
Breast cancer diagnosis machine learning ppt
Predicting Breast Cancer
wisconsinbreastcancerdataset-230822052318-0641b91a.pptx
breastcancerdiagnosismachinelearningppt-181123031720.pdf
"Statistical Literacy for Deep Tech" by Noel Jee
2010 Spring, Bioinformatics II Presentation
Breast Cancer Classification.pdf
Ensemble Classifier Approach in Breast Cancer Detection and Malignancy Gradin...
IRJET- Breast Cancer Prediction using Support Vector Machine
Breast cancer classification
Breast Tumor Detection Using Efficient Machine Learning and Deep Learning Tec...
BREAST TUMOR DETECTION USING EFFICIENT MACHINE LEARNING AND DEEP LEARNING TEC...
Breast Tumor Detection Using Efficient Machine Learning and Deep Learning Tec...
Ad

Recently uploaded (20)

PPTX
DISORDERS OF THE LIVER, GALLBLADDER AND PANCREASE (1).pptx
PPTX
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
PDF
Foundation of Data Science unit number two notes
PPTX
oil_refinery_comprehensive_20250804084928 (1).pptx
PPTX
AI Strategy room jwfjksfksfjsjsjsjsjfsjfsj
PPTX
Business Ppt On Nestle.pptx huunnnhhgfvu
PDF
“Getting Started with Data Analytics Using R – Concepts, Tools & Case Studies”
PPTX
Qualitative Qantitative and Mixed Methods.pptx
PDF
Mega Projects Data Mega Projects Data
PPTX
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...
PDF
TRAFFIC-MANAGEMENT-AND-ACCIDENT-INVESTIGATION-WITH-DRIVING-PDF-FILE.pdf
PPTX
climate analysis of Dhaka ,Banglades.pptx
PDF
BF and FI - Blockchain, fintech and Financial Innovation Lesson 2.pdf
PPTX
01_intro xxxxxxxxxxfffffffffffaaaaaaaaaaafg
PPTX
Acceptance and paychological effects of mandatory extra coach I classes.pptx
PPTX
Data_Analytics_and_PowerBI_Presentation.pptx
PPTX
Introduction to Knowledge Engineering Part 1
PDF
Fluorescence-microscope_Botany_detailed content
PPTX
MODULE 8 - DISASTER risk PREPAREDNESS.pptx
DISORDERS OF THE LIVER, GALLBLADDER AND PANCREASE (1).pptx
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
Foundation of Data Science unit number two notes
oil_refinery_comprehensive_20250804084928 (1).pptx
AI Strategy room jwfjksfksfjsjsjsjsjfsjfsj
Business Ppt On Nestle.pptx huunnnhhgfvu
“Getting Started with Data Analytics Using R – Concepts, Tools & Case Studies”
Qualitative Qantitative and Mixed Methods.pptx
Mega Projects Data Mega Projects Data
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...
TRAFFIC-MANAGEMENT-AND-ACCIDENT-INVESTIGATION-WITH-DRIVING-PDF-FILE.pdf
climate analysis of Dhaka ,Banglades.pptx
BF and FI - Blockchain, fintech and Financial Innovation Lesson 2.pdf
01_intro xxxxxxxxxxfffffffffffaaaaaaaaaaafg
Acceptance and paychological effects of mandatory extra coach I classes.pptx
Data_Analytics_and_PowerBI_Presentation.pptx
Introduction to Knowledge Engineering Part 1
Fluorescence-microscope_Botany_detailed content
MODULE 8 - DISASTER risk PREPAREDNESS.pptx
Ad

Anomaly Detection using SIngle Class SVM with Gaussian Kernel

  • 1. Method to Improve Breast Cancer Diagnosis Anoop Vasant Kumar
  • 2. Problem Preventable medical error is a big killer. In the US alone, 400,000 people die every year due to avoidable medical error in hospitals - this is equivalent to TWO JUMBO JETS crashing every single day! -- NHS sets aside 26.1 Billion Dollars to settle outstanding negligences and liabilities in clinical safety.
  • 3. Causes of Avoidable Medical Errors Procedures and training methods not reformed, so mistakes happen again and again.
  • 4. Features of the Dataset - Labelled 699 clinical cases Nine real-valued features are chosen for each cell nucleus: a) radius (mean of distances from center to points on the perimeter) b) texture (standard deviation of gray-scale values) c) perimeter d) area e) smoothness (local variation in radius lengths) f) compactness (perimeter^2 / area - 1.0) g) concavity (severity of concave portions of the contour) h) concave points (number of concave portions of the contour) i) symmetry
  • 5. Some approaches to solving anomaly detection problem
  • 6. 683 cases of labelled data - benign/malignant Imbalanced dataset
  • 7. Spot check feature histograms
  • 8. Visualizing Classification Classification using Logistic Regression achieved an F1 score of 0.95 on the anomaly class. Classification by model on unseen data Actual data
  • 9. KNN - non parametric model to verify classification Scaled feature vector Identified precise k value using elbow method For k = 3 We had an F1 score 0.95 for the anomaly class
  • 10. Unsupervised Anomaly Detection using SVM - Gaussian Kernel Trick 1)Objective is to train a one class svm gaussian hypersphere that quarantines the benign cells. 2)Dropped labels from dataset and is split into benign and malignant datasets. 3)Benign dataset is used to train the model. 4)Malignant dataset, the dataset that contains the outliers is used to test. 5)A single class SVM is trained with a low gamma value, that captures the influence of training examples on classification.
  • 11. Gaussian Distribution for benign and malignant cells Benign multivariate gaussian Malignant multivariate gaussian
  • 13. Single class SVM with gaussian trick - 100% Accuracy