Heart
Disease
Prediction
• Made By:
• Mayur Sawaisarje (BECOC367)
• Shreyas Harnale (BECOC318)
• Nishant Indalkar (BECOC320)
Outline:
• Problem Definition
• Motivation
• Requirements
• Dataset Description
• Conclusion
Problem
Statement
TO EFFECTIVELY
CLASSIFY HEART
DISEASES USING
DIFFERENT
ALGORITHMS
Motivation
• The main motivation of this project is :
1. According to WHO, by 2030 at least 23.6 million people will be suffering from
heart diseases.
2. To reduce this number we want to help doctors identify patients with heart disease.
3. To identify the symptoms and problems faced by patients of heart diseases.
4. To develop efficient model to predict heart disease using machine learning
algorithms.
Requirements
• Technology: Python
• Libraries: scikit, pandas, numpy, matplotlib
• Operating System: Windows
• IDE: Jupyter Notebook/Google Colab
Dataset
• The dataset consists of data from 304 people.
• It has 14 different attributes.
• The attributes include: age, sex, cholesterol, ecg, etc.
K-Nearest Neighbour
• K-Nearest Neighbour is one of the simplest Machine Learning
algorithms based on Supervised Learning technique
• K-NN algorithm assumes the similarity between the new
case/data and available cases and put the new case into
the category that is most similar to the available categories.
• K-NN algorithm stores all the available data and classifies a
new data point based on the similarity. This means when new
data appears then it can be easily classified into a well suite
category by using K- NN algorithm.
K-Nearest Neighbour
Random Forest
• As the name suggests, "Random Forest is a classifier that
contains a number of decision trees on various subsets of
the given dataset and takes the average to improve the
predictive accuracy of that dataset."
Decision Tree
• Non-linear classifier
• Easy to use
• Easy to interpret
• Susceptible to overfitting but can be avoided.
Anatomy of a decision tree
overcast
high normal false
true
sunny
rain
No No
Yes Yes
Yes
Outlook
Humidity
Windy
Each node is a test on
one attribute
Possible attribute values
of the node
Leafs are the
decisions
Conclusion
• By using the above methods we can classify whether the
person has a heart disease or not.

More Related Content

PPTX
Emotion_Recognition_using_Brainwave_Datasets_Bethlehem_Seid (1).pptx
PDF
HEALTH PREDICTION ANALYSIS USING DATA MINING
PDF
Hybrid Technique for Associative Classification of Heart Diseases
PDF
Prognosis of Cardiac Disease using Data Mining Techniques A Comprehensive Survey
PDF
Data preprocessing and unsupervised learning methods in Bioinformatics
PPTX
prediction of heart disease using machine learning
PPTX
Stroke Prediction
PPTX
Researc-paper_Project Work Phase-1 PPT (21CS09).pptx
Emotion_Recognition_using_Brainwave_Datasets_Bethlehem_Seid (1).pptx
HEALTH PREDICTION ANALYSIS USING DATA MINING
Hybrid Technique for Associative Classification of Heart Diseases
Prognosis of Cardiac Disease using Data Mining Techniques A Comprehensive Survey
Data preprocessing and unsupervised learning methods in Bioinformatics
prediction of heart disease using machine learning
Stroke Prediction
Researc-paper_Project Work Phase-1 PPT (21CS09).pptx

Similar to DMW PPT created for engg submission in college (20)

PPTX
ACHIVERS TEAM _PPT__ML.pptx
PDF
E04733639
PPTX
Project PPT(N) for a good buiness development ND BITCOIN.pptx
PDF
H0342044046
PPTX
ML-Lecture06_07.pptxxxx bjkf mlllllllllll
PPTX
Final_Presentation.pptx
DOCX
10_PPT__ML.docx
PPTX
Presentation
PDF
Paper id 212014112
PDF
Biological Foundations for Deep Learning: Towards Decision Networks
PPTX
Module -3 expert system.pptx
PPTX
data mining using SQL server Analysis services
PPTX
heart final last sem.pptx
PPTX
first review.pptxgghggggvvvvbbvvvvvhhjjjbbvvvvbbbbbhhhhhhhhhbbh
PPTX
PPT.pptx
PPTX
Visual tools for databade queries and analysis
PPTX
Computer-Aided Detection (1).pptx
PPTX
Big Data at Geisinger Health System: Big Wins in a Short Time
PPTX
Introduction Data collection Attributes used Technology used Library used Dat...
PDF
A Survey on Heart Disease Prediction Techniques
ACHIVERS TEAM _PPT__ML.pptx
E04733639
Project PPT(N) for a good buiness development ND BITCOIN.pptx
H0342044046
ML-Lecture06_07.pptxxxx bjkf mlllllllllll
Final_Presentation.pptx
10_PPT__ML.docx
Presentation
Paper id 212014112
Biological Foundations for Deep Learning: Towards Decision Networks
Module -3 expert system.pptx
data mining using SQL server Analysis services
heart final last sem.pptx
first review.pptxgghggggvvvvbbvvvvvhhjjjbbvvvvbbbbbhhhhhhhhhbbh
PPT.pptx
Visual tools for databade queries and analysis
Computer-Aided Detection (1).pptx
Big Data at Geisinger Health System: Big Wins in a Short Time
Introduction Data collection Attributes used Technology used Library used Dat...
A Survey on Heart Disease Prediction Techniques
Ad

Recently uploaded (20)

PPTX
Management Information system : MIS-e-Business Systems.pptx
PPTX
Information Storage and Retrieval Techniques Unit III
PPTX
Feature types and data preprocessing steps
PDF
Unit I -OPERATING SYSTEMS_SRM_KATTANKULATHUR.pptx.pdf
PDF
Exploratory_Data_Analysis_Fundamentals.pdf
PDF
Accra-Kumasi Expressway - Prefeasibility Report Volume 1 of 7.11.2018.pdf
PPTX
PRASUNET_20240614003_231416_0000[1].pptx
PPTX
wireless networks, mobile computing.pptx
PPT
Chapter 1 - Introduction to Manufacturing Technology_2.ppt
PPTX
A Brief Introduction to IoT- Smart Objects: The "Things" in IoT
PDF
20250617 - IR - Global Guide for HR - 51 pages.pdf
PPTX
Petroleum Refining & Petrochemicals.pptx
PPTX
"Array and Linked List in Data Structures with Types, Operations, Implementat...
DOC
T Pandian CV Madurai pandi kokkaf illaya
PDF
Computer organization and architecuture Digital Notes....pdf
PPTX
Software Engineering and software moduleing
PPTX
ASME PCC-02 TRAINING -DESKTOP-NLE5HNP.pptx
PPTX
CONTRACTS IN CONSTRUCTION PROJECTS: TYPES
PDF
Influence of Green Infrastructure on Residents’ Endorsement of the New Ecolog...
PDF
distributed database system" (DDBS) is often used to refer to both the distri...
Management Information system : MIS-e-Business Systems.pptx
Information Storage and Retrieval Techniques Unit III
Feature types and data preprocessing steps
Unit I -OPERATING SYSTEMS_SRM_KATTANKULATHUR.pptx.pdf
Exploratory_Data_Analysis_Fundamentals.pdf
Accra-Kumasi Expressway - Prefeasibility Report Volume 1 of 7.11.2018.pdf
PRASUNET_20240614003_231416_0000[1].pptx
wireless networks, mobile computing.pptx
Chapter 1 - Introduction to Manufacturing Technology_2.ppt
A Brief Introduction to IoT- Smart Objects: The "Things" in IoT
20250617 - IR - Global Guide for HR - 51 pages.pdf
Petroleum Refining & Petrochemicals.pptx
"Array and Linked List in Data Structures with Types, Operations, Implementat...
T Pandian CV Madurai pandi kokkaf illaya
Computer organization and architecuture Digital Notes....pdf
Software Engineering and software moduleing
ASME PCC-02 TRAINING -DESKTOP-NLE5HNP.pptx
CONTRACTS IN CONSTRUCTION PROJECTS: TYPES
Influence of Green Infrastructure on Residents’ Endorsement of the New Ecolog...
distributed database system" (DDBS) is often used to refer to both the distri...
Ad

DMW PPT created for engg submission in college

  • 1. Heart Disease Prediction • Made By: • Mayur Sawaisarje (BECOC367) • Shreyas Harnale (BECOC318) • Nishant Indalkar (BECOC320)
  • 2. Outline: • Problem Definition • Motivation • Requirements • Dataset Description • Conclusion
  • 4. Motivation • The main motivation of this project is : 1. According to WHO, by 2030 at least 23.6 million people will be suffering from heart diseases. 2. To reduce this number we want to help doctors identify patients with heart disease. 3. To identify the symptoms and problems faced by patients of heart diseases. 4. To develop efficient model to predict heart disease using machine learning algorithms.
  • 5. Requirements • Technology: Python • Libraries: scikit, pandas, numpy, matplotlib • Operating System: Windows • IDE: Jupyter Notebook/Google Colab
  • 6. Dataset • The dataset consists of data from 304 people. • It has 14 different attributes. • The attributes include: age, sex, cholesterol, ecg, etc.
  • 7. K-Nearest Neighbour • K-Nearest Neighbour is one of the simplest Machine Learning algorithms based on Supervised Learning technique • K-NN algorithm assumes the similarity between the new case/data and available cases and put the new case into the category that is most similar to the available categories. • K-NN algorithm stores all the available data and classifies a new data point based on the similarity. This means when new data appears then it can be easily classified into a well suite category by using K- NN algorithm.
  • 9. Random Forest • As the name suggests, "Random Forest is a classifier that contains a number of decision trees on various subsets of the given dataset and takes the average to improve the predictive accuracy of that dataset."
  • 10. Decision Tree • Non-linear classifier • Easy to use • Easy to interpret • Susceptible to overfitting but can be avoided.
  • 11. Anatomy of a decision tree overcast high normal false true sunny rain No No Yes Yes Yes Outlook Humidity Windy Each node is a test on one attribute Possible attribute values of the node Leafs are the decisions
  • 12. Conclusion • By using the above methods we can classify whether the person has a heart disease or not.