SlideShare a Scribd company logo
FEATURE BASE HEART DISEASE
PREDICTION
USING RFCNN ADVANCE MACHINE LEARNING ALGORITHMS
STUDENT DETAILS:
QAMAR BEGUM
ROLL NO. 161022742009
M. E, COMPUTER SCIENCE
PROJECT GUIDE:
Dr.MOHAMMED SANAULLAH QASEEM
PROFESSOR & HOD
CSE DEPARTMENT
USING RFCNN ADVANCE MACHINE LEARNING ALGORITHM
FEATURE BASE HEART DISEASE
PREDICTION
TABLE OF CONTENT
• Abstract
• Existing system
• Disadvantages
• Proposed System
• Libraries
• Test Accuracy
• Results
• Application
• Conclusion
• Future Scope
• Literature survey
• System requirement
• Dendrogram
• Use case Diagram
• System
Implementation
ABSTRACT
• Heart disease is one of the leading causes of death worldwide,
• Predictive healthcare solutions frequently use machine learning (ML) techniques.
But the accuracy of the majority of conventional ML models is less than 80%.
• This procedure serves as a feature selector, identifying the most important
features from the dataset and eliminating irrelevant or noisy ones.
• Results will be useful to recognize patterns, correlations, and interactions among
the selected features.
EXISTING SYSTEM
• Lack of feature selection and data representation limits their effectiveness.
• Most models are sensitive to noise and overfitting.
• Accuracy is often below 80%, which is insufficient for critical
medical predictions.
• Less work on analysis identifying the most important features
from the dataset
DISADVANTAGES
• Inadequate feature selection results in poor data representation.
• Limited ability to capture non-linear patterns in data.
• High sensitivity to noise and outliers in the dataset.
• Overfitting to training data, leading to poor generalization.
PROPOSED SYSTEM
• We will be understanding advanced techniques to address the limitations of
existing methods
• We will be able to understand predictions from multiple models to improve
robustness.
• Algorithm designed for high accuracy and scalability.
• Ensures diversity in predictions by splitting data more randomly.
• This ensemble approach leverages the strengths of each algorithm to deliver higher
accuracy, precision, and recall for heart disease prediction
Training Data
Instance
Training each decision tree on a random subset
Bagging (majority)
Prediction Output
...............
Class A Class A class B
pOutput
RF
ALGORITHMS
Step 1: Select random K data points from the training set.
Step 2:Build the decision trees associated with the selected data
points(Subsets).
Step 3:Choose the number N for decision trees that you want to
build.
Step 4:Repeat Step 1 and 2.
Step 5: For new data points, find the predictions of each decision
tree, and assign the new data points to the category
STEPS
DECISION TREE
input layer
pooling
Layer
convolution layers
flatten
layer
output layer
CNN LAYERS
Feature
Selection
Data
Preprocessing
Training
set
Testing
set
Heart disease
data set
Data
Splitting
Model Training
RF CNN
RFCNN Trained
Model
Model Evaluation
Heart Disease
severity Prediction
LIBRARIES
• Tensorflow-Building & training neural network
• Numpy -Numerical computations
• Flask- lightweight Python web framework
• Keras- implementation of neural networks easy
• sklearn-Data preprocessing, evaluation
• matplotlib-Data visualization
LITERATURE SURVEY
PAPER TITLE AND YEAR ALGORITHMS ADVANTAGES LIMITATIONS
Deep Learning Neural
Networks for Predictive
Healthcare
2020
Deep Learning
Neural Networks
(DLNN)
• High accuracy due
to automated
feature extraction.
• Computationally
expensive.
• Requires extensive
labeled data for
training.
Genetic Algorithm and TRFNN
for Optimized Healthcare
Predictions
2021
Genetic
Algorithms (GA)
with Transfer
Function Neural
Networks
(TRFNN)
• Improved feature
selection
enhances model
accuracy.
• Adaptive learning
process.
• Computational
overhead due to
GA.
• Risk of local optima
in feature selection.
LITERATURE SURVEY
PAPER TITLE AND YEAR ALGORITHMS ADVANTAGES LIMITATIONS
Hybrid SCNN-SVM for Image-
Based Disease Detection
2022
Sparse
Convolutional Neural
Network (SCNN)
with Support Vector
Machines (SVM)
• Reduces
computational
cost with
sparse
representations
.
• Limited scalability for
very large datasets.
• Complexity in model
integration.
PFFBPNN for Predictive
Analytics in Healthcare
2020
Partial Feedback
Feedforward
Backpropagation
Neural Networks
(PFFBPNN)
• Reduces
training time
compared to
standard
backpropagatio
n.
• .Sensitive to
hyperparameter tuning.
• Requires large memory
resources for
computation.
LITERATURE SURVEY
PAPER TITLE AND YEAR ALGORITHMS ADVANTAGES LIMITATIONS
IoT-Enabled Random Forest
Model for Remote Health
Monitoring 2021
IoT and Random
Forest for real-
time monitoring
• Handles high-
dimensional data
effectively.
• Limited by IoT
device bandwidth.
• Dependent on
stable internet
connectivity
SOFTWARE
REQUIREMENTS
HARDWARE
REQUIREMENTS
• OS: WINDOWS
• PYTHON : PYTHON 3.X AND ABOVE
• SETUP TOOLS AND PIP TO BE
INSTALLED FOR 3.6.X AND ABOVE
•RAM : 1GB AND HIGHER
•Processor : Intel i3 and above
•Hard Disk : 5GB minimum is required
DENDROGRAM
Heart Disease Clinical Data
Data Processing
Feature selection
Feature Normalization
Classification
Heart Disease
Absent
Heart disease
present
SYSTEM IMPLEMENTATION
RFCNN Algorithm Steps:
• Hybrid Approach
• Feature Reduction
• Combination of Strengths
• Feature Selection process
HYBRID APPROACH
Random Forest serves as a feature selector.
• Identifying the most important features.
• Eliminating irrelevant or noisy ones. .
CNN is used as the classification engine.
• Recognize patterns
• Recognize correlations
• Interactions among the selected features
p(2
)
p(1
)
p(3
)
p(4
)
p(2
)
p(1
)
p(3
)
p(4
)
P(1)=
P(2)=
p(3)=
P(4)=
Final Prediction {p(1),P(2),p(3),p(4)}
Instance
Machine
Learning
Deep Learning
RF CN
N
RF + PCNN)
P
(
RF + PCNN)
P
(
RF + PCNN)
P
(
RF + PCNN)
P
(
/2
/2
/2
/2
FEATURE REDUCTION
• Random Forest calculates feature importance scores
and selects the top-N features
• This reduces dimensionality and improves Convolution
Neural Network efficiency by focusing only on
meaningful data.
COMBINATION OF STRENGTHS
• Random Forest is robust to overfitting due to its
ensemble structure, making it effective for feature
selection.
• The hybrid model can manage complex relationships
in the data since CNN is excellent at learning complex,
hierarchical patterns.
FEATURE SELECTION
Step:1 Train the Random Forest Model
step:2 Compute Feature Importance
Step:3 Rank Features by Importance
Step:4 Select Top Features
Step:5 Feature Subset
Input: The complete dataset with all features and the target variable.
RESULTS
TEST ACCURACY
RF CNN Accuracy Graph 83 percent accuracy
RFCNN compared with CNN
SPLIT-80:20
80 20
15 90
confusion matrix for RCNN
Predicted Label
Expected
label
100
80
60
40
20
0
No Heart Disease
Heart Disease
No Heart Disease Heart Disease
APPLICATIONS
Accurately detect
abnormalities
Analyze vast
amounts of patient
data
Predict the
likelihood of
developing certain
cardiovascular
diseases
CONCLUSION
• Prediction of heart diseases is done successfully in achieving superior
performance compared to other machine learning models
• 80/20 split dataset gives highest accuracy of impressive 83% underscoring
its effectiveness in heart disease prediction
• RFCNN's capability to handle complex medical datasets and provide
reliable predictions
• The Flask-based implementation ensures a user-friendly interface for
medical professionals and patients, with seamless integration into
healthcare systems.
FUTURE SCOPE
Enhance the model's
performance by
incorporating additional
features like genetic data or
lifestyle factors.
Expand the system to
predict other
cardiovascular conditions
for broader applicability.
Explore the integration of
blockchain for secure and
decentralized patient data
management.
Collaborate with healthcare
providers to validate the
system in real-world clinical
environments
THANK YOU
Any Questions ?

More Related Content

PPTX
Heart_Disease_Prediction_Presentation.pptx
PPTX
Short story.pptx
PPTX
prediction of heart disease using machine learning
PDF
Prediction of heart disease using classification mining technique on spark
PPTX
Short story_2.pptx
PDF
IRJET- Heart Disease Prediction System
PDF
Hybrid CNN and LSTM Network For Heart Disease Prediction
PDF
A hybrid model for heart disease prediction using recurrent neural network an...
Heart_Disease_Prediction_Presentation.pptx
Short story.pptx
prediction of heart disease using machine learning
Prediction of heart disease using classification mining technique on spark
Short story_2.pptx
IRJET- Heart Disease Prediction System
Hybrid CNN and LSTM Network For Heart Disease Prediction
A hybrid model for heart disease prediction using recurrent neural network an...

Similar to Feature based heart disease prediction approach (20)

PDF
predictionofheartdiseaseusingmachinelearning.pdf
PPTX
Prediction of heart disease using machine learning.pptx
PPTX
PPT.pptx
PPTX
Project Phase-01.pptx guj.fjfjyfjfhjjfgj
PPTX
Presentation
PDF
Prediction of heart disease using neural network
PDF
Heart Disease Prediction using Machine Learning Algorithms
PDF
Risk Of Heart Disease Prediction Using Machine Learning
PDF
HEART DISEASE PREDICTION RANDOM FOREST ALGORITHMS
PDF
Heart Disease Prediction Using Machine Learning Techniques
PPTX
Heart Disease prediction using machine learning
PDF
Heart disease classification using Random Forest
PPTX
HRFLM important engineering topics for first sem.pptx
PDF
E-Healthcare monitoring System for diagnosis of Heart Disease using Machine L...
PPTX
Introduction Data collection Attributes used Technology used Library used Dat...
PDF
IRJET - Effective Heart Disease Prediction using Distinct Machine Learning Te...
PDF
Heart Disease Prediction using Machine Learning
PDF
711201905
PDF
711201905
PDF
711201905
predictionofheartdiseaseusingmachinelearning.pdf
Prediction of heart disease using machine learning.pptx
PPT.pptx
Project Phase-01.pptx guj.fjfjyfjfhjjfgj
Presentation
Prediction of heart disease using neural network
Heart Disease Prediction using Machine Learning Algorithms
Risk Of Heart Disease Prediction Using Machine Learning
HEART DISEASE PREDICTION RANDOM FOREST ALGORITHMS
Heart Disease Prediction Using Machine Learning Techniques
Heart Disease prediction using machine learning
Heart disease classification using Random Forest
HRFLM important engineering topics for first sem.pptx
E-Healthcare monitoring System for diagnosis of Heart Disease using Machine L...
Introduction Data collection Attributes used Technology used Library used Dat...
IRJET - Effective Heart Disease Prediction using Distinct Machine Learning Te...
Heart Disease Prediction using Machine Learning
711201905
711201905
711201905
Ad

More from Syeda Nasiha (12)

PPTX
review_major 2nd phase project report on automated
PDF
Python learning basicss Presentation.pdf
PPT
Project on system for patient monitoring.ppt
PPTX
Switched capacitor voltage boosting converter for electric and hybrid electri...
PPTX
Automatic_Plant_Watering_System_using_Arduino_UNO_.pptx
PPSX
Internship PPT.ppsx
PPT
Arm7 architecture
PPTX
PPT
On chip cache
PPTX
PPTX
3 vy14lvs24
PPT
review_major 2nd phase project report on automated
Python learning basicss Presentation.pdf
Project on system for patient monitoring.ppt
Switched capacitor voltage boosting converter for electric and hybrid electri...
Automatic_Plant_Watering_System_using_Arduino_UNO_.pptx
Internship PPT.ppsx
Arm7 architecture
On chip cache
3 vy14lvs24
Ad

Recently uploaded (20)

PDF
BMEC211 - INTRODUCTION TO MECHATRONICS-1.pdf
PPTX
MCN 401 KTU-2019-PPE KITS-MODULE 2.pptx
PPTX
web development for engineering and engineering
PDF
Well-logging-methods_new................
PPTX
Infosys Presentation by1.Riyan Bagwan 2.Samadhan Naiknavare 3.Gaurav Shinde 4...
PPTX
Construction Project Organization Group 2.pptx
PDF
keyrequirementskkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkk
PDF
Evaluating the Democratization of the Turkish Armed Forces from a Normative P...
PPTX
Strings in CPP - Strings in C++ are sequences of characters used to store and...
PPTX
Lesson 3_Tessellation.pptx finite Mathematics
PDF
PRIZ Academy - 9 Windows Thinking Where to Invest Today to Win Tomorrow.pdf
PPTX
Sustainable Sites - Green Building Construction
PPTX
CYBER-CRIMES AND SECURITY A guide to understanding
PPTX
OOP with Java - Java Introduction (Basics)
PPT
Project quality management in manufacturing
PDF
SM_6th-Sem__Cse_Internet-of-Things.pdf IOT
PPTX
FINAL REVIEW FOR COPD DIANOSIS FOR PULMONARY DISEASE.pptx
PPTX
Recipes for Real Time Voice AI WebRTC, SLMs and Open Source Software.pptx
PDF
Embodied AI: Ushering in the Next Era of Intelligent Systems
DOCX
573137875-Attendance-Management-System-original
BMEC211 - INTRODUCTION TO MECHATRONICS-1.pdf
MCN 401 KTU-2019-PPE KITS-MODULE 2.pptx
web development for engineering and engineering
Well-logging-methods_new................
Infosys Presentation by1.Riyan Bagwan 2.Samadhan Naiknavare 3.Gaurav Shinde 4...
Construction Project Organization Group 2.pptx
keyrequirementskkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkk
Evaluating the Democratization of the Turkish Armed Forces from a Normative P...
Strings in CPP - Strings in C++ are sequences of characters used to store and...
Lesson 3_Tessellation.pptx finite Mathematics
PRIZ Academy - 9 Windows Thinking Where to Invest Today to Win Tomorrow.pdf
Sustainable Sites - Green Building Construction
CYBER-CRIMES AND SECURITY A guide to understanding
OOP with Java - Java Introduction (Basics)
Project quality management in manufacturing
SM_6th-Sem__Cse_Internet-of-Things.pdf IOT
FINAL REVIEW FOR COPD DIANOSIS FOR PULMONARY DISEASE.pptx
Recipes for Real Time Voice AI WebRTC, SLMs and Open Source Software.pptx
Embodied AI: Ushering in the Next Era of Intelligent Systems
573137875-Attendance-Management-System-original

Feature based heart disease prediction approach

  • 1. FEATURE BASE HEART DISEASE PREDICTION USING RFCNN ADVANCE MACHINE LEARNING ALGORITHMS STUDENT DETAILS: QAMAR BEGUM ROLL NO. 161022742009 M. E, COMPUTER SCIENCE PROJECT GUIDE: Dr.MOHAMMED SANAULLAH QASEEM PROFESSOR & HOD CSE DEPARTMENT
  • 2. USING RFCNN ADVANCE MACHINE LEARNING ALGORITHM FEATURE BASE HEART DISEASE PREDICTION
  • 3. TABLE OF CONTENT • Abstract • Existing system • Disadvantages • Proposed System • Libraries • Test Accuracy • Results • Application • Conclusion • Future Scope • Literature survey • System requirement • Dendrogram • Use case Diagram • System Implementation
  • 4. ABSTRACT • Heart disease is one of the leading causes of death worldwide, • Predictive healthcare solutions frequently use machine learning (ML) techniques. But the accuracy of the majority of conventional ML models is less than 80%. • This procedure serves as a feature selector, identifying the most important features from the dataset and eliminating irrelevant or noisy ones. • Results will be useful to recognize patterns, correlations, and interactions among the selected features.
  • 5. EXISTING SYSTEM • Lack of feature selection and data representation limits their effectiveness. • Most models are sensitive to noise and overfitting. • Accuracy is often below 80%, which is insufficient for critical medical predictions. • Less work on analysis identifying the most important features from the dataset
  • 6. DISADVANTAGES • Inadequate feature selection results in poor data representation. • Limited ability to capture non-linear patterns in data. • High sensitivity to noise and outliers in the dataset. • Overfitting to training data, leading to poor generalization.
  • 7. PROPOSED SYSTEM • We will be understanding advanced techniques to address the limitations of existing methods • We will be able to understand predictions from multiple models to improve robustness. • Algorithm designed for high accuracy and scalability. • Ensures diversity in predictions by splitting data more randomly. • This ensemble approach leverages the strengths of each algorithm to deliver higher accuracy, precision, and recall for heart disease prediction
  • 8. Training Data Instance Training each decision tree on a random subset Bagging (majority) Prediction Output ............... Class A Class A class B pOutput RF ALGORITHMS
  • 9. Step 1: Select random K data points from the training set. Step 2:Build the decision trees associated with the selected data points(Subsets). Step 3:Choose the number N for decision trees that you want to build. Step 4:Repeat Step 1 and 2. Step 5: For new data points, find the predictions of each decision tree, and assign the new data points to the category STEPS
  • 12. Feature Selection Data Preprocessing Training set Testing set Heart disease data set Data Splitting Model Training RF CNN RFCNN Trained Model Model Evaluation Heart Disease severity Prediction
  • 13. LIBRARIES • Tensorflow-Building & training neural network • Numpy -Numerical computations • Flask- lightweight Python web framework • Keras- implementation of neural networks easy • sklearn-Data preprocessing, evaluation • matplotlib-Data visualization
  • 14. LITERATURE SURVEY PAPER TITLE AND YEAR ALGORITHMS ADVANTAGES LIMITATIONS Deep Learning Neural Networks for Predictive Healthcare 2020 Deep Learning Neural Networks (DLNN) • High accuracy due to automated feature extraction. • Computationally expensive. • Requires extensive labeled data for training. Genetic Algorithm and TRFNN for Optimized Healthcare Predictions 2021 Genetic Algorithms (GA) with Transfer Function Neural Networks (TRFNN) • Improved feature selection enhances model accuracy. • Adaptive learning process. • Computational overhead due to GA. • Risk of local optima in feature selection.
  • 15. LITERATURE SURVEY PAPER TITLE AND YEAR ALGORITHMS ADVANTAGES LIMITATIONS Hybrid SCNN-SVM for Image- Based Disease Detection 2022 Sparse Convolutional Neural Network (SCNN) with Support Vector Machines (SVM) • Reduces computational cost with sparse representations . • Limited scalability for very large datasets. • Complexity in model integration. PFFBPNN for Predictive Analytics in Healthcare 2020 Partial Feedback Feedforward Backpropagation Neural Networks (PFFBPNN) • Reduces training time compared to standard backpropagatio n. • .Sensitive to hyperparameter tuning. • Requires large memory resources for computation.
  • 16. LITERATURE SURVEY PAPER TITLE AND YEAR ALGORITHMS ADVANTAGES LIMITATIONS IoT-Enabled Random Forest Model for Remote Health Monitoring 2021 IoT and Random Forest for real- time monitoring • Handles high- dimensional data effectively. • Limited by IoT device bandwidth. • Dependent on stable internet connectivity
  • 17. SOFTWARE REQUIREMENTS HARDWARE REQUIREMENTS • OS: WINDOWS • PYTHON : PYTHON 3.X AND ABOVE • SETUP TOOLS AND PIP TO BE INSTALLED FOR 3.6.X AND ABOVE •RAM : 1GB AND HIGHER •Processor : Intel i3 and above •Hard Disk : 5GB minimum is required
  • 18. DENDROGRAM Heart Disease Clinical Data Data Processing Feature selection Feature Normalization Classification Heart Disease Absent Heart disease present
  • 19. SYSTEM IMPLEMENTATION RFCNN Algorithm Steps: • Hybrid Approach • Feature Reduction • Combination of Strengths • Feature Selection process
  • 20. HYBRID APPROACH Random Forest serves as a feature selector. • Identifying the most important features. • Eliminating irrelevant or noisy ones. . CNN is used as the classification engine. • Recognize patterns • Recognize correlations • Interactions among the selected features
  • 22. FEATURE REDUCTION • Random Forest calculates feature importance scores and selects the top-N features • This reduces dimensionality and improves Convolution Neural Network efficiency by focusing only on meaningful data.
  • 23. COMBINATION OF STRENGTHS • Random Forest is robust to overfitting due to its ensemble structure, making it effective for feature selection. • The hybrid model can manage complex relationships in the data since CNN is excellent at learning complex, hierarchical patterns.
  • 24. FEATURE SELECTION Step:1 Train the Random Forest Model step:2 Compute Feature Importance Step:3 Rank Features by Importance Step:4 Select Top Features Step:5 Feature Subset Input: The complete dataset with all features and the target variable.
  • 26. TEST ACCURACY RF CNN Accuracy Graph 83 percent accuracy RFCNN compared with CNN
  • 27. SPLIT-80:20 80 20 15 90 confusion matrix for RCNN Predicted Label Expected label 100 80 60 40 20 0 No Heart Disease Heart Disease No Heart Disease Heart Disease
  • 28. APPLICATIONS Accurately detect abnormalities Analyze vast amounts of patient data Predict the likelihood of developing certain cardiovascular diseases
  • 29. CONCLUSION • Prediction of heart diseases is done successfully in achieving superior performance compared to other machine learning models • 80/20 split dataset gives highest accuracy of impressive 83% underscoring its effectiveness in heart disease prediction • RFCNN's capability to handle complex medical datasets and provide reliable predictions • The Flask-based implementation ensures a user-friendly interface for medical professionals and patients, with seamless integration into healthcare systems.
  • 30. FUTURE SCOPE Enhance the model's performance by incorporating additional features like genetic data or lifestyle factors. Expand the system to predict other cardiovascular conditions for broader applicability. Explore the integration of blockchain for secure and decentralized patient data management. Collaborate with healthcare providers to validate the system in real-world clinical environments