SlideShare a Scribd company logo
2
Most read
3
Most read
9
Most read
DISEASE PREDICTION SYSTEM USING
DATA MINING
Under Guidance
of
asst. prof. Ashutosh Pandey
Presented By:-
Anand kumar mishra (1616210020)
Siddhesh shukla
(1616210112)
Shivani yadav
(1616210103)
B.Tech. (CS) – 4th year.
OUTLINE
• Introduction
▪ Objective
▪ Motivation
• Literature Survey
• Some Key Features of Disease
• Plan of Action
• Methodology Adopted
▪ Data Collection
▪ Steps to be Performed
▪ Functional Architecture
• Expected Result
• Conclusion
• References
OBJECTIVE
Identifying hidden patterns and relationships
among various attributes that can lead to:
▪ better diagnosis,
▪ better medicines,
▪ better treatment
▪ Early diagnosis may predict the chances of
Disease and lead to take
preventive measures before the situation
becomes critical.
MOTIVATION
• The prevalence of Diabetes is increasing in all
countries and its prevention has become a public
health priority.
• The predictors of Diabetes risk are insufficiently
understood.
WORLDWIDE STATISTICS
❏ The number of people with diabetes has risen
from 108 million in 1980 to 422 million in 2014.
❏ The global prevalence of diabetes among adults
over 18 years of age has risen from 4.7% in 1980
to 8.5% in 2014
❏ Diabetes is a major cause of blindness, kidney
failure, heart attacks, stroke and lower limb
amputation.
❏ In 2016, an estimated 1.6 million deaths were
directly caused by diabetes.
❏ Almost half of all deaths attributable to high
blood glucose occur before the age of 70 years.
DISEASE PREDICTION SYSTEM USING  DATA MINING
INDIAN STATISTICS
� Diabetes currently affects more than 62 million
Indians, which is more than 7.1% of the adult
population.
� Nearly 1 million Indians die due to diabetes
every year.
TOTAL
DEATHS
(in million)
SEX % DEATH DUE
TO DIABETES
40.2 MALE 8
23.8 FEMALE 12
MOTIVATION
❖ Recent research has shown that the onset of can be
postponed or prevented with lifestyle intervention or by
medication.
❖ Identifying individuals at high risk of cancer has therefore
become a priority for targeting preventive measures
effectively.
❖ Symptoms are often less marked, thus the disease may be
diagnosed several years after onset, once complications
have already arisen
SYMPTOMS OF DIABETES
� increased urine output,
� excessive thirst,
� weight loss,
� hunger,
� fatigue,
� skin problems
� slow healing wounds,
� yeast infections, and
� tingling or numbness in the feet or toes.
DISEASE PREDICTION SYSTEM USING  DATA MINING
PLAN OF WORK
Data Collection / Data Analysis
& Literature Review
(3-4 months)
Data Warehouse Construction
(1-2 months)
Building Classifier
(3-3.5 months)
Experimental Result Analysis
(1-2 months)
Report Preparation
(1-1.5 months)
METHODOLOGY ADOPTED
Step 1- Preprocessing of data
Step 2-Dividing the patients into different group
Step 3- Apply the Fuzzy Inference
Step 4-Using Apriori Algorithms to find the relative pattern
Step 5- Build Classifier
FUNCTIONAL ARCHITECTURE
DATA
SET
PREPROCESSING
ASSOCIATION
RULE MINNING
FUZZIFICATION
OF
ATTRIBUTES
RESULT
BUILD
CLASSIFIER
DIVIDE THE
DATA SET INTO 3
CLUSTERS
PREPROCESSING
-- MERGING DATA FROM MULTIPLE SOURCES INTO
UNIQUE FORMAT
-- MISSING VALUE HANDLING
Use the attribute mean for all samples belonging to the
same as the given tuple.
DIVIDE THE DATA SET INTO 3 CLUSTER
Dividing the patients into 3 three different group
according to different condition of patients
� Very serious
� Serious
� Normal
FUZZY LOGIC
� Intuitionistic Fuzzy Set : claim that an
element x belongs to a given degree μA(x) to a fuzzy
set x should not belong to A to the extent 1-
μA(x)
α - Cut:
Let α be a number between 0 and 1. The α-cut of fuzzy
set A at level α is the set of those elements of A where
membership function is greater than or equal to α.
16
APRIORI ALGORITHM
We use apriori algorithm to find relative pattern
Suppose we get association rule
A→B confidence 95%
Means if patient has attribute A then it will has
attribute B also with confidence of 95 %
BUILD CLASSIFIER
� Various techniques may be applied –
✔ Multilayer Back propagation feed-forward ANN
-- Train ANN using the weight of attributes
calculate
from association rules confidence value.
✔ Simple Weighted Sum Method
� Expected Result :
� -- Class Label prediction of the patient as either:
� normal
� serious or
� very serious
✔
EXPECTED RESULT
� Class Label prediction of the patient as either:
� normal
� serious or
� very serious
o Association among various attributes with
respective confidence level (A🡪B , CL)
CONCLUSION
We are finding the relative pattern of patients in a
hospital we uses IFS, α-cuts, and Apriori algorithm for
discovering the knowledge of patients.
Our approach will successfully protect the patients’
personal data privacy and will achieve some gratifying
results from the experiments.
Certainly, the approach is not limited in a disease, it
can be used in other fields in the long run.
REFERENCES
• “Mining Cancer data with Discrete Particle Swarm Optimization and Rule
Pruning “
Yao Liu and Yuk Ying Chun
• “Identifying HotSpots in Lung Cancer Data Using Association Rule
Mining “
Ankit Agrawal and Alok Choudhary
• “Comparison of feature selection methods for multiclass cancer
classification based on microarray data” Xiaobo Li1,2*,
Sihua Peng3, Xiaosi Zhan1
• “Lung cancer statistics,” centers for Disease Control and Prevention,
URL:http://www.cdc.gov/cancer/lung/statistics
• en.wikipedia.org/wiki/World_Health_Organization
• www.whoindia.org
• A. Jemal, F. Bray, M.M. Center, J. Ferlay, E. Ward, D. Forman(2011).
"Global cancer tatistics". CA: a cancer journal forclinicians61

More Related Content

PPTX
Machine Learning for Disease Prediction
PPTX
Project on disease prediction
PPTX
Disease Prediction by Machine Learning Over Big Data From Healthcare Communities
PPTX
HEALTH PREDICTION ANALYSIS USING DATA MINING
PPTX
Disease prediction using machine learning
PDF
Heart Disease Identification Method Using Machine Learnin in E-healthcare.
PDF
Heart disease prediction
PPTX
Disease prediction and doctor recommendation system
Machine Learning for Disease Prediction
Project on disease prediction
Disease Prediction by Machine Learning Over Big Data From Healthcare Communities
HEALTH PREDICTION ANALYSIS USING DATA MINING
Disease prediction using machine learning
Heart Disease Identification Method Using Machine Learnin in E-healthcare.
Heart disease prediction
Disease prediction and doctor recommendation system

What's hot (20)

PPTX
Machine Learning - Breast Cancer Diagnosis
PPTX
Disease Prediction And Doctor Appointment system
PPTX
Final ppt
PDF
Heart Attack Prediction using Machine Learning
PPTX
Predict Breast Cancer using Deep Learning
PPTX
Predicting Diabetes Using Machine Learning
PDF
Diabetes Prediction Using Machine Learning
DOCX
DISEASE PREDICTION BY MACHINE LEARNING OVER BIG DATA FROM HEALTHCARE COMMUNI...
PDF
Machine Learning in Healthcare
PDF
Deep learning and Healthcare
PPTX
Breast cancer diagnosis machine learning ppt
PPT
Diabetes prediction using machine learning
PPTX
HEART DISEASE PREDICTION USING NAIVE BAYES ALGORITHM
PPT
Survey on data mining techniques in heart disease prediction
PDF
Prediction of Heart Disease using Machine Learning Algorithms: A Survey
PPTX
Prediction of heart disease using machine learning.pptx
PPTX
DIABETES PREDICTION SYSTEM .pptx
PPTX
Machine Learning in Healthcare Diagnostics
PPTX
Brain Tumour Detection.pptx
PPT
Machine Learning
Machine Learning - Breast Cancer Diagnosis
Disease Prediction And Doctor Appointment system
Final ppt
Heart Attack Prediction using Machine Learning
Predict Breast Cancer using Deep Learning
Predicting Diabetes Using Machine Learning
Diabetes Prediction Using Machine Learning
DISEASE PREDICTION BY MACHINE LEARNING OVER BIG DATA FROM HEALTHCARE COMMUNI...
Machine Learning in Healthcare
Deep learning and Healthcare
Breast cancer diagnosis machine learning ppt
Diabetes prediction using machine learning
HEART DISEASE PREDICTION USING NAIVE BAYES ALGORITHM
Survey on data mining techniques in heart disease prediction
Prediction of Heart Disease using Machine Learning Algorithms: A Survey
Prediction of heart disease using machine learning.pptx
DIABETES PREDICTION SYSTEM .pptx
Machine Learning in Healthcare Diagnostics
Brain Tumour Detection.pptx
Machine Learning
Ad

Similar to DISEASE PREDICTION SYSTEM USING DATA MINING (20)

PDF
IRJET- Predicting Diabetes Disease using Effective Classification Techniques
PDF
IRJET- Result on the Application for Multiple Disease Prediction from Symptom...
PDF
Multiple Disease Prediction System: A Review
PDF
An Empirical Study On Diabetes Mellitus Prediction For Typical And Non-Typica...
PDF
Multiple Disease Prediction System
PDF
Diabetes Prediction by Supervised and Unsupervised Approaches with Feature Se...
PDF
GIHAT: An Efficient Prediction Technique for Measure for Diabetes Mellitus
PDF
IRJET- Disease Prediction System
PDF
FORESTALLING GROWTH RATE IN TYPE II DIABETIC PATIENTS USING DATA MINING AND A...
PDF
Machine learning approach for predicting heart and diabetes diseases using da...
PDF
DIABETES PROGNOSTICATION UTILIZING MACHINE LEARNING
PDF
Multi Disease Detection using Deep Learning
PDF
Prognosis of Diabetes by Performing Data Mining of HbA1c
PDF
AN ALGORITHM FOR PREDICTIVE DATA MINING APPROACH IN MEDICAL DIAGNOSIS
PDF
AN ALGORITHM FOR PREDICTIVE DATA MINING APPROACH IN MEDICAL DIAGNOSIS
PDF
Diabetes Mellitus Prediction System Using Data Mining
PDF
IRJET- Diabetes Prediction by Machine Learning over Big Data from Healthc...
PDF
IRJET - Deep Multiple Instance Learning for Automatic Detection of Diabetic R...
PDF
PREDICTION OF DIABETES (SUGAR) USING MACHINE LEARNING TECHNIQUES
PDF
Early Stage Diabetic Disease Prediction and Risk Minimization using Machine L...
IRJET- Predicting Diabetes Disease using Effective Classification Techniques
IRJET- Result on the Application for Multiple Disease Prediction from Symptom...
Multiple Disease Prediction System: A Review
An Empirical Study On Diabetes Mellitus Prediction For Typical And Non-Typica...
Multiple Disease Prediction System
Diabetes Prediction by Supervised and Unsupervised Approaches with Feature Se...
GIHAT: An Efficient Prediction Technique for Measure for Diabetes Mellitus
IRJET- Disease Prediction System
FORESTALLING GROWTH RATE IN TYPE II DIABETIC PATIENTS USING DATA MINING AND A...
Machine learning approach for predicting heart and diabetes diseases using da...
DIABETES PROGNOSTICATION UTILIZING MACHINE LEARNING
Multi Disease Detection using Deep Learning
Prognosis of Diabetes by Performing Data Mining of HbA1c
AN ALGORITHM FOR PREDICTIVE DATA MINING APPROACH IN MEDICAL DIAGNOSIS
AN ALGORITHM FOR PREDICTIVE DATA MINING APPROACH IN MEDICAL DIAGNOSIS
Diabetes Mellitus Prediction System Using Data Mining
IRJET- Diabetes Prediction by Machine Learning over Big Data from Healthc...
IRJET - Deep Multiple Instance Learning for Automatic Detection of Diabetic R...
PREDICTION OF DIABETES (SUGAR) USING MACHINE LEARNING TECHNIQUES
Early Stage Diabetic Disease Prediction and Risk Minimization using Machine L...
Ad

Recently uploaded (20)

PPTX
UNIT-1 - COAL BASED THERMAL POWER PLANTS
DOCX
ASol_English-Language-Literature-Set-1-27-02-2023-converted.docx
PDF
Structs to JSON How Go Powers REST APIs.pdf
PDF
Mitigating Risks through Effective Management for Enhancing Organizational Pe...
PPT
Mechanical Engineering MATERIALS Selection
PDF
The CXO Playbook 2025 – Future-Ready Strategies for C-Suite Leaders Cerebrai...
PPTX
MET 305 2019 SCHEME MODULE 2 COMPLETE.pptx
PPTX
Welding lecture in detail for understanding
PPTX
Engineering Ethics, Safety and Environment [Autosaved] (1).pptx
PDF
Mohammad Mahdi Farshadian CV - Prospective PhD Student 2026
PPTX
additive manufacturing of ss316l using mig welding
PPTX
KTU 2019 -S7-MCN 401 MODULE 2-VINAY.pptx
PPTX
Lesson 3_Tessellation.pptx finite Mathematics
PPTX
IOT PPTs Week 10 Lecture Material.pptx of NPTEL Smart Cities contd
PDF
Model Code of Practice - Construction Work - 21102022 .pdf
PDF
Digital Logic Computer Design lecture notes
DOCX
573137875-Attendance-Management-System-original
PPTX
Construction Project Organization Group 2.pptx
PDF
keyrequirementskkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkk
PDF
PPT on Performance Review to get promotions
UNIT-1 - COAL BASED THERMAL POWER PLANTS
ASol_English-Language-Literature-Set-1-27-02-2023-converted.docx
Structs to JSON How Go Powers REST APIs.pdf
Mitigating Risks through Effective Management for Enhancing Organizational Pe...
Mechanical Engineering MATERIALS Selection
The CXO Playbook 2025 – Future-Ready Strategies for C-Suite Leaders Cerebrai...
MET 305 2019 SCHEME MODULE 2 COMPLETE.pptx
Welding lecture in detail for understanding
Engineering Ethics, Safety and Environment [Autosaved] (1).pptx
Mohammad Mahdi Farshadian CV - Prospective PhD Student 2026
additive manufacturing of ss316l using mig welding
KTU 2019 -S7-MCN 401 MODULE 2-VINAY.pptx
Lesson 3_Tessellation.pptx finite Mathematics
IOT PPTs Week 10 Lecture Material.pptx of NPTEL Smart Cities contd
Model Code of Practice - Construction Work - 21102022 .pdf
Digital Logic Computer Design lecture notes
573137875-Attendance-Management-System-original
Construction Project Organization Group 2.pptx
keyrequirementskkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkk
PPT on Performance Review to get promotions

DISEASE PREDICTION SYSTEM USING DATA MINING

  • 1. DISEASE PREDICTION SYSTEM USING DATA MINING Under Guidance of asst. prof. Ashutosh Pandey Presented By:- Anand kumar mishra (1616210020) Siddhesh shukla (1616210112) Shivani yadav (1616210103) B.Tech. (CS) – 4th year.
  • 2. OUTLINE • Introduction ▪ Objective ▪ Motivation • Literature Survey • Some Key Features of Disease • Plan of Action • Methodology Adopted ▪ Data Collection ▪ Steps to be Performed ▪ Functional Architecture • Expected Result • Conclusion • References
  • 3. OBJECTIVE Identifying hidden patterns and relationships among various attributes that can lead to: ▪ better diagnosis, ▪ better medicines, ▪ better treatment ▪ Early diagnosis may predict the chances of Disease and lead to take preventive measures before the situation becomes critical.
  • 4. MOTIVATION • The prevalence of Diabetes is increasing in all countries and its prevention has become a public health priority. • The predictors of Diabetes risk are insufficiently understood.
  • 5. WORLDWIDE STATISTICS ❏ The number of people with diabetes has risen from 108 million in 1980 to 422 million in 2014. ❏ The global prevalence of diabetes among adults over 18 years of age has risen from 4.7% in 1980 to 8.5% in 2014 ❏ Diabetes is a major cause of blindness, kidney failure, heart attacks, stroke and lower limb amputation. ❏ In 2016, an estimated 1.6 million deaths were directly caused by diabetes. ❏ Almost half of all deaths attributable to high blood glucose occur before the age of 70 years.
  • 7. INDIAN STATISTICS � Diabetes currently affects more than 62 million Indians, which is more than 7.1% of the adult population. � Nearly 1 million Indians die due to diabetes every year. TOTAL DEATHS (in million) SEX % DEATH DUE TO DIABETES 40.2 MALE 8 23.8 FEMALE 12
  • 8. MOTIVATION ❖ Recent research has shown that the onset of can be postponed or prevented with lifestyle intervention or by medication. ❖ Identifying individuals at high risk of cancer has therefore become a priority for targeting preventive measures effectively. ❖ Symptoms are often less marked, thus the disease may be diagnosed several years after onset, once complications have already arisen
  • 9. SYMPTOMS OF DIABETES � increased urine output, � excessive thirst, � weight loss, � hunger, � fatigue, � skin problems � slow healing wounds, � yeast infections, and � tingling or numbness in the feet or toes.
  • 11. PLAN OF WORK Data Collection / Data Analysis & Literature Review (3-4 months) Data Warehouse Construction (1-2 months) Building Classifier (3-3.5 months) Experimental Result Analysis (1-2 months) Report Preparation (1-1.5 months)
  • 12. METHODOLOGY ADOPTED Step 1- Preprocessing of data Step 2-Dividing the patients into different group Step 3- Apply the Fuzzy Inference Step 4-Using Apriori Algorithms to find the relative pattern Step 5- Build Classifier
  • 14. PREPROCESSING -- MERGING DATA FROM MULTIPLE SOURCES INTO UNIQUE FORMAT -- MISSING VALUE HANDLING Use the attribute mean for all samples belonging to the same as the given tuple.
  • 15. DIVIDE THE DATA SET INTO 3 CLUSTER Dividing the patients into 3 three different group according to different condition of patients � Very serious � Serious � Normal
  • 16. FUZZY LOGIC � Intuitionistic Fuzzy Set : claim that an element x belongs to a given degree μA(x) to a fuzzy set x should not belong to A to the extent 1- μA(x) α - Cut: Let α be a number between 0 and 1. The α-cut of fuzzy set A at level α is the set of those elements of A where membership function is greater than or equal to α. 16
  • 17. APRIORI ALGORITHM We use apriori algorithm to find relative pattern Suppose we get association rule A→B confidence 95% Means if patient has attribute A then it will has attribute B also with confidence of 95 %
  • 18. BUILD CLASSIFIER � Various techniques may be applied – ✔ Multilayer Back propagation feed-forward ANN -- Train ANN using the weight of attributes calculate from association rules confidence value. ✔ Simple Weighted Sum Method � Expected Result : � -- Class Label prediction of the patient as either: � normal � serious or � very serious ✔
  • 19. EXPECTED RESULT � Class Label prediction of the patient as either: � normal � serious or � very serious o Association among various attributes with respective confidence level (A🡪B , CL)
  • 20. CONCLUSION We are finding the relative pattern of patients in a hospital we uses IFS, α-cuts, and Apriori algorithm for discovering the knowledge of patients. Our approach will successfully protect the patients’ personal data privacy and will achieve some gratifying results from the experiments. Certainly, the approach is not limited in a disease, it can be used in other fields in the long run.
  • 21. REFERENCES • “Mining Cancer data with Discrete Particle Swarm Optimization and Rule Pruning “ Yao Liu and Yuk Ying Chun • “Identifying HotSpots in Lung Cancer Data Using Association Rule Mining “ Ankit Agrawal and Alok Choudhary • “Comparison of feature selection methods for multiclass cancer classification based on microarray data” Xiaobo Li1,2*, Sihua Peng3, Xiaosi Zhan1 • “Lung cancer statistics,” centers for Disease Control and Prevention, URL:http://www.cdc.gov/cancer/lung/statistics • en.wikipedia.org/wiki/World_Health_Organization • www.whoindia.org • A. Jemal, F. Bray, M.M. Center, J. Ferlay, E. Ward, D. Forman(2011). "Global cancer tatistics". CA: a cancer journal forclinicians61