Copyright: Futuretext Ltd. London
Data Mining for Wearable Sensors in Health Monitoring
Systems: A Review of Recent Trends and Challenges
Hadi Banaee *, Mobyen Uddin Ahmed and Amy Loutfi
Center for Applied Autonomous Sensor Systems, Örebro University,
SE-70182 Örebro, Sweden; E-Mails: mobyen.ahmed@oru.se (M.U.A.);
amy.loutfi@oru.se (A.L.)
The focus is shifting from data collection and simple apps (counting steps,
tracking sleep, etc.) to data analytics based on context awareness and
personalization.
The review concentrates on the following vital sign parameters:
electrocardiogram (ECG), oxygen saturation (SpO2), heart rate (HR),
photoplethysmography (PPG), blood glucose (BG), respiratory rate (RR),
and blood pressure (BP).
Three types of data mining tasks:
Anomaly detection (including raising alarms)
Prediction
Diagnosis
Three analysis dimensions:
a) Setting in which the monitoring occurs (e.g., independent living)
b) Type of subjects used (e.g., healthy, specific illness)
c) How and where the data is processed
Anomaly Detection
Anomaly detection techniques are often built on classification methods that
separate the data set into a normal class and outliers. For example, support
vector machines, Markov models, and wavelet analysis are used in health
monitoring systems for anomaly detection.
a) They usually deal with short-term, multivariate data sets and
characterize the entire data set in order to find discords.
b) They find irregular patterns in vital sign time series, such as abnormal
episodes in ECG pulses, the SpO2 signal, and blood glucose levels; most
discover unusual temporal patterns in continuous data.
c) They use domain knowledge and predefined information to detect anomalies
for decision making, such as anomaly detection in sleep episodes and
finding hazardous stress levels.
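As a rough illustration of finding unusual temporal patterns in a continuous vital sign, the sketch below flags samples that deviate strongly from the recent past. It is a simple rolling z-score baseline, not one of the SVM, Markov, or wavelet methods named above. The function name, the window size, the threshold, and the heart rate values are all assumptions made for this example.

```python
from statistics import mean, stdev

def rolling_zscore_anomalies(signal, window=10, threshold=3.0):
    """Return indices whose value deviates from the preceding `window`
    samples by more than `threshold` standard deviations."""
    anomalies = []
    for i in range(window, len(signal)):
        history = signal[i - window:i]
        mu = mean(history)
        sigma = stdev(history)
        if sigma == 0:
            # Flat history: any change at all counts as a deviation.
            if signal[i] != mu:
                anomalies.append(i)
            continue
        if abs(signal[i] - mu) / sigma > threshold:
            anomalies.append(i)
    return anomalies

# Hypothetical heart rate samples (beats per minute) with one spike.
heart_rate = [72, 71, 73, 72, 74, 73, 72, 71, 73, 72, 74, 73, 130, 72, 73]
print(rolling_zscore_anomalies(heart_rate))
```

Only the spike at index 12 is flagged here. Note that the spike then inflates the window statistics for the following samples, which is one reason real systems tend to use more robust models.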
Prediction
Prediction uses supervised learning models, which involve feature
extraction, training, and testing steps, to predict future behavior.
Examples: blood glucose level prediction, mortality prediction by
clustering electronic health data, and a predictive decision making system
for dialysis patients.
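The three steps (feature extraction, training, testing) can be sketched on a toy series. This is a minimal illustration, assuming a one-step-ahead forecast from the previous reading and a plain least-squares line as the model; the function names and the synthetic glucose values are hypothetical and do not come from the reviewed studies.

```python
def make_lagged_pairs(series, lag=1):
    """Feature extraction: pair each reading with the one `lag` steps later."""
    return [(series[i], series[i + lag]) for i in range(len(series) - lag)]

def fit_line(pairs):
    """Training: ordinary least squares for y = slope * x + intercept."""
    n = len(pairs)
    mean_x = sum(x for x, _ in pairs) / n
    mean_y = sum(y for _, y in pairs) / n
    sxx = sum((x - mean_x) ** 2 for x, _ in pairs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in pairs)
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x

def mean_absolute_error(pairs, slope, intercept):
    """Testing: average absolute gap between prediction and true value."""
    return sum(abs(slope * x + intercept - y) for x, y in pairs) / len(pairs)

# Hypothetical glucose readings (mg/dL) rising by 2 per sample.
glucose = [100 + 2 * t for t in range(20)]
pairs = make_lagged_pairs(glucose)
train, test = pairs[:14], pairs[14:]  # chronological split, no shuffling
slope, intercept = fit_line(train)
error = mean_absolute_error(test, slope, intercept)
print(slope, intercept, error)
```

The split is chronological so the model is never tested on readings that precede its training data. The series is perfectly linear, so the held-out error is essentially zero; real glucose data would not behave this way.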
Diagnosis/Decision Making
Similar to anomaly detection, but not necessarily aimed at detecting
abnormalities.
Examples: estimating the severity of health episodes in patients with
chronic disease, sleep-related tasks such as polysomnography analysis and
apnea detection, estimation and classification of health conditions, and
emotion recognition. Most of these studies used online databases with
annotated episodes in order to have sufficient, trustworthy real-world
disease labels for evaluating the decision making process. Given the
complexity of the data, neural networks and decision trees are commonly used.
Other Data Mining Tasks for Wearable Sensors
(1) data acquisition using an adequate sensor set;
(2) transmission of data from subject to clinician;
(3) integration of data with other descriptive data; and
(4) data storage.
Several data mining techniques are applied to these tasks, such as wavelet
analysis for artifact reduction and data compression, rule-based methods for
data summarization and transmission, and Gaussian processes for secure
authentication.
Preprocessing
(1) filter unusual data to remove artifacts, and
(2) remove high-frequency noise.
Example: filtering ECG data to remove high-frequency noise; other methods
operate in the frequency domain.
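Step (2) can be illustrated with the simplest low-pass filter, a centered moving average. This is a sketch only: the slides do not say which filter the reviewed systems use, and the function name, window size, and synthetic signal below are assumptions.

```python
def moving_average(signal, window=5):
    """Smooth `signal` with a centered moving average (a basic low-pass filter).
    Near the edges the window shrinks to the samples that exist."""
    half = window // 2
    smoothed = []
    for i in range(len(signal)):
        lo = max(0, i - half)
        hi = min(len(signal), i + half + 1)
        smoothed.append(sum(signal[lo:hi]) / (hi - lo))
    return smoothed

# Hypothetical signal: a constant baseline of 1.0 plus alternating +/-0.5 noise.
noisy = [1.0 + (0.5 if i % 2 == 0 else -0.5) for i in range(20)]
clean = moving_average(noisy, window=5)
print(max(abs(v - 1.0) for v in noisy), max(abs(v - 1.0) for v in clean))
```

The sample-to-sample noise shrinks from 0.5 to well under 0.2 around the baseline. A wider window removes more noise but also blurs sharp features such as the QRS complex, so the window size is a trade-off.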
Extracted features by signal (Time Domain, Spectral Domain, Other Features)
ECG
Time domain: Mean R-R, Std R-R, Mean HR, Std HR [39], Number of R-R intervals [27], Mean R-R, Std R-R interval [64].
Spectral domain: Spectral energy [27,62], Power spectral density [32], Low-pass filter [45], Low/high frequency [39,64].
Other features: -
SpO2
Time domain: Mean, zero crossing counts, entropy [48], Mean, Slope [61], Self-similarity [60].
Spectral domain: Energy, Low frequency [60].
Other features: Drift from normality range [61], Entropy [60].
HR
Time domain: Mean, Slope [61], Mean, Self-similarity, Std [60].
Spectral domain: Energy, Low/high frequency [60], Low/high frequency [36], Wavelet coefficients of data segments [45], Low/high frequency, Power spectral density [42].
Other features: Drift from normality range [61], Entropy, Co-occurrence coefficients [60].
PPG
Time domain: Rise times, Max, Min, Mean [36].
Spectral domain: Low/high frequency [36].
Other features: -
BP
Time domain: Mean, Slope [61].
Spectral domain: -
Other features: Rule-based features [56].
RR
Time domain: Mean, Min, Max [64].
Spectral domain: -
Other features: Residual and tidal volume [64].
Other
Time domain: Zero crossings count, Peak value, Rise time (EMG) [68], Mean, Duration (GSR) [36], Peak value, Min, Max (SCR) [51], Total magnitude, Duration (GSR) [39].
Spectral domain: Spectral energy (EEG) [27], Median and mean frequency, Spectral energy (EMG) [68], Energy (GSR) [36].
Other features: Bandwidth, Peaks count (GSR) [36].
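A few of the time-domain features listed above are straightforward to compute. The sketch below assumes R-R intervals are already available in seconds (R-peak detection is out of scope) and derives instantaneous heart rate as 60 / R-R; the function names, dictionary keys, and sample values are hypothetical.

```python
from statistics import mean, stdev

def rr_time_features(rr_intervals_s):
    """Time-domain ECG features from R-R intervals given in seconds."""
    heart_rates = [60.0 / rr for rr in rr_intervals_s]  # instantaneous HR in bpm
    return {
        "mean_rr": mean(rr_intervals_s),
        "std_rr": stdev(rr_intervals_s),
        "mean_hr": mean(heart_rates),
        "std_hr": stdev(heart_rates),
        "n_rr": len(rr_intervals_s),
    }

def zero_crossings(signal):
    """Count sign changes of the signal after subtracting its mean."""
    center = mean(signal)
    centered = [v - center for v in signal]
    return sum(1 for a, b in zip(centered, centered[1:]) if a * b < 0)

# Hypothetical R-R intervals (seconds) and a short oscillating signal.
features = rr_time_features([0.8, 0.8, 1.0, 1.0])
crossings = zero_crossings([1, 3, 1, 3, 1])
print(features, crossings)
```

Each window of raw signal is reduced to a small, fixed-length feature vector of this kind before it is passed to a classifier or predictor.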
The three most popular approaches to dimension reduction in the medical
domain are PCA, ICA, and LDA.
Other tools for feature selection used in the literature include
threshold-based rules, analysis of variance (ANOVA), and Fourier transforms.
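To make ANOVA-based feature selection concrete, the sketch below computes the one-way ANOVA F statistic of a single feature against class labels: the between-class variance divided by the within-class variance. Features with a higher F separate the classes better and are kept. The function name, labels, and feature values are assumptions for illustration.

```python
def anova_f(values, labels):
    """One-way ANOVA F statistic of one feature grouped by class label."""
    groups = {}
    for value, label in zip(values, labels):
        groups.setdefault(label, []).append(value)
    n, k = len(values), len(groups)
    grand_mean = sum(values) / n
    # Variation of the class means around the overall mean.
    ss_between = sum(
        len(g) * (sum(g) / len(g) - grand_mean) ** 2 for g in groups.values()
    )
    # Variation of the samples around their own class mean.
    ss_within = sum(
        (v - sum(g) / len(g)) ** 2 for g in groups.values() for v in g
    )
    return (ss_between / (k - 1)) / (ss_within / (n - k))

# Hypothetical windows: four labeled normal, four labeled abnormal.
labels = ["normal"] * 4 + ["abnormal"] * 4
mean_hr = [70, 72, 71, 73, 95, 97, 96, 98]  # differs clearly between classes
noise = [5, 9, 6, 8, 6, 8, 5, 9]            # same distribution in both classes
f_hr = anova_f(mean_hr, labels)
f_noise = anova_f(noise, labels)
print(f_hr, f_noise)
```

Here the heart rate feature scores far higher than the noise feature, so a selection rule that keeps the top-scoring features would retain it. The function assumes every class has some within-class spread; a feature that is constant within each class would cause a division by zero.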
The health parameters most commonly analyzed with SVM methods are ECG, HR,
and SpO2, mostly in short-term, annotated form.
In general, SVM techniques are often proposed for anomaly detection
and decision making tasks in healthcare services.
Neural networks (NN) can model highly nonlinear systems such as
physiological records, where the correlation between input parameters is not
easily detectable.
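For readers unfamiliar with SVMs, the sketch below trains a linear SVM on a tiny annotated data set by full-batch subgradient descent on the L2-regularized hinge loss. In practice a library such as scikit-learn would be used; this pure Python version only exposes the idea. The function names, hyperparameters, and the two-feature toy data (imagined as standardized features of short annotated windows) are all assumptions.

```python
def train_linear_svm(points, labels, lam=0.01, lr=0.1, epochs=100):
    """Minimize lam/2 * |w|^2 + mean hinge loss by subgradient descent.
    `labels` must be +1 or -1."""
    dim = len(points[0])
    n = len(points)
    w = [0.0] * dim
    b = 0.0
    for _ in range(epochs):
        grad_w = [lam * wj for wj in w]  # gradient of the regularizer
        grad_b = 0.0
        for x, y in zip(points, labels):
            margin = y * (sum(wj * xj for wj, xj in zip(w, x)) + b)
            if margin < 1:  # inside the margin or misclassified
                for j in range(dim):
                    grad_w[j] -= y * x[j] / n
                grad_b -= y / n
        w = [wj - lr * gj for wj, gj in zip(w, grad_w)]
        b -= lr * grad_b
    return w, b

def predict(w, b, x):
    """Return +1 or -1 depending on the side of the separating hyperplane."""
    return 1 if sum(wj * xj for wj, xj in zip(w, x)) + b >= 0 else -1

# Hypothetical annotated windows: +1 = abnormal, -1 = normal.
points = [(2, 2), (3, 3), (2, 3), (-2, -2), (-3, -3), (-2, -3)]
labels = [1, 1, 1, -1, -1, -1]
w, b = train_linear_svm(points, labels)
print([predict(w, b, x) for x in points])
```

The toy classes are linearly separable, so every training window is classified correctly. Physiological data rarely is, which is why kernel SVMs and careful validation on annotated data matter in the reviewed work.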
Data Properties
• Time horizon (long term/short term): Some data analysis systems in
healthcare were designed to process short signals, such as a few minutes
of ECG data or a few hours of heart rate or oxygen saturation; others
process longer records, such as blood pressure measurements over a day
or blood glucose over even longer periods.
• Scale (large/small): studies that consider a large number of subjects
(patients or healthy) are counted as large-scale studies [30].
• Labeling (annotated/unlabeled): annotations can also be acquired from
another source of knowledge, such as the electronic health record (EHR),
coronary syndromes, and the history of vital signs.
• Continuous/discrete
• Single sensor/multiple sensors
Challenges
• Need for large-scale monitoring in non-clinical contexts
• Dealing with annotated data sets: few benchmark data sets are
available, and it remains an open question how data annotation (labeling)
can best be done for such target groups.
• Multiple measurements: another challenge is exploiting multiple vital
sign measurements simultaneously, especially with sensor fusion
techniques.
• Contextual information
• Reliability and level of trust in the system: the amount of trust between
the data analysis system and the experts who use it for decision
making tasks.
• Discovery of unseen features
• Post-processing

IoT analytics in wearables
