International Journal of Trend in Scientific Research and Development (IJTSRD)
Volume: 3 | Issue: 3 | Mar-Apr 2019 Available Online: www.ijtsrd.com e-ISSN: 2456 - 6470
@ IJTSRD | Unique Paper ID – IJTSRD22810 | Volume – 3 | Issue – 3 | Mar-Apr 2019 Page: 1496
Mining Health Examination Records - A Graph Based Approach
Jayashri A. Sonawane, Dr. Swati A. Bhavsar
Assistant Professor, Department of Computer Engineering, MCERC, Eklahare, Nashik, Maharashtra, India
How to cite this paper: Jayashri A. Sonawane | Dr. Swati A. Bhavsar "Mining Health Examination Records - A Graph Based Approach" Published in International Journal of Trend in Scientific Research and Development (ijtsrd), ISSN: 2456-6470, Volume-3 | Issue-3, April 2019, pp. 1496-1498, URL: https://www.ijtsrd.com/papers/ijtsrd22810.pdf
Copyright © 2019 by author(s) and International Journal of Trend in Scientific Research and Development Journal. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0) (http://creativecommons.org/licenses/by/4.0)
ABSTRACT
Electronic Health Records (EHRs) collect data on a yearly basis and are used for healthcare in many countries. Health Examination Records (HERs) collect data on a regular basis, and identifying the participants at risk from them is important for early warning and prevention. The fundamental challenge is learning a classification model for risk prediction when unlabelled data and live data strings make up the majority of the collected dataset. The unlabelled data describe participants in health examinations whose health conditions can vary from healthy to highly risky or very ill. In this paper, we propose a graph-based, semi-supervised learning algorithm called SHG-Health (Semi-supervised Heterogeneous Graph on Health) for risk prediction and assessment, to classify a progressively developing condition when the majority of the data is unlabelled. An efficient iterative algorithm is designed and developed, and a proof of its convergence is given. Extensive experiments on both a real health examination dataset and live datasets show the effectiveness of our method.
KEYWORDS: live data string, hetero HER, classifier
I. INTRODUCTION
Huge amounts of Electronic Health Records (EHRs) collected over the years have provided a rich base for risk analysis and prediction. An EHR contains digitally stored healthcare information about an individual, such as observations, laboratory tests, diagnostic reports, medications, procedures, patient identifying information, and allergies. A special type of EHR is the Health Examination Record (HER) from annual general health check-ups. For example, governments such as those of Australia, the U.K., and Taiwan offer periodic geriatric health examinations as an integral part of their aged care programs. Since clinical care often has a specific problem in mind, at a point in time only a limited and often small set of measures considered necessary is collected and stored in a person's EHR. By contrast, HERs are collected for regular surveillance and preventive purposes, covering a comprehensive set of general health measures, all collected at a point in time in a systematic way.
This paper proposes a semi-supervised heterogeneous graph-based algorithm called SHG-Health (Semi-supervised Heterogeneous Graph on Health) as an evidence-based risk prediction approach to mining longitudinal health examination records. To handle heterogeneity, it explores a heterogeneous graph based on health examination records, called the HeteroHER graph, in which examination items in different categories are modelled as different types of nodes, along with their temporal relationships.
II. REVIEW OF LITERATURE
M. F. Ghalwash, V. Radosavljevic, and Z. Obradovic [1] proposed a temporal data mining method for extracting interpretable patterns from multivariate time series data, which can be used to assist in providing interpretable early diagnosis. The problem is formulated as an optimization-based binary classification task addressed in three steps. Classification is often employed as a data exploration step, where summarization of the data in a target class using interpretable distinct features becomes the central task. Their work addresses the problem of extracting interpretable features for early classification on time series.
T. Tran, D. Phung, W. Luo, and S. Venkatesh [2] construct a novel ordinal regression framework for predicting medical risk stratification from EMR. First, a
conceptual view of EMR as a temporal image is constructed to extract a diverse set of features. Second, ordinal modeling is applied for predicting cumulative or progressive risk.
Mas S. Mohktar, Stephen J. Redmond, and Nick C. Antoniades [3] observe that the use of telehealth technologies to remotely monitor patients suffering from chronic diseases may enable preemptive treatment. As a means of detecting exacerbation earlier, and at the resolution of a single day, it has been proposed that patients with COPD might use a home telehealth service daily to evaluate their health status. Existing home telehealth services offer a range of vital sign monitoring modalities, including measurements of the lungs.
Jin-Mao Wei, Shu-Qin Wang, and Xiao-Jie Yuan [4] note that cancer classification is the critical basis for patient-tailored therapy. Conventional histological analysis tends to be unreliable because different tumors may have a similar appearance. Various machine learning methods can be employed to classify cancer tissue samples based on microarray data.
G. J. Simon, Pedro J. Caraballo, Terry M. Therneau, and Steven S. [7] maintain an EMR (Electronic Medical Record) and apply association rule mining to discover sets of risk factors. Their work draws on association rules, survival analysis, and association rule summarization.
Yanbing Xue and Milos Hauskrecht [6] observe that learning classification models in medicine often relies on data labeled by a human expert. Since labeling of clinical data may be time-consuming, finding ways of alleviating the labeling costs is critical for our ability to automatically learn such models.
Sheng, W. Ruan, X. Li, S. Wang, and Z. Yang [8] propose a data mining-based method for prediction of a personal health index based on annual geriatric medical examination (GME) records. The health risks are calculated using information from the cause of death (COD) dataset that is linked to the GME dataset.
Eichelberg M., Aden T., Riesmeier J., Dogac A., and Laleci G. [10] discuss the introduction of electronic health records, which boosts the efficiency of medical services at a lower cost while still offering a vast range of research challenges. In that work, the analysis of the documents gathered through the search terms yielded additional keywords and references to additional document sources. The following keywords or combinations were used: software, quality, certification, Electronic/Personal Medical/Health Record, EHR standards, EHR certification.
III. SYSTEM ARCHITECTURE / SYSTEM OVERVIEW
Health risk prediction is necessary for prevention and proper diagnosis before a disease has completely developed. The proposed system uses an efficient and robust classification algorithm based on a live data string. Electronic health records are not well suited to live or current data because they are collected on a yearly basis. The proposed system is therefore used to predict the future risk of participants from a live data string, for prevention and early diagnosis before the disease has completely developed.
IV. SYSTEM ANALYSIS
The general architecture of the project is described below.
Fig.1. Overview of the system Architecture
Live Data String:
In this component, we give live data to the system, which consists of known and unknown symptoms. On the basis of this data, the future risks of the participants are predicted.
Heterogeneous HER:
A graph representation can model data that is sparse. To capture the heterogeneity naturally found in health examination items, we constructed a graph called HeteroHER consisting of multi-type nodes based on health examination records. To address health risk prediction based on health examination records, with its problems of heterogeneity and large amounts of unlabeled data, we present a semi-supervised heterogeneous graph-based algorithm called SHG-Health.
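The idea of a graph with multi-type nodes can be illustrated with a minimal sketch. It is not the paper's HeteroHER construction: the record layout, the field names, and the helper `build_hetero_graph` are assumptions made only for illustration. Each examination record becomes a node, and each observed result becomes an item node typed by its examination category.

```python
from collections import defaultdict

# Hypothetical examination records: (participant, year, category, item, result).
# The layout and the values are illustrative assumptions, not data from the paper.
records = [
    ("p1", 2010, "physical", "blood_pressure", "high"),
    ("p1", 2011, "lab", "glucose", "abnormal"),
    ("p2", 2010, "physical", "blood_pressure", "high"),
    ("p2", 2010, "lab", "glucose", "normal"),
]


def build_hetero_graph(records):
    """Return (node_types, edges) for a graph with typed nodes.

    Each record node ("record", participant, year) is linked to one item
    node (category, item, result); the item node's type is its category.
    """
    node_types = {}            # node -> type label
    edges = defaultdict(set)   # undirected adjacency sets
    for participant, year, category, item, result in records:
        record_node = ("record", participant, year)
        item_node = (category, item, result)
        node_types[record_node] = "record"
        node_types[item_node] = category
        edges[record_node].add(item_node)
        edges[item_node].add(record_node)
    return node_types, edges


node_types, edges = build_hetero_graph(records)
# Records that share the same abnormal result become neighbours of one item node.
shared = sorted(edges[("physical", "blood_pressure", "high")])
```

In this sketch, records that share a result are connected through a common item node, which is what lets label information flow between similar participants in a graph-based learner.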
Semi-Supervised Learning:
The third component of our method is a semi-supervised learning algorithm for the constructed HeteroHER graph. The algorithm combines the advantages of methods for class discovery and for handling heterogeneity to address a specific problem raised by evidence-based risk prediction from health examination records.
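To make the idea of graph-based semi-supervised learning concrete, the sketch below implements generic iterative label propagation on a small homogeneous graph. It is a stand-in technique, not the SHG-Health algorithm itself, and it handles neither node types nor class discovery. The function name `propagate_labels`, the parameter `alpha`, and the toy graph are assumptions for illustration.

```python
import numpy as np


def propagate_labels(W, Y, alpha=0.8, tol=1e-6, max_iter=1000):
    """Iterate F <- alpha * S @ F + (1 - alpha) * Y until the update is small.

    W: symmetric, non-negative adjacency matrix (n x n).
    Y: initial label matrix (n x k); all-zero rows mark unlabelled nodes.
    S is the symmetrically normalised adjacency D^-1/2 W D^-1/2.
    """
    W = np.asarray(W, dtype=float)
    Y = np.asarray(Y, dtype=float)
    degree = W.sum(axis=1)
    degree[degree == 0] = 1.0                      # guard isolated nodes
    d_inv_sqrt = 1.0 / np.sqrt(degree)
    S = W * d_inv_sqrt[:, None] * d_inv_sqrt[None, :]
    F = Y.copy()
    for _ in range(max_iter):
        F_next = alpha * S @ F + (1 - alpha) * Y
        if np.abs(F_next - F).max() < tol:
            return F_next
        F = F_next
    return F


# Hypothetical chain 0 - 1 - 2 - 3: node 0 is labelled class 0 ("healthy"),
# node 3 is labelled class 1 ("high risk"), nodes 1 and 2 are unlabelled.
W = np.array([[0, 1, 0, 0],
              [1, 0, 1, 0],
              [0, 1, 0, 1],
              [0, 0, 1, 0]], dtype=float)
Y = np.array([[1, 0],
              [0, 0],
              [0, 0],
              [0, 1]], dtype=float)
F = propagate_labels(W, Y)
predicted = F.argmax(axis=1)   # each unlabelled node takes its nearer label
```

Because `alpha` is below 1 and the normalised adjacency has spectral radius at most 1, the iteration is a contraction and converges. This is the kind of property a convergence proof for an iterative graph algorithm establishes.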
Classifier:
The system addresses the problem of learning from unlabeled data by applying a semi-supervised approach. This is done by maintaining graphs of known and unknown symptoms. These graphs are given to the classifier, which works with two types of data: training data and testing data. From the training data, the classifier learns which symptoms are observed and which disease they indicate; it then gives the corresponding prediction of risk.
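The training and testing workflow described above can be sketched with a deliberately simple nearest-neighbour classifier over symptom sets. This is an illustrative substitute, not the classifier used by the proposed system. The symptom names, the risk labels, and the helpers `jaccard` and `predict_risk` are all hypothetical.

```python
def jaccard(a, b):
    """Similarity of two symptom sets: |a & b| / |a | b|."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def predict_risk(training, symptoms):
    """Return the risk label of the training case most similar to `symptoms`."""
    best_label, best_score = None, -1.0
    for known_symptoms, label in training:
        score = jaccard(known_symptoms, symptoms)
        if score > best_score:
            best_label, best_score = label, score
    return best_label


# Hypothetical training data: symptom sets paired with known risk labels.
training = [
    ({"high_glucose", "high_bmi"}, "high risk"),
    ({"high_blood_pressure", "high_bmi"}, "medium risk"),
    ({"normal_glucose", "normal_blood_pressure"}, "low risk"),
]
# Hypothetical testing data: held-out cases used only to measure accuracy.
testing = [
    ({"high_glucose", "high_bmi", "fatigue"}, "high risk"),
    ({"normal_glucose", "normal_blood_pressure", "normal_bmi"}, "low risk"),
]

correct = sum(predict_risk(training, s) == label for s, label in testing)
accuracy = correct / len(testing)
```

Keeping the testing cases out of the training list is the point of the split: accuracy is then measured on symptom combinations the classifier has not seen.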
Result Analysis:
In this section, the high-risk diseases are analyzed on the basis of the records obtained from the classifier.
V. ADVANTAGES
1. The SHG-Health algorithm handles a challenging multi-class classification problem with substantial unlabeled cases, which may or may not belong to the known classes. This work pioneers risk prediction based on health examination records in the presence of large unlabeled data.
2. A novel graph extraction mechanism is introduced for handling the heterogeneity found in longitudinal health examination records.
3. The proposed graph-based semi-supervised learning algorithm SHG-Health, which combines the advantages of heterogeneous graph learning and class discovery, shows significant performance gain on a large and comprehensive real health examination dataset of participants as well as on synthetic datasets.
ACKNOWLEDGMENT
Inspiration and guidance are invaluable in every aspect of life, especially in the field of education. I have received both from my respected guide, Prof. S. A. Bhavsar, who has guided me and given earnest co-operation whenever required. I would like to express my sincere gratitude towards her. She always provided me with access to the latest technology and facilities, offered encouragement at every point, and took an active part in the achievement of my objective.
VI. CONCLUSION
The proposed system shows how health examination records can be integrated, through data fusion, with other types of datasets, such as hospital-based electronic health records and participants' living conditions. The SHG-Health algorithm makes use of HeteroHER and semi-supervised learning to find the various known and unknown symptoms in the live data given to the system and to predict future risk.
REFERENCES
[1] M. F. Ghalwash, V. Radosavljevic, and Z. Obradovic, Extraction of interpretable multivariate patterns for early diagnostics, 2013.
[2] T. Tran, D. Phung, W. Luo, and S. Venkatesh, Stabilized sparse ordinal regression for medical risk stratification, Knowledge and Information Systems, Mar. 2014.
[3] M. S. Mohktar, S. J. Redmond, N. C. Antoniades, P. D. Rochford, J. J. Pretto, J. Basilakis, N. H. Lovell, and C. F. McDonald, Predicting the risk of exacerbation in patients with chronic obstructive pulmonary disease using home telehealth measurement data, Artificial Intelligence in Medicine, vol. 63, 2015.
[4] Q. Nguyen, H. Valizadegan, and M. Hauskrecht, Learning classification models with soft-label information, vol. 21, 2014.
[5] G. J. Simon, P. J. Caraballo, T. M. Therneau, S. S. Cha, M. R. Castro, and P. W. Li, Extending Association Rule Summarization Techniques to Assess Risk of Diabetes Mellitus, vol. 27, 2015.
[6] L. Chen, X. Li, S. Wang, H.-Y. Hu, N. Huang, Q. Z. Sheng, and M. Sharaf, Mining Personal Health Index from Annual Geriatric Medical Examinations, 2014.
[7] S. Pan, J. Wu, and X. Zhu, CogBoost: Boosting for Fast Cost-sensitive Graph Classification, vol. 6, 2015.
[8] M. Eichelberg, T. Aden, J. Riesmeier, A. Dogac, and G. B. Laleci, A survey and analysis of Electronic Healthcare Record standards, vol. 37, 2005.
[9] C. Y. Wu, Y. C. Chou, N. Huang, Y. J. Chou, H. Y. Hu, and C. P. Li, Cognitive impairment assessed at annual geriatric health examinations predicts mortality among the elderly, vol. 67, 2014.
[10] L. Krogsbøll, K. Jørgensen, C. Grønhøj Larsen, and P. Gøtzsche, General health checks in adults for reducing morbidity and mortality from disease (Review), no. 10, 2012.
[11] J. Kim and H. Shin, Breast cancer survivability prediction using labeled, unlabeled, and pseudo-labeled patient data, vol. 20, 2013.


Mining Health Examination Records A Graph Based Approach

  • 1. International Journal of Trend in Scientific Research and Development (IJTSRD) Volume: 3 | Issue: 3 | Mar-Apr 2019 Available Online: www.ijtsrd.com e-ISSN: 2456 - 6470 @ IJTSRD | Unique Paper ID – IJTSRD22810 | Volume – 3 | Issue – 3 | Mar-Apr 2019 Page: 1496 Mining Health Examination Records - A Graph Based Approach Jayashri A. Sonawane, Dr. Swati A. Bhavsar Assistant Professor, Department of Computer Engineering, MCERC, Eklahare, Nashik, Maharashtra, India How to cite this paper: Jayashri A. Sonawane | Dr. Swati A. Bhavsar "Mining Health Examination Records- A Graph Based Approach" Published in International Journal of Trend in Scientific Research and Development (ijtsrd), ISSN: 2456- 6470, Volume-3 | Issue-3, April 2019, pp.1496-1498, URL: https://www.ijtsrd.c om/papers/ijtsrd22 810.pdf Copyright © 2019 by author(s) and International Journal of Trend in Scientific Research and Development Journal. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0) (http://creativeco mons.org/licenses/by/4.0) ABSTRACT EHR(Electronic Health Records) collects data on yearly basis and it is used in many countries for healthcare.HER(Health Examination Records) collects the data on regular basis and identifies the participants at risk that is important for early warning and prevention.the fundamental challenge is for learning classification model for risk prediction with unlabelled data and live data string that established the majority of the collected dataset.the unlabelled data string describes the participants in health examintions whosehealthconditionscan be vary from healthy to highly risky or very ill.in this paper, we propose a graph based,semisupervised learning algorithm called SHG health (semi-supervised heterogenous graph on Health) for risk prediction and assessment to classify a progressively developing condition with the majority of the data unlabelled. 
An efficient iterative algorithm is designed and developed to proof the convergence is given.extensive experiments based on both real health examination dataset and live datasets to show effectiveness of our method. KEYWORDS: live data string, hetero HER, classifier I. INTRODUCTION Huge amounts of ElectronicHealth Records(EHRs)collected over the years have provided a rich baseforrisk analysisand prediction. An EHR contains digitally stored healthcare information about an individual, such as observations, laboratory tests, diagnostic reports, medications, procedures, patient identifying information, and allergies. A special type of HER is the HealthExamination Records (HER) from annual general health check-ups. For example, governments such as Australia, U.K., and Taiwan , offer periodic geriatric health examinations as an integral part of their aged care programs. Since clinical care often has a specific problem in mind, at a point in time, only a limited and often small set of measures considered necessary are collected and stored in a persons EHR. By contrast,HERs are collected for regular surveillance and preventive purposes, covering a comprehensive set of general health measures , all collected at a point in time in a systematic way paper proposes a semi-supervised heterogeneous graph-based algorithm called SHG-Health (Semi supervised Heterogeneous Graph on Health) as an evidence-based risk prediction approach to mining longitudinal health examination records. 
To handle heterogeneity, it explores a Heterogeneous graph based on Health Examination Records called Hetero HER graph, where examination items in different categories are modelled as different types of nodes and their temporal relationships may be time-consuming, finding ways of alleviating the labelling costs is critical for our ability to automatically learn such models.Sheng, W Ruan, X Li, S Wang, Z Yang[8]proposes The health risks are calculated using the information from the cause of death (COD) dataset that is linked to the GME dataset. a data mining-based method forprediction of personalhealth index based on annual geriatric medical examination records. Eichelberg M., Aden T., Riesmeier J., Dogac A., Laleci G.[10]propose introduction of electronic health records, boosting the efficiency of medical services at a lower cost, at the same time offering still a vast range of research challenges. In this, The analysis of the documents that were gathered through these terms yielded additional keywords and references to additional document sources. The following keywords or combinations were used: software, quality, certification, Electronic/Personal, Medical/Health Record, HER Standards, EHR certification II. REVIEW OF LITERATURE MF Ghalwash, V Radosavljevic, Z Obradovic[1]proposed an approach , a temporal data mining method is proposed for extracting interpretable patterns from multivariate time series data, which can be used to assist in providing interpretable early diagnosis. The problem is formulated as an optimization based binary classificationtask addressed in three steps. in this classification is often employed as a data exploration step, where summa- rization of the data in a target class using interpretabledistinctfeaturesbecomes the central task. To the best of our knowledge, the problem of extracting interpretable features for early classification on time series. 
Tran, T., Phung, D., Luo, W., Venkatesh, S[2]This constructs a novel ordinal regression framework for predicting medical risk stratification from EMR. First, a IJTSRD22810
conceptual view of EMR as a temporal image is constructed to extract a diverse set of features. Second, ordinal modeling is applied for predicting cumulative or progressive risk.

Mas S. Mohktar, Stephen J. Redmond, and Nick C. Antoniades [3] propose the use of telehealth technologies to remotely monitor patients suffering from chronic diseases, which may enable preemptive treatment. As a means of detecting exacerbation earlier, and at the resolution of a single day, it has been proposed that patients with COPD might use a home telehealth service daily to evaluate their health status. Existing home telehealth services offer a range of vital sign monitoring modalities, including measurements related to the lungs.

Jin-Mao Wei, Shu-Qin Wang, and Xiao-Jie Yuan [4] note that cancer classification is the critical basis for patient-tailored therapy. Conventional histological analysis tends to be unreliable because different tumors may have a similar appearance. Various machine learning methods can be employed to classify cancer tissue samples based on microarray data.

G. J. Simon, Pedro J. Caraballo, Terry M. Therneau, and Steven S. Cha [7] maintain an EMR (Electronic Medical Record) and apply association rule mining to discover sets of risk factors. The work draws on association rules, survival analysis, and association rule summarization.

Yanbing Xue and Milos Hauskrecht [6] observe that learning of classification models in medicine often relies on data labeled by a human expert. Since labeling of clinical data may be time-consuming, finding ways of alleviating the labeling costs is critical for our ability to automatically learn such models.

Sheng, W. Ruan, X. Li, S. Wang, and Z. Yang [8] propose a data mining-based method for prediction of personal health index based on annual geriatric medical examination (GME) records. The health risks are calculated using the information from the cause of death (COD) dataset that is linked to the GME dataset.

Eichelberg M., Aden T., Riesmeier J., Dogac A., and Laleci G. [10] discuss the introduction of electronic health records, which boosts the efficiency of medical services at a lower cost while still offering a vast range of research challenges. In this survey, the analysis of the documents that were gathered through these terms yielded additional keywords and references to additional document sources. The following keywords or combinations were used: software, quality, certification, Electronic/Personal Medical/Health Record, EHR standards, EHR certification.

III. SYSTEM ARCHITECTURE / SYSTEM OVERVIEW
Health risk prediction is necessary for prevention and proper diagnosis before a disease has completely developed. The proposed system uses an efficient and robust classification algorithm based on a live data string. Electronic health records are not well suited to live or current data because they collect records on a yearly basis. The proposed system is therefore used to predict the future risk of the participants from a live data string, for prevention and early diagnosis before the disease has completely developed.

IV. SYSTEM ANALYSIS
The general architecture of the project is described below.

Fig. 1. Overview of the system architecture

Live Data String: Live data, consisting of known and unknown symptoms, is given to the system. On the basis of this data, the future risk of the participants is predicted.

Heterogeneous HER: A graph is used to model data that is sparse. To capture the heterogeneity naturally found in health examination items, we constructed a graph called HeteroHER consisting of multi-type nodes based on health examination records. To address health risk prediction based on health examination records with heterogeneity and large unlabeled data problems, we present a semi-supervised heterogeneous graph-based algorithm called SHG-Health.
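The idea of a graph with multi-type nodes and a small number of labeled participants can be illustrated with a minimal sketch. The code below is not the SHG-Health algorithm; it assumes a simplified bipartite graph (participant nodes linked to typed examination-item nodes) and swaps in plain label propagation with clamped seed labels. The records, item names, class names, and the function names build_graph, propagate, and predict are all hypothetical and chosen only for illustration.

```python
from collections import defaultdict

# Hypothetical toy records: participant -> {exam item: result category}.
RECORDS = {
    "p1": {"glucose": "high", "bp": "high", "bmi": "high"},
    "p2": {"glucose": "high", "bp": "high", "bmi": "normal"},
    "p3": {"glucose": "normal", "bp": "normal", "bmi": "normal"},
    "p4": {"glucose": "normal", "bp": "normal", "bmi": "high"},
}
# Only two participants are labeled; p2 and p4 are unlabeled (assumed labels).
SEED_LABELS = {"p1": "high_risk", "p3": "healthy"}
CLASSES = ("healthy", "high_risk")


def build_graph(records):
    """Link each participant node to typed (item, result) nodes."""
    graph = defaultdict(set)
    for participant, items in records.items():
        for item, result in items.items():
            item_node = (item, result)  # the exam item acts as the node type
            graph[participant].add(item_node)
            graph[item_node].add(participant)
    return dict(graph)


def clamp(scores, seed_labels, classes):
    """Reset labeled nodes to their known class before each update."""
    for node, label in seed_labels.items():
        scores[node] = {c: float(c == label) for c in classes}


def propagate(graph, seed_labels, classes, iterations=50):
    """Plain label propagation: each node takes the mean score of its neighbours."""
    scores = {node: {c: 0.0 for c in classes} for node in graph}
    for _ in range(iterations):
        clamp(scores, seed_labels, classes)
        scores = {
            node: {
                c: sum(scores[n][c] for n in neighbours) / len(neighbours)
                for c in classes
            }
            for node, neighbours in graph.items()
        }
    clamp(scores, seed_labels, classes)
    return scores


def predict(scores, node):
    """Return the class with the highest propagated score for a node."""
    return max(scores[node], key=scores[node].get)


graph = build_graph(RECORDS)
scores = propagate(graph, SEED_LABELS, CLASSES)
for participant in ("p2", "p4"):
    print(participant, predict(scores, participant))
```

In this toy data, p2 shares two of its three item nodes with the labeled high-risk participant p1, so its propagated score leans towards high_risk, while p4 leans towards healthy for the mirror-image reason. A real system would need weighted edges, temporal structure, and a mechanism for cases that belong to none of the known classes, none of which this sketch attempts.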
Semi-Supervised Learning: The third component of our method is a semi-supervised learning algorithm for the constructed HeteroHER graph. The algorithm combines the advantages of class discovery and heterogeneous graph learning to address a specific problem caused by evidence-based risk prediction from health examination records.

Classifier: The system addresses the problem of unlabeled data by applying a semi-supervised approach. This is done by maintaining a graph of known and unknown symptoms. These graphs are given to the classifier, which works with two types of data: training data and testing data. From the training data, the classifier learns which symptoms are present and which disease they indicate; it then gives the prediction of risk.

Result Analysis: In this section, the high-risk diseases are analyzed on the basis of the records obtained from the classifier.

V. ADVANTAGES
1. The SHG-Health algorithm handles a challenging multi-class classification problem with substantial unlabeled cases which may or may not belong to the known classes. This work pioneers risk prediction based on health examination records in the presence of large unlabeled data.
2. A novel graph extraction mechanism is introduced for handling heterogeneity found in longitudinal health examination records.
3. The proposed graph-based semi-supervised learning algorithm SHG-Health, which combines the advantages of heterogeneous graph learning and class discovery, shows significant performance gain on a large and comprehensive real health examination dataset of participants as well as on synthetic datasets.

ACKNOWLEDGMENT
Inspiration and guidance are invaluable in every aspect of life, especially in the field of education, which I have received from my respected guide Prof. S. A. Bhavsar, who has guided me and given earnest co-operation whenever required. I would like to express my sincere gratitude towards her.
She always provided me with access to the latest technology and facilities, gave encouragement at every point, and took an active part in the achievement of my objective.

VI. CONCLUSION
The proposed system shows data fusion in which health examination records can be integrated with other types of datasets, such as hospital-based electronic health records and participants' living conditions. The SHG algorithm makes use of HeteroHER and semi-supervised learning to find various known and unknown symptoms in the live data given to the system and to predict the future risk.

REFERENCES
[1] M. F. Ghalwash, V. Radosavljevic, and Z. Obradovic, Extraction of interpretable multivariate patterns for early diagnostics, 2013.
[2] T. Tran, D. Phung, W. Luo, and S. Venkatesh, Stabilized sparse ordinal regression for medical risk stratification, Knowledge and Information Systems, Mar. 2014.
[3] M. S. Mohktar, S. J. Redmond, N. C. Antoniades, P. D. Rochford, J. J. Pretto, J. Basilakis, N. H. Lovell, and C. F. McDonald, Predicting the risk of exacerbation in patients with chronic obstructive pulmonary disease using home telehealth measurement data, Artificial Intelligence in Medicine, vol. 63, 2015.
[4] Q. Nguyen, H. Valizadegan, and M. Hauskrecht, Learning classification models with soft-label information, vol. 21, 2014.
[5] G. J. Simon, P. J. Caraballo, T. M. Therneau, S. S. Cha, M. R. Castro, and P. W. Li, Extending Association Rule Summarization Techniques to Assess Risk of Diabetes Mellitus, vol. 27, 2015.
[6] L. Chen, X. Li, S. Wang, H.-Y. Hu, N. Huang, Q. Z. Sheng, and M. Sharaf, Mining Personal Health Index from Annual Geriatric Medical Examinations, 2014.
[7] S. Pan, J. Wu, and X. Zhu, CogBoost: Boosting for Fast Cost-sensitive Graph Classification, vol. 6, 2015.
[8] M. Eichelberg, T. Aden, J. Riesmeier, A. Dogac, and G. B. Laleci, A survey and analysis of Electronic Healthcare Record standards, vol. 37, 2005.
[9] C. Y. Wu, Y. C. Chou, N. Huang, Y. J. Chou, H. Y. Hu, and C. P. Li, Cognitive impairment assessed at annual geriatric health examinations predicts mortality among the elderly, vol. 67, 2014.
[10] L. Krogsbøll, K. Jørgensen, C. Grønhøj Larsen, and P. Gøtzsche, General health checks in adults for reducing morbidity and mortality from disease (Review), no. 10, 2012.
[11] J. Kim and H. Shin, Breast cancer survivability prediction using labeled, unlabeled, and pseudo-labeled patient data, vol. 20, 2013.