This document presents a graph-based approach for mining health examination records to predict future health risks. It proposes a semi-supervised heterogeneous graph (SHG-Health) algorithm to handle classification with large amounts of unlabeled data. The SHG-Health algorithm constructs a graph (HeteroHER) from health examination records, where different item types are modeled as different node types. It then applies semi-supervised learning to classify nodes and predict risks. The authors evaluate the approach on real and synthetic health examination datasets, showing it can effectively predict risks from live data streams and handle heterogeneous and unlabeled data.
Related topics: