This document discusses using Neo4j's graph data science capabilities to classify diabetes patients based on transcriptomics data and connect patients to knowledge graphs. It demonstrates loading patient data containing transcripts measured for each patient, generating graph embeddings to represent the data while preserving topology, training a node classification model to predict diabetes status, and performing community detection for subphenotyping. The goal is to transform heterogeneous clinical data into an integrated graph representation to power machine learning and connect insights to existing domain knowledge.