SlideShare a Scribd company logo
© 2022 Neo4j, Inc. All rights reserved.
© 2022 Neo4j, Inc. All rights reserved.
1
Neo4j Demo
Classify Diabetes Patients
and Connect to Knowledge Graphs
Dr Alexander Jarasch
Field Engineering Specialist Pharma / Healthcare / Biotech
2
Use Cases for the Entire Drug Life Cycle
Target
Discovery
Hit
Generation
Lead
Identification
Lead
Optimization
Animal
Models
Clinical
Trials
FDA/EMA
Review &
Approval
Post
Approval
Manufacturing
F100
F100
Cyber
security
F100
© 2022 Neo4j, Inc. All rights reserved.
© 2022 Neo4j, Inc. All rights reserved.
3
Native Graph Database
The foundation of the Neo4j platform; delivers enterprise-scale and performance,
security, and data integrity for transaction and analytical workloads.
Data Science and Analytics
Explorative tools, rich algorithm library, and Integrated supervised Machine Learning
framework.
Development Tools & Frameworks
Tooling, APIs, query builder, multi-language support for development, admin, modelling,
and rapid prototyping needs.
Discovery & Visualization
Code-free querying, data modeling and exploration tools for data scientists, developers,
and analysts.
Graph Query Language Support
Cypher & openCypher; Ongoing leadership and standards work (GQL) to establish
lingua franca for graphs.
Ecosystem & Integrations
Rich ecosystem of tech and integration partners. Ingestion tools (JDBC, Kafka, Spark, BI
Tools, etc.) for bulk and streaming needs.
Runs Anywhere
Deploy as-a-Service (AuraDB) or self-hosted within your cloud of choice (AWS, GCP,
Azure) via their marketplace, or on-premises.
Neo4j Graph Data Platform
© 2022 Neo4j, Inc. All rights reserved.
4
65+ Graph Algorithms - Out of the Box
Pathfinding & Search Centrality Community Detection
❏ Delta-Stepping Single-Source
❏ Dijkstra’s Single-Source
❏ Dijkstra Source-Target
❏ All Pairs Shortest Path
❏ A* Shortest Path
❏ Yen’s K Shortest Path
❏ Minimum Weight Spanning Tree
❏ Random Walk
❏ Breadth & Depth First Search
❏ Degree Centrality
❏ Closeness Centrality
❏ Harmonic Centrality
❏ Betweenness Centrality & Approx.
❏ PageRank
❏ Personalized PageRank
❏ ArticleRank
❏ Eigenvector Centrality
❏ Hyperlink Induced Topic Search (HITS)
❏ Influence Maximization (Greedy,
CELF)
❏ Weakly Connected Components
❏ Strongly Connected Components
❏ Label Propagation
❏ Leiden
❏ Louvain
❏ K-Means Clustering
❏ K-1 Coloring
❏ Modularity Optimization
❏ Speaker Listener Label Propagation
❏ Approximate Max K-Cut
❏ Triangle Count
❏ Local Clustering Coefficient
❏ Conductance Metric
Heuristic LP Similarity Graph Embeddings
❏ Adamic Adar
❏ Common Neighbors
❏ Preferential Attachment
❏ Resource Allocations
❏ Same Community
❏ Total Neighbors
❏ K-Nearest Neighbors (KNN)
❏ Filtered K-Nearest Neighbors (KNN)
❏ Node Similarity
❏ Filtered Node Similarity
❏ Similarity Functions
❏ Fast Random Projection (FastRP)
❏ Node2Vec
❏ GraphSAGE
❏ Node Classification Pipeline
❏ Link Prediction Pipeline
❏ Node Regression Pipeline
© 2022 Neo4j, Inc. All rights reserved.
5
What We Do in This Demo
• GDS Node Classification Pipeline on patient data
• Loading data
• Data preparation -> Fast Random Projections
• Model training
• Predict T2D / non-T2D based on transcriptomics
• GDS Community Detection
• For Subphenotyping
© 2022 Neo4j, Inc. All rights reserved.
© 2022 Neo4j, Inc. All rights reserved.
From Chaos to
Structure:
Neo4j Graph Data
Science is Changing
How Machine Learning
Gets Done
Graph Embeddings summarize the enhanced
explicit knowledge of a graph
6
© 2022 Neo4j, Inc. All rights reserved.
7
Dataset
© 2022 Neo4j, Inc. All rights reserved.
8
Dataset
• 63 patients (total)
• 51 classified as T2D / non-T2D
• 12 unclassified
• 18k transcripts measured on each patient
© 2022 Neo4j, Inc. All rights reserved.
9
Data Model
Transcript
Patient
:MEASURED
value
:SIMILAR
similarityScore
© 2022 Neo4j, Inc. All rights reserved.
10
Demo Time
© 2022 Neo4j, Inc. All rights reserved.
11
Coming Back to Knowledge Graphs
© 2022 Neo4j, Inc. All rights reserved.
12
Clinical Data from Patients
Gender Age BMI diab.status Sample coll #samples #used
surgery disease Histological diagnosis
Neo4j, Inc. All rights reserved 2022
13
Transform Data with GDS - Fast Random Projections
CALL gds.fastRP.write(
'patients',
{
embeddingDimension: 50,
writeProperty: ‘fastRP-
embedding’
}
)
YIELD nodePropertiesWritten
Neo4j, Inc. All rights reserved 2022
14
Connect Patient Data with Your Knowledge Graph
Performance Increase
20 - 12’000x
© 2022 Neo4j, Inc. All rights reserved.
15
Summary
• Graph Data Science library to perform out of the box machine learning
• Using Graph Embeddings (FastRP) to represent heterogenous data as
vector keeping the topology
• New GDS pipeline for Node Classification
• GDS Community Detection
© 2022 Neo4j, Inc. All rights reserved.
© 2022 Neo4j, Inc. All rights reserved.
16
Thank you!
Contact me
alexander.jarasch@neo4j.com

More Related Content

PDF
Knowledge Graphs and Graph Data Science: More Context, Better Predictions (Ne...
PPTX
Banking Circle: Money Laundering Beware: A Modern Approach to AML with Machin...
PDF
Graph Database 101- What, Why and How?.pdf
PDF
Modeling Cybersecurity with Neo4j, Based on Real-Life Data Insights
PPTX
Neo4j GraphSummit London March 2023 Emil Eifrem Keynote.pptx
PPTX
Volvo Cars - Retrieving Safety Insights using Graphs (GraphSummit Stockholm 2...
PDF
A Knowledge Graph for Reaction & Synthesis Prediction (AstraZeneca)
PPTX
Encrypting and Protecting Your Data in Neo4j(Jeff_Tallman).pptx
Knowledge Graphs and Graph Data Science: More Context, Better Predictions (Ne...
Banking Circle: Money Laundering Beware: A Modern Approach to AML with Machin...
Graph Database 101- What, Why and How?.pdf
Modeling Cybersecurity with Neo4j, Based on Real-Life Data Insights
Neo4j GraphSummit London March 2023 Emil Eifrem Keynote.pptx
Volvo Cars - Retrieving Safety Insights using Graphs (GraphSummit Stockholm 2...
A Knowledge Graph for Reaction & Synthesis Prediction (AstraZeneca)
Encrypting and Protecting Your Data in Neo4j(Jeff_Tallman).pptx

What's hot (20)

PPTX
Smarter Fraud Detection With Graph Data Science
PDF
Knowledge Graphs & Graph Data Science, More Context, Better Predictions - Neo...
PPTX
ENEL Electricity Grids on Neo4j Graph DB
PPTX
SITA WorldTracer - Lost & Found Property
PPTX
How Graph Data Science can turbocharge your Knowledge Graph
PPTX
Demystifying Graph Neural Networks
PPTX
The art of the possible with graph technology_Neo4j GraphSummit Dublin 2023.pptx
PPTX
The Art of the Possible with Graph - Sudhir Hasbe - GraphSummit London 14 Nov...
PDF
Supply Chain Twin Demo - Companion Deck
PDF
Neo4j : Graphes de Connaissance, IA et LLMs
PDF
Neo4j Graph Data Science - Webinar
PDF
Introduction to Neo4j
PPTX
Neo4j GraphSummit London - The Path To Success With Graph Database and Data S...
PDF
Graphs for Finance - AML with Neo4j Graph Data Science
PDF
Danish Business Authority: Explainability and causality in relation to ML Ops
PDF
Knowledge Graphs, Ontologies, and AI Applications
PDF
Building a modern data stack to maintain an efficient and safe electrical grid
PPTX
Neo4j Graph Use Cases, Bruno Ungermann, Neo4j
PPTX
Easily Identify Sources of Supply Chain Gridlock
PDF
The Data Platform for Today’s Intelligent Applications
Smarter Fraud Detection With Graph Data Science
Knowledge Graphs & Graph Data Science, More Context, Better Predictions - Neo...
ENEL Electricity Grids on Neo4j Graph DB
SITA WorldTracer - Lost & Found Property
How Graph Data Science can turbocharge your Knowledge Graph
Demystifying Graph Neural Networks
The art of the possible with graph technology_Neo4j GraphSummit Dublin 2023.pptx
The Art of the Possible with Graph - Sudhir Hasbe - GraphSummit London 14 Nov...
Supply Chain Twin Demo - Companion Deck
Neo4j : Graphes de Connaissance, IA et LLMs
Neo4j Graph Data Science - Webinar
Introduction to Neo4j
Neo4j GraphSummit London - The Path To Success With Graph Database and Data S...
Graphs for Finance - AML with Neo4j Graph Data Science
Danish Business Authority: Explainability and causality in relation to ML Ops
Knowledge Graphs, Ontologies, and AI Applications
Building a modern data stack to maintain an efficient and safe electrical grid
Neo4j Graph Use Cases, Bruno Ungermann, Neo4j
Easily Identify Sources of Supply Chain Gridlock
The Data Platform for Today’s Intelligent Applications
Ad

Similar to Neo4j Demo: Using Knowledge Graphs to Classify Diabetes Patients (GlaxoSmithKline) (20)

PDF
From Target to Product - Accelerating the Drug Lifecycle with Knowledge Graph...
PDF
Workshop Tel Aviv - Graph Data Science
PDF
Relationships Matter: Using Connected Data for Better Machine Learning
PDF
Neo4j for Healthcare & Life Sciences
PDF
The Path To Success With Graph Database and Analytics
PDF
Einstieg in Neo4j Graph Data Science
PDF
GPT and Graph Data Science to power your Knowledge Graph
PDF
The path to success with Graph Database and Graph Data Science
PDF
ntroducing to the Power of Graph Technology
PPTX
Get Started with the Most Advanced Edition Yet of Neo4j Graph Data Science
PDF
Graphs for Data Science and Machine Learning
PPTX
The path to success with graph database and graph data science_ Neo4j GraphSu...
PDF
Graph Data Science with Neo4j: Nordics Webinar
PDF
GraphSummit Toronto: Leveraging Graphs for AI and ML
PDF
Deeper Insights with Graph Data Science
PDF
GraphSummit Toronto: Keynote - Innovating with Graphs
PDF
Graph Machine Learning in Production with Neo4j
PDF
3. Relationships Matter: Using Connected Data for Better Machine Learning
PDF
Graph Data Science: The Secret to Accelerating Innovation with AI/ML
PDF
Workshop - Neo4j Graph Data Science
From Target to Product - Accelerating the Drug Lifecycle with Knowledge Graph...
Workshop Tel Aviv - Graph Data Science
Relationships Matter: Using Connected Data for Better Machine Learning
Neo4j for Healthcare & Life Sciences
The Path To Success With Graph Database and Analytics
Einstieg in Neo4j Graph Data Science
GPT and Graph Data Science to power your Knowledge Graph
The path to success with Graph Database and Graph Data Science
ntroducing to the Power of Graph Technology
Get Started with the Most Advanced Edition Yet of Neo4j Graph Data Science
Graphs for Data Science and Machine Learning
The path to success with graph database and graph data science_ Neo4j GraphSu...
Graph Data Science with Neo4j: Nordics Webinar
GraphSummit Toronto: Leveraging Graphs for AI and ML
Deeper Insights with Graph Data Science
GraphSummit Toronto: Keynote - Innovating with Graphs
Graph Machine Learning in Production with Neo4j
3. Relationships Matter: Using Connected Data for Better Machine Learning
Graph Data Science: The Secret to Accelerating Innovation with AI/ML
Workshop - Neo4j Graph Data Science
Ad

More from Neo4j (20)

PDF
MASTERDECK GRAPHSUMMIT SYDNEY (Public).pdf
PDF
Jin Foo - Prospa GraphSummit Sydney Presentation.pdf
PDF
GraphSummit Singapore Master Deck - May 20, 2025
PPTX
Graphs & GraphRAG - Essential Ingredients for GenAI
PPTX
Neo4j Knowledge for Customer Experience.pptx
PPTX
GraphTalk New Zealand - The Art of The Possible.pptx
PDF
Neo4j: The Art of the Possible with Graph
PDF
Smarter Knowledge Graphs For Public Sector
PDF
GraphRAG and Knowledge Graphs Exploring AI's Future
PDF
Matinée GenAI & GraphRAG Paris - Décembre 24
PDF
ANZ Presentation: GraphSummit Melbourne 2024
PDF
Google Cloud Presentation GraphSummit Melbourne 2024: Building Generative AI ...
PDF
Telstra Presentation GraphSummit Melbourne: Optimising Business Outcomes with...
PDF
Hands-On GraphRAG Workshop: GraphSummit Melbourne 2024
PDF
Démonstration Digital Twin Building Wire Management
PDF
Swiss Life - Les graphes au service de la détection de fraude dans le domaine...
PDF
Démonstration Supply Chain - GraphTalk Paris
PDF
The Art of Possible - GraphTalk Paris Opening Session
PPTX
How Siemens bolstered supply chain resilience with graph-powered AI insights ...
PDF
Knowledge Graphs for AI-Ready Data and Enterprise Deployment - Gartner IT Sym...
MASTERDECK GRAPHSUMMIT SYDNEY (Public).pdf
Jin Foo - Prospa GraphSummit Sydney Presentation.pdf
GraphSummit Singapore Master Deck - May 20, 2025
Graphs & GraphRAG - Essential Ingredients for GenAI
Neo4j Knowledge for Customer Experience.pptx
GraphTalk New Zealand - The Art of The Possible.pptx
Neo4j: The Art of the Possible with Graph
Smarter Knowledge Graphs For Public Sector
GraphRAG and Knowledge Graphs Exploring AI's Future
Matinée GenAI & GraphRAG Paris - Décembre 24
ANZ Presentation: GraphSummit Melbourne 2024
Google Cloud Presentation GraphSummit Melbourne 2024: Building Generative AI ...
Telstra Presentation GraphSummit Melbourne: Optimising Business Outcomes with...
Hands-On GraphRAG Workshop: GraphSummit Melbourne 2024
Démonstration Digital Twin Building Wire Management
Swiss Life - Les graphes au service de la détection de fraude dans le domaine...
Démonstration Supply Chain - GraphTalk Paris
The Art of Possible - GraphTalk Paris Opening Session
How Siemens bolstered supply chain resilience with graph-powered AI insights ...
Knowledge Graphs for AI-Ready Data and Enterprise Deployment - Gartner IT Sym...

Recently uploaded (20)

PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PDF
CIFDAQ's Market Insight: SEC Turns Pro Crypto
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PDF
Encapsulation theory and applications.pdf
PDF
cuic standard and advanced reporting.pdf
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PDF
KodekX | Application Modernization Development
PDF
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...
PDF
Empathic Computing: Creating Shared Understanding
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PDF
Machine learning based COVID-19 study performance prediction
PPTX
A Presentation on Artificial Intelligence
PDF
Review of recent advances in non-invasive hemoglobin estimation
“AI and Expert System Decision Support & Business Intelligence Systems”
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
Advanced methodologies resolving dimensionality complications for autism neur...
Diabetes mellitus diagnosis method based random forest with bat algorithm
CIFDAQ's Market Insight: SEC Turns Pro Crypto
Dropbox Q2 2025 Financial Results & Investor Presentation
Encapsulation theory and applications.pdf
cuic standard and advanced reporting.pdf
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
KodekX | Application Modernization Development
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...
Empathic Computing: Creating Shared Understanding
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Chapter 3 Spatial Domain Image Processing.pdf
Machine learning based COVID-19 study performance prediction
A Presentation on Artificial Intelligence
Review of recent advances in non-invasive hemoglobin estimation

Neo4j Demo: Using Knowledge Graphs to Classify Diabetes Patients (GlaxoSmithKline)

  • 1. © 2022 Neo4j, Inc. All rights reserved. © 2022 Neo4j, Inc. All rights reserved. 1 Neo4j Demo Classify Diabetes Patients and Connect to Knowledge Graphs Dr Alexander Jarasch Field Engineering Specialist Pharma / Healthcare / Biotech
  • 2. 2 Use Cases for the Entire Drug Life Cycle Target Discovery Hit Generation Lead Identification Lead Optimization Animal Models Clinical Trials FDA/EMA Review & Approval Post Approval Manufacturing F100 F100 Cyber security F100
  • 3. © 2022 Neo4j, Inc. All rights reserved. © 2022 Neo4j, Inc. All rights reserved. 3 Native Graph Database The foundation of the Neo4j platform; delivers enterprise-scale and performance, security, and data integrity for transaction and analytical workloads. Data Science and Analytics Explorative tools, rich algorithm library, and Integrated supervised Machine Learning framework. Development Tools & Frameworks Tooling, APIs, query builder, multi-language support for development, admin, modelling, and rapid prototyping needs. Discovery & Visualization Code-free querying, data modeling and exploration tools for data scientists, developers, and analysts. Graph Query Language Support Cypher & openCypher; Ongoing leadership and standards work (GQL) to establish lingua franca for graphs. Ecosystem & Integrations Rich ecosystem of tech and integration partners. Ingestion tools (JDBC, Kafka, Spark, BI Tools, etc.) for bulk and streaming needs. Runs Anywhere Deploy as-a-Service (AuraDB) or self-hosted within your cloud of choice (AWS, GCP, Azure) via their marketplace, or on-premises. Neo4j Graph Data Platform
  • 4. © 2022 Neo4j, Inc. All rights reserved. 4 65+ Graph Algorithms - Out of the Box Pathfinding & Search Centrality Community Detection ❏ Delta-Stepping Single-Source ❏ Dijkstra’s Single-Source ❏ Dijkstra Source-Target ❏ All Pairs Shortest Path ❏ A* Shortest Path ❏ Yen’s K Shortest Path ❏ Minimum Weight Spanning Tree ❏ Random Walk ❏ Breadth & Depth First Search ❏ Degree Centrality ❏ Closeness Centrality ❏ Harmonic Centrality ❏ Betweenness Centrality & Approx. ❏ PageRank ❏ Personalized PageRank ❏ ArticleRank ❏ Eigenvector Centrality ❏ Hyperlink Induced Topic Search (HITS) ❏ Influence Maximization (Greedy, CELF) ❏ Weakly Connected Components ❏ Strongly Connected Components ❏ Label Propagation ❏ Leiden ❏ Louvain ❏ K-Means Clustering ❏ K-1 Coloring ❏ Modularity Optimization ❏ Speaker Listener Label Propagation ❏ Approximate Max K-Cut ❏ Triangle Count ❏ Local Clustering Coefficient ❏ Conductance Metric Heuristic LP Similarity Graph Embeddings ❏ Adamic Adar ❏ Common Neighbors ❏ Preferential Attachment ❏ Resource Allocations ❏ Same Community ❏ Total Neighbors ❏ K-Nearest Neighbors (KNN) ❏ Filtered K-Nearest Neighbors (KNN) ❏ Node Similarity ❏ Filtered Node Similarity ❏ Similarity Functions ❏ Fast Random Projection (FastRP) ❏ Node2Vec ❏ GraphSAGE ❏ Node Classification Pipeline ❏ Link Prediction Pipeline ❏ Node Regression Pipeline
  • 5. © 2022 Neo4j, Inc. All rights reserved. 5 What We Do in This Demo • GDS Node Classification Pipeline on patient data • Loading data • Data preparation -> Fast Random Projections • Model training • Predict T2D / non-T2D based on transcriptomics • GDS Community Detection • For Subphenotyping
  • 6. © 2022 Neo4j, Inc. All rights reserved. © 2022 Neo4j, Inc. All rights reserved. From Chaos to Structure: Neo4j Graph Data Science is Changing How Machine Learning Gets Done Graph Embeddings summarize the enhanced explicit knowledge of a graph 6
  • 7. © 2022 Neo4j, Inc. All rights reserved. 7 Dataset
  • 8. © 2022 Neo4j, Inc. All rights reserved. 8 Dataset • 63 patients (total) • 51 classified as T2D / non-T2D • 12 unclassified • 18k transcripts measured on each patient
  • 9. © 2022 Neo4j, Inc. All rights reserved. 9 Data Model Transcript Patient :MEASURED value :SIMILAR similarityScore
  • 10. © 2022 Neo4j, Inc. All rights reserved. 10 Demo Time
  • 11. © 2022 Neo4j, Inc. All rights reserved. 11 Coming Back to Knowledge Graphs
  • 12. © 2022 Neo4j, Inc. All rights reserved. 12 Clinical Data from Patients Gender Age BMI diab.status Sample coll #samples #used surgery disease Histological diagnosis
  • 13. Neo4j, Inc. All rights reserved 2022 13 Transform Data with GDS - Fast Random Projections CALL gds.fastRP.write( 'patients', { embeddingDimension: 50, writeProperty: ‘fastRP- embedding’ } ) YIELD nodePropertiesWritten
  • 14. Neo4j, Inc. All rights reserved 2022 14 Connect Patient Data with Your Knowledge Graph Performance Increase 20 - 12’000x
  • 15. © 2022 Neo4j, Inc. All rights reserved. 15 Summary • Graph Data Science library to perform out of the box machine learning • Using Graph Embeddings (FastRP) to represent heterogenous data as vector keeping the topology • New GDS pipeline for Node Classification • GDS Community Detection
  • 16. © 2022 Neo4j, Inc. All rights reserved. © 2022 Neo4j, Inc. All rights reserved. 16 Thank you! Contact me alexander.jarasch@neo4j.com