Welcome!
bruno.ungermann@neo4j.com
Neo4j Graphday: Health & Life Sciences
9.00- 9:30 Breakfast & Networking
9.30- 12.30 Presentations
Introduction to Graph Databases and Neo4j
Bruno Ungermann, Neo4j
The Germany Centre of Diabetes Research Greatly Improves Research Capabilities with Graph Technology
Dr. Alexander Jarasch, Deutsches Zentrum für Diabetesforschung
Big Data in Genomics: How Neo4j enables personalized therapies
Dr. Martin Preusse, Knowing Health
Neo4j Bloom – Visualization & Analysis for Everyone
Michael Hunger, Neo4j
12.30 Lunch Break
How to Make your Graph Project a Success with Neo4j
Stefan Kolmar, Neo4j
Workshop: New Possibilities in Health & Life Sciences with Graphs
Michael Hunger, Dr. Martin Preusse
15.30 – Coffee & Open Discussion
Agenda Health & Life Sciences
Complexity
Connectedness
Bootcamp
Domain Model Logistics Process
Traditional Approach: Fixed Schema, Tables
Graph Model: Nodes & Relationships
Containe
r
Load
USING ROUTE
Depart 2014-04-15
Arrive 2014-04-28
USING_CARRIER
Vessel
Physical
Container
Shipment Carrier
Emission
Class A
Shipment:
ID 256787
Carrier:
DHL
Route
10520km
Route:
823km
Fueling
Max Wgt
80
Type Gas
B
Town:
Tokyo
Town:
Hong
Kong
Town:
Hamburg
Container
LoadContainer
LoadContainer
Load
Parcel
Weight
15.5kg
Container
Load
Intuitiveness
Flexibility: no fixed schema
Flexibility & Agility
“We found Neo4j to be literally thousands of times
faster than our prior MySQL solution, with queries
that require 10-100 times less code. Today, Neo4j
provides eBay with functionality that was previously
impossible.” - Volker Pacher, Senior Developer
“Minutes to milliseconds” performance
Queries up to 1000x faster than other tested database types
Speed
Graph Based Success
Neo4j - The Graph Company
500+
7/10
12/25
8/10
53K+
100+
250+
450+
Adoption
Top Retail Firms
Top Financial Firms
Top Software Vendors
Customers Partners
• Creator of the Neo4j Graph Platform
• ~250 employees
• HQ in Silicon Valley, other offices include
London, Munich, Paris and Malmö
(Sweden)
• $160M in funding from Morgan Stanley,
Fidelity, Sunstone, Conor, Creandum, and
Greenbridge Capital
• Over 10M+ downloads,
• 250+ enterprise subscription customers
with over half with >$1B in revenue
Ecosystem
Startups in program
Enterprise customers
Partners
Meet up members
Events per year
Industry’s Largest Dedicated Investment in Graphs
15
• Record “Cyber Monday” sales
• About 35M daily transactions
• Each transaction is 3-22 hops
• Queries executed in 4ms or less
• Replaced IBM Websphere commerce
• 300M pricing operations per day
• 10x transaction throughput on half the
hardware compared to Oracle
• Replaced Oracle database
• Large postal service with over 500k
employees
• Neo4j routes 10M+ packages daily at peak,
with peaks of 5,000+ routing operations per
second.
Handling Large Graph Work Loads for Enterprises
Real-time promotion
recommendations
Marriott’s Real-time
Pricing Engine
Handling Package
Routing in Real-Time
Discrete Data
Minimally
connected data
Neo4j is designed for data relationships
Other NoSQL
Relational
DBMS
Neo4j Graph DB
Connected Data
Focused on
Data Relationships
Development Benefits
Easy model maintenance
Easy query
Deployment Benefits
Ultra high performance
Minimal resource usage
Use the Right Database for the Right Job
How Neo4j Fits — Common Architecture Patterns
From Disparate Silos
To Cross-Silo Connections
From Tabular Data
To Connected Data
From Data Lake Analytics
to Real-Time Operations
18
Common Graph Technology Use Cases
Network & IT Operations
Application Management
Meta Data
Management
Real-Time
Recommendations
Identity & Access
Management, Security
Knowledge
Management
Fraud Detection, AML
Compliance, GDPR
19
Biological and Medical Knowledge in heterogeneous networks
20
Biological and Medical Knowledge in heterogeneous networks
21
22
Medical Research
Background
• Italian research center that analyzes cancer
samples from around the world
• Provides state-of-the-art therapeutic and
diagnostic cancer services
Business Problem
• Develop a tool that provides cancer data
insights, tracks workflows and is available to
external researchers
• Relational databases didn’t provide adequate
flexibility
Solution and Benefits
• Easily find complex research data relationships
• Develop complex semantics for genomic
knowledge
• Cancer research is accessible to external
scientists
23
Pharmaceutical Research
Business Problem
• Seeking to automate phenotype, compound
and protein cell behaviour research by using
previously documented research more
effectively
• Text mining for research elements like DNA
strings, proteins, RNA, chemicals and diseases
Solution and Benefits
• Found ways to identify compound interaction
behaviour from millions of rearch documents
• Relations between biological entities can be
identified and validated by biological experts
• Still very challenging to keep up to date, add
genomics data, and find a breakthrough
Background
• 5 year long drug discovery research
• Parse & Navigate over 25 Million scientific papers
• Sourced from National Library of Medicine and
tagging of “Medical Subject Headers” (MeSH tags)
24
Agriculture
Background
• One of the world’s largest agribusinesses
• Founded in 1901 and based in St. Louis
• Grew from pioneer to leader in genetically
modifying plants and building related businesses
• Among the first companies to genetically modify
a plant cell (1983)
Business Problem
• Although the data volume was not huge, (200
GB, 800 Mln nodes, Bln relationships) queries
from connected data sets using traditional
technology ran for long durations. In some
cases, Monsanto had to stop them
• Shorten new product development pipeline by
one year through “yield testing in the lab”
• Efficiently impute genotypes of newly bred
populations from analysis of decades of genetic
ancestry data
25
Large Chemical Company: R&D Knowledge Solution
Background
• Provide new ways to search and interact with
internal R&D Knowledge and published scientific
information, highly connected at fact level to
make knowledge actionable
• Thousands of employees in R&D
• Chemicals, Reactions Biologicals, physical-
chemical properties
Company
• 10.000+ employees in R&D
• 70+ R&D locations
• 800 new patents
• 3.000 R&D projects
• 2 Bln R&D budget
26
Large Pharmaceutical Company: Enterprise Search
Background
• Personalized Search for 100.000+ employees
• 300.000.000 docs, pptx, pdf, html
• 1 Mln products
• 130.000 projects
• Sources Exchange, Sharepoint, Office 365,
Oracle, Hana, Blogs, Active Directory …..
Background
• 150.000+ employees, 300 locations
White Board Session

More Related Content

PDF
Neo4j GraphDay Munich - Improve Health Research
PDF
Neo4j for Discovering Drugs and Biomarkers
PPTX
Elastic as a Fundamental Core to Pfizer’s Scientific Data Cloud
PDF
Building a Knowledge Graph with Spark and NLP: How We Recommend Novel Drugs t...
PPTX
Anaconda Data Science Collaboration
PDF
ELSS use cases and strategy
PPTX
Cloud-native Enterprise Data Science Teams
PPTX
Beyond the Science Gateway
Neo4j GraphDay Munich - Improve Health Research
Neo4j for Discovering Drugs and Biomarkers
Elastic as a Fundamental Core to Pfizer’s Scientific Data Cloud
Building a Knowledge Graph with Spark and NLP: How We Recommend Novel Drugs t...
Anaconda Data Science Collaboration
ELSS use cases and strategy
Cloud-native Enterprise Data Science Teams
Beyond the Science Gateway

What's hot (19)

PPTX
BigData Testing by Shreya Pal
PPTX
PerkinElmer Informatics Overview
PPTX
Scott Edmunds: A new publishing workflow for rapid dissemination of genomes u...
PDF
GenRocket Data Sheet
PPTX
Towards Automated AI-guided Drug Discovery Labs
PDF
Science Distributed's Chain Event: Distributed Science Pilot - Lauren Long
PDF
Data Con LA 2018 Keynote - Better Collaborative Data Science by Megan Risdal
PPTX
Data Visibility and Protection at the Scale of Life Sciences
PPTX
Irving-TeraData: data and science driven big industry-nfdp13
PDF
Deep Learning in Security - Examples, Infrastructure, Challenges, and Suggest...
PDF
Data Search in Cloud using the Encrypted Keywords
PDF
Efficient Privacy Preserving Clustering Based Multi Keyword Search
PPTX
Automating the process of continuously prioritising data, updating and deploy...
PPTX
Acquisition, Storage and Management of Research Data in Chemical Sciences: De...
DOCX
A secure and dynamic multi keyword ranked
PDF
Hortonworks Hybrid Cloud - Putting you back in control of your data
PDF
Genomics Applications in the Cloud with the DNAnexus Platform
PPTX
A practical guide to practicing open science
DOCX
Secure Phrase Search for Intelligent Processing of Encrypted Data in Cloud-Ba...
BigData Testing by Shreya Pal
PerkinElmer Informatics Overview
Scott Edmunds: A new publishing workflow for rapid dissemination of genomes u...
GenRocket Data Sheet
Towards Automated AI-guided Drug Discovery Labs
Science Distributed's Chain Event: Distributed Science Pilot - Lauren Long
Data Con LA 2018 Keynote - Better Collaborative Data Science by Megan Risdal
Data Visibility and Protection at the Scale of Life Sciences
Irving-TeraData: data and science driven big industry-nfdp13
Deep Learning in Security - Examples, Infrastructure, Challenges, and Suggest...
Data Search in Cloud using the Encrypted Keywords
Efficient Privacy Preserving Clustering Based Multi Keyword Search
Automating the process of continuously prioritising data, updating and deploy...
Acquisition, Storage and Management of Research Data in Chemical Sciences: De...
A secure and dynamic multi keyword ranked
Hortonworks Hybrid Cloud - Putting you back in control of your data
Genomics Applications in the Cloud with the DNAnexus Platform
A practical guide to practicing open science
Secure Phrase Search for Intelligent Processing of Encrypted Data in Cloud-Ba...
Ad

Similar to Neo4j GraphDay Munich - Life & Health Sciences Intro to Graphs (20)

PPTX
Neo4j GraphTalk Basel - Health & Life Sciences
PPTX
Neue Lösungen für Life Sciences und die Pharmaindustrie mit Graphdatenbanken
PDF
MedChemica BigData What Is That All About?
PPTX
Jisc's new shared data centre
PDF
BigDataAnalytics_Talk_KOCH_FINAL
PPT
Clinical trial data wants to be free: Lessons from the ImmPort Immunology Dat...
PDF
Information Security Forum (ISF) Congress 2013
PPTX
Pistoia alliance debates analytics 15-09-2015 16.00
PDF
Sharing and standards christopher hart - clinical innovation and partnering...
PDF
Data Virtualization Modernizes Biobanking
PPTX
Will Biomedical Research Fundamentally Change in the Era of Big Data?
PPTX
Data Harmonization for a Molecularly Driven Health System
PPTX
The fourth paradigm: data intensive scientific discovery - Jisc Digifest 2016
PPTX
2016 09 cxo forum
PPTX
dkNET Webinar: Creating and Sustaining a FAIR Biomedical Data Ecosystem 10/09...
PPTX
Toward F.A.I.R. Pharma. PhUSE Linked Data Initiatives Past and Present
PDF
Considerations and challenges in building an end to-end microbiome workflow
PDF
Data analytics - May 2016
PPTX
Microsoft: A Waking Giant in Healthcare Analytics and Big Data
PDF
High Performance Computing and the Opportunity with Cognitive Technology
Neo4j GraphTalk Basel - Health & Life Sciences
Neue Lösungen für Life Sciences und die Pharmaindustrie mit Graphdatenbanken
MedChemica BigData What Is That All About?
Jisc's new shared data centre
BigDataAnalytics_Talk_KOCH_FINAL
Clinical trial data wants to be free: Lessons from the ImmPort Immunology Dat...
Information Security Forum (ISF) Congress 2013
Pistoia alliance debates analytics 15-09-2015 16.00
Sharing and standards christopher hart - clinical innovation and partnering...
Data Virtualization Modernizes Biobanking
Will Biomedical Research Fundamentally Change in the Era of Big Data?
Data Harmonization for a Molecularly Driven Health System
The fourth paradigm: data intensive scientific discovery - Jisc Digifest 2016
2016 09 cxo forum
dkNET Webinar: Creating and Sustaining a FAIR Biomedical Data Ecosystem 10/09...
Toward F.A.I.R. Pharma. PhUSE Linked Data Initiatives Past and Present
Considerations and challenges in building an end to-end microbiome workflow
Data analytics - May 2016
Microsoft: A Waking Giant in Healthcare Analytics and Big Data
High Performance Computing and the Opportunity with Cognitive Technology
Ad

More from Neo4j (20)

PDF
MASTERDECK GRAPHSUMMIT SYDNEY (Public).pdf
PDF
Jin Foo - Prospa GraphSummit Sydney Presentation.pdf
PDF
GraphSummit Singapore Master Deck - May 20, 2025
PPTX
Graphs & GraphRAG - Essential Ingredients for GenAI
PPTX
Neo4j Knowledge for Customer Experience.pptx
PPTX
GraphTalk New Zealand - The Art of The Possible.pptx
PDF
Neo4j: The Art of the Possible with Graph
PDF
Smarter Knowledge Graphs For Public Sector
PDF
GraphRAG and Knowledge Graphs Exploring AI's Future
PDF
Matinée GenAI & GraphRAG Paris - Décembre 24
PDF
ANZ Presentation: GraphSummit Melbourne 2024
PDF
Google Cloud Presentation GraphSummit Melbourne 2024: Building Generative AI ...
PDF
Telstra Presentation GraphSummit Melbourne: Optimising Business Outcomes with...
PDF
Hands-On GraphRAG Workshop: GraphSummit Melbourne 2024
PDF
Démonstration Digital Twin Building Wire Management
PDF
Swiss Life - Les graphes au service de la détection de fraude dans le domaine...
PDF
Démonstration Supply Chain - GraphTalk Paris
PDF
The Art of Possible - GraphTalk Paris Opening Session
PPTX
How Siemens bolstered supply chain resilience with graph-powered AI insights ...
PDF
Knowledge Graphs for AI-Ready Data and Enterprise Deployment - Gartner IT Sym...
MASTERDECK GRAPHSUMMIT SYDNEY (Public).pdf
Jin Foo - Prospa GraphSummit Sydney Presentation.pdf
GraphSummit Singapore Master Deck - May 20, 2025
Graphs & GraphRAG - Essential Ingredients for GenAI
Neo4j Knowledge for Customer Experience.pptx
GraphTalk New Zealand - The Art of The Possible.pptx
Neo4j: The Art of the Possible with Graph
Smarter Knowledge Graphs For Public Sector
GraphRAG and Knowledge Graphs Exploring AI's Future
Matinée GenAI & GraphRAG Paris - Décembre 24
ANZ Presentation: GraphSummit Melbourne 2024
Google Cloud Presentation GraphSummit Melbourne 2024: Building Generative AI ...
Telstra Presentation GraphSummit Melbourne: Optimising Business Outcomes with...
Hands-On GraphRAG Workshop: GraphSummit Melbourne 2024
Démonstration Digital Twin Building Wire Management
Swiss Life - Les graphes au service de la détection de fraude dans le domaine...
Démonstration Supply Chain - GraphTalk Paris
The Art of Possible - GraphTalk Paris Opening Session
How Siemens bolstered supply chain resilience with graph-powered AI insights ...
Knowledge Graphs for AI-Ready Data and Enterprise Deployment - Gartner IT Sym...

Recently uploaded (20)

PPTX
Final SEM Unit 1 for mit wpu at pune .pptx
PDF
Developing a website for English-speaking practice to English as a foreign la...
PDF
Credit Without Borders: AI and Financial Inclusion in Bangladesh
PDF
STKI Israel Market Study 2025 version august
PDF
A contest of sentiment analysis: k-nearest neighbor versus neural network
PPTX
MicrosoftCybserSecurityReferenceArchitecture-April-2025.pptx
PPT
Module 1.ppt Iot fundamentals and Architecture
PDF
Hybrid horned lizard optimization algorithm-aquila optimizer for DC motor
PDF
From MVP to Full-Scale Product A Startup’s Software Journey.pdf
PPTX
Microsoft Excel 365/2024 Beginner's training
PPTX
Custom Battery Pack Design Considerations for Performance and Safety
PDF
sbt 2.0: go big (Scala Days 2025 edition)
PDF
Convolutional neural network based encoder-decoder for efficient real-time ob...
PDF
1 - Historical Antecedents, Social Consideration.pdf
PDF
Flame analysis and combustion estimation using large language and vision assi...
PDF
Getting started with AI Agents and Multi-Agent Systems
PDF
Taming the Chaos: How to Turn Unstructured Data into Decisions
PPTX
Benefits of Physical activity for teenagers.pptx
DOCX
search engine optimization ppt fir known well about this
PDF
Enhancing emotion recognition model for a student engagement use case through...
Final SEM Unit 1 for mit wpu at pune .pptx
Developing a website for English-speaking practice to English as a foreign la...
Credit Without Borders: AI and Financial Inclusion in Bangladesh
STKI Israel Market Study 2025 version august
A contest of sentiment analysis: k-nearest neighbor versus neural network
MicrosoftCybserSecurityReferenceArchitecture-April-2025.pptx
Module 1.ppt Iot fundamentals and Architecture
Hybrid horned lizard optimization algorithm-aquila optimizer for DC motor
From MVP to Full-Scale Product A Startup’s Software Journey.pdf
Microsoft Excel 365/2024 Beginner's training
Custom Battery Pack Design Considerations for Performance and Safety
sbt 2.0: go big (Scala Days 2025 edition)
Convolutional neural network based encoder-decoder for efficient real-time ob...
1 - Historical Antecedents, Social Consideration.pdf
Flame analysis and combustion estimation using large language and vision assi...
Getting started with AI Agents and Multi-Agent Systems
Taming the Chaos: How to Turn Unstructured Data into Decisions
Benefits of Physical activity for teenagers.pptx
search engine optimization ppt fir known well about this
Enhancing emotion recognition model for a student engagement use case through...

Neo4j GraphDay Munich - Life & Health Sciences Intro to Graphs

  • 2. 9.00- 9:30 Breakfast & Networking 9.30- 12.30 Presentations Introduction to Graph Databases and Neo4j Bruno Ungermann, Neo4j The Germany Centre of Diabetes Research Greatly Improves Research Capabilities with Graph Technology Dr. Alexander Jarasch, Deutsches Zentrum für Diabetesforschung Big Data in Genomics: How Neo4j enables personalized therapies Dr. Martin Preusse, Knowing Health Neo4j Bloom – Visualization & Analysis for Everyone Michael Hunger, Neo4j 12.30 Lunch Break How to Make your Graph Project a Success with Neo4j Stefan Kolmar, Neo4j Workshop: New Possibilities in Health & Life Sciences with Graphs Michael Hunger, Dr. Martin Preusse 15.30 – Coffee & Open Discussion Agenda Health & Life Sciences
  • 8. Graph Model: Nodes & Relationships Containe r Load USING ROUTE Depart 2014-04-15 Arrive 2014-04-28 USING_CARRIER Vessel Physical Container Shipment Carrier Emission Class A Shipment: ID 256787 Carrier: DHL Route 10520km Route: 823km Fueling Max Wgt 80 Type Gas B Town: Tokyo Town: Hong Kong Town: Hamburg Container LoadContainer LoadContainer Load Parcel Weight 15.5kg Container Load
  • 12. “We found Neo4j to be literally thousands of times faster than our prior MySQL solution, with queries that require 10-100 times less code. Today, Neo4j provides eBay with functionality that was previously impossible.” - Volker Pacher, Senior Developer “Minutes to milliseconds” performance Queries up to 1000x faster than other tested database types Speed
  • 14. Neo4j - The Graph Company 500+ 7/10 12/25 8/10 53K+ 100+ 250+ 450+ Adoption Top Retail Firms Top Financial Firms Top Software Vendors Customers Partners • Creator of the Neo4j Graph Platform • ~250 employees • HQ in Silicon Valley, other offices include London, Munich, Paris and Malmö (Sweden) • $160M in funding from Morgan Stanley, Fidelity, Sunstone, Conor, Creandum, and Greenbridge Capital • Over 10M+ downloads, • 250+ enterprise subscription customers with over half with >$1B in revenue Ecosystem Startups in program Enterprise customers Partners Meet up members Events per year Industry’s Largest Dedicated Investment in Graphs
  • 15. 15 • Record “Cyber Monday” sales • About 35M daily transactions • Each transaction is 3-22 hops • Queries executed in 4ms or less • Replaced IBM Websphere commerce • 300M pricing operations per day • 10x transaction throughput on half the hardware compared to Oracle • Replaced Oracle database • Large postal service with over 500k employees • Neo4j routes 10M+ packages daily at peak, with peaks of 5,000+ routing operations per second. Handling Large Graph Work Loads for Enterprises Real-time promotion recommendations Marriott’s Real-time Pricing Engine Handling Package Routing in Real-Time
  • 16. Discrete Data Minimally connected data Neo4j is designed for data relationships Other NoSQL Relational DBMS Neo4j Graph DB Connected Data Focused on Data Relationships Development Benefits Easy model maintenance Easy query Deployment Benefits Ultra high performance Minimal resource usage Use the Right Database for the Right Job
  • 17. How Neo4j Fits — Common Architecture Patterns From Disparate Silos To Cross-Silo Connections From Tabular Data To Connected Data From Data Lake Analytics to Real-Time Operations
  • 18. 18 Common Graph Technology Use Cases Network & IT Operations Application Management Meta Data Management Real-Time Recommendations Identity & Access Management, Security Knowledge Management Fraud Detection, AML Compliance, GDPR
  • 19. 19 Biological and Medical Knowledge in heterogeneous networks
  • 20. 20 Biological and Medical Knowledge in heterogeneous networks
  • 21. 21
  • 22. 22 Medical Research Background • Italian research center that analyzes cancer samples from around the world • Provides state-of-the-art therapeutic and diagnostic cancer services Business Problem • Develop a tool that provides cancer data insights, tracks workflows and is available to external researchers • Relational databases didn’t provide adequate flexibility Solution and Benefits • Easily find complex research data relationships • Develop complex semantics for genomic knowledge • Cancer research is accessible to external scientists
  • 23. 23 Pharmaceutical Research Business Problem • Seeking to automate phenotype, compound and protein cell behaviour research by using previously documented research more effectively • Text mining for research elements like DNA strings, proteins, RNA, chemicals and diseases Solution and Benefits • Found ways to identify compound interaction behaviour from millions of rearch documents • Relations between biological entities can be identified and validated by biological experts • Still very challenging to keep up to date, add genomics data, and find a breakthrough Background • 5 year long drug discovery research • Parse & Navigate over 25 Million scientific papers • Sourced from National Library of Medicine and tagging of “Medical Subject Headers” (MeSH tags)
  • 24. 24 Agriculture Background • One of the world’s largest agribusinesses • Founded in 1901 and based in St. Louis • Grew from pioneer to leader in genetically modifying plants and building related businesses • Among the first companies to genetically modify a plant cell (1983) Business Problem • Although the data volume was not huge, (200 GB, 800 Mln nodes, Bln relationships) queries from connected data sets using traditional technology ran for long durations. In some cases, Monsanto had to stop them • Shorten new product development pipeline by one year through “yield testing in the lab” • Efficiently impute genotypes of newly bred populations from analysis of decades of genetic ancestry data
  • 25. 25 Large Chemical Company: R&D Knowledge Solution Background • Provide new ways to search and interact with internal R&D Knowledge and published scientific information, highly connected at fact level to make knowledge actionable • Thousands of employees in R&D • Chemicals, Reactions Biologicals, physical- chemical properties Company • 10.000+ employees in R&D • 70+ R&D locations • 800 new patents • 3.000 R&D projects • 2 Bln R&D budget
  • 26. 26 Large Pharmaceutical Company: Enterprise Search Background • Personalized Search for 100.000+ employees • 300.000.000 docs, pptx, pdf, html • 1 Mln products • 130.000 projects • Sources Exchange, Sharepoint, Office 365, Oracle, Hana, Blogs, Active Directory ….. Background • 150.000+ employees, 300 locations