SlideShare a Scribd company logo
Neo4j, Inc. All rights reserved 2021
Neo4j, Inc. All rights reserved 2021
1
Graph Data Science:
Accelerating AI & Machine Learning
Alicia Frame, PhD
Director, Data Science @ Neo4j
Neo4j, Inc. All rights reserved 2021
2
Neo4j: The Connected Data Company
20 of the top 25 financial firms
7 of the top 10 retailers
7 of the top 10 software vendors
Neo4j is the creator of:
• The world’s leading graph database
• The first graph data science platform
• The most flexible graph data model
• The easiest-to-use graph query language
Thousands of Organizations Use Neo4j
Silicon Valley | London | Munich
Paris | Malmö
Neo4j, Inc. All rights reserved 2021
3
Node
Represents an entity in the graph
Relationship
Connect nodes to each other
Property
Describes a node or relationship:
e.g. name, age, weight etc
What’s a graph?
MICA
ANDRE
Name: “Andre”
Born: May 29, 1970
Twitter: “@dan”
Name: “Mica”
Born: Dec 5, 1975
CAR
Brand “Volvo”
Model: “V70”
Since:
Jan 10, 2011
LOVES
LOVES
LOVES
LIVES WITH
O
W
N
S
D
R
I
V
E
S
Neo4j, Inc. All rights reserved 2021
Networks of People Transaction Networks
Bought
B
ou
gh
t
V
i
e
w
e
d
R
e
t
u
r
n
e
d
Bought
Knowledge Networks
Pl
ay
s
Lives_in
In_sport
Likes
F
a
n
_
o
f
Plays_for
Risk management,
Supply chain, Orders,
Payments, etc.
Employees, Customers,
Suppliers, Partners,
Influencers, etc.
Enterprise content,
Domain specific content,
eCommerce content, etc
K
n
o
w
s
Knows
Knows
K
n
o
w
s
4
Everything is Naturally Connected
Neo4j, Inc. All rights reserved 2021
5
Higher Pay and More Promotions
• People Near Structural Holes
• Organizational Misfits
Network Structure is
Highly Predictive
Photo by Helena Lopes on Unsplash
“Organizational Misfits and the Origins of Brokerage in Intrafirm Networks” A. Kleinbaum
“Structural Holes and Good Ideas” R. Burt
Neo4j, Inc. All rights reserved 2021
Consider What Drives Your Business
It’s not the numbers, it’s the relationships behind them
Plants
Warehouses
Suppliers
Distributors
Competitors
Partners
Regulations
Employees
Citizens
Customers
Products
Parts
Services
Regions
Neo4j, Inc. All rights reserved 2021
Neo4j, Inc. All rights reserved 2021
7
Relationships
are the strongest
predictors of behavior
But You Can’t Analyse
What You Can’t See
● Most data science techniques
ignore relationships
● It’s painful to manually engineer
connected features from tabular
data
● Graphs are built on
relationships, so…
● You don’t have to guess at the
correlations: with graphs,
relationships are built in
James Fowler
Neo4j, Inc. All rights reserved 2021
8
8 Top 10 Tech Trends in Data and Analytics, 16 Feb 2021
According to Gartner, “Graphs form
the foundation of modern D&A,
with capabilities to enhance and
improve user collaboration, ML models
and explainable AI.
The recent Gartner AI in Organizations
Survey demonstrates that graph
techniques are increasingly
prevalent as AI maturity grows,
going from 13% adoption when AI
maturity is lowest to 48% when
maturity is highest.”
AI Research Papers
Featuring Graph
Source: Dimensions Knowledge System
4x
Increase in
traffic to
Neo4j GDS
page in
2H-2020
Analytics & Data Science Interest
Exploding in Neo4j Community
100k+
Practicing data
scientists
engaged with
Neo4j
+210k
downloads
Neo4j, Inc. All rights reserved 2021
9
Queries
Find the patterns you know exist.
Machine Learning
Uncover trends and make
predictions
Visualization
Explore, collaborate, and explain
Graphs & Data Science
Analytics
Feature
Engineering
Data
Exploration
Graph
Data
Science
Queries
Machine Learning Visualization
Neo4j, Inc. All rights reserved 2021
10
Graphs & Data Science
Knowledge Graphs
Graph Algorithms
Graph Native
Machine Learning
Find the patterns you’re
looking for in connected data
Use unsupervised machine
learning techniques to
identify associations,
anomalies, and trends.
Use embeddings to learn the
features in your graph that
you don’t even know are
important yet.
Train in-graph supervise ML
models to predict links,
labels, and missing data.
Neo4j, Inc. All rights reserved 2021
Better Predictions with Data You Already Have
● Traditional ML ignores network structure because it’s difficult to extract
● Uncover patterns and trends you can’t find any other way
● Easily generate predictive features to incorporate into ML pipelines
11
Machine Learning Pipeline
Neo4j, Inc. All rights reserved 2021
Neo4j’s Graph Data Science Framework
Neo4j Graph Data
Science Library
Neo4j
Database
Neo4j
Bloom
Scalable Graph Algorithms &
Analytics Workspace
Native Graph Creation &
Persistence
Visual Graph
Exploration & Prototyping
Neo4j, Inc. All rights reserved 2021
Robust Graph Algorithms & ML methods
● Compute metrics about the topology and connectivity
● Build predictive models to enhance your graph
● Highly parallelized and scale to 10’s of billions of nodes
13
The Neo4j GDS Library
Mutable In-Memory
Workspace
Computational Graph
Native Graph Store
Efficient & Flexible Analytics Workspace
● Automatically reshapes transactional graphs into
an in-memory analytics graph
● Optimized for global traversals and aggregation
● Create workflows and layer algorithms
● Store and manage predictive models in the
model catalog
Neo4j, Inc. All rights reserved 2021
Community
Detection
14
Neo4j’s Graph Data Science Library
Unsupervised Graph Algorithms
Clustering
Dimension Reduction
(generalization)
Association
Which parts of my graph are
connected to each other?
Which nodes are most
similar?
How important is each node?
Supervised Machine Learning
Node Classification
Link Prediction
Where will connections
form next?
What’s the label
for this node?
Centrality
Embeddings
Similarity
Pathfinding
More Algos than
any other vendor
ONLY in neo4j
Neo4j, Inc. All rights reserved 2021
15
Unsupervised Machine Learning
Pathfinding &
Search
• Shortest Path
• Single-Source Shortest Path
• All Pairs Shortest Path
• A* Shortest Path
• Yen’s K Shortest Path
• Minimum Weight Spanning Tree
• K-Spanning Tree (MST)
• Random Walk
• Breadth & Depth First Search
Centrality &
Importance
• Degree Centrality
• Closeness Centrality
• Harmonic Centrality
• Betweenness Centrality & Approx.
• PageRank
• Personalized PageRank
• ArticleRank
• Eigenvector Centrality
• Hyperlink Induced Topic Search (HITS)
• Influence Maximization (Greedy, CELF)
Community
Detection
• Triangle Count
• Local Clustering Coefficient
• Connected Components (Union Find)
• Strongly Connected Components
• Label Propagation
• Louvain Modularity
• K-1 Coloring
• Modularity Optimization
• Speaker Listener Label Propagation
Heuristic Link
Prediction
• Adamic Adar
• Common Neighbors
• Preferential Attachment
• Resource Allocations
• Same Community
• Total Neighbors
Similarity
• Node Similarity
• K-Nearest Neighbors (KNN)
• Jaccard Similarity
• Cosine Similarity
• Pearson Similarity
• Euclidean Distance
• Approximate Nearest Neighbors (ANN)
Graph
Embeddings
• Node2Vec
• FastRP
• FastRPExtended
• GraphSAGE
Neo4j, Inc. All rights reserved 2021
Graph Features & Graph Models for Predictions
Traditional ML problems where
relationships between your data points
are important predictive features
16
Predictions influenced by
graph structure
Predictions about
graph structure
Enhance your graph by predicting
missing data or changes to your graph
that will occur in the future
Neo4j, Inc. All rights reserved 2021
17
Neo4j’s In-Graph ML Models
Node
classification:
“What kind of node
is this?”
Link prediction:
“Should there be a
relationship between
these nodes?”
Labeled data: Pairs of nodes
that are either linked or not
Features: Pre-existing
attributes, algorithms
(pageRank), embedding
Neo4j, Inc. All rights reserved 2021
18
The Only Completely In-Graph, ML Workflow
Graph-Native
Feature
Engineering
Train
Predictive Model
Queries
Algorithms
Embeddings
1. Model Type
2. Property
Selection
3. Train & Test
4. Model
Selection
Apply Model to
Existing / New
Data
Use Predictions
for Decisions
Use Predictions
to Enhance
the Graph
Publish & Share
Store Model in
Database
Neo4j, Inc. All rights reserved 2021
What’s most important and
influential in my business?
What’s occurring that’s unusual?
What’s going to happen next?
But traditional
approaches to data make
it impossible to reveal and
effectively use those
connections as data sizes
become large
Predictive signals get lost in
big data noise
19
Graph Data Science Answers the BIG Questions
Connected Data is
Powerful
Graph Data Science uses
Connections to Answer
Critical Questions
Neo4j, Inc. All rights reserved 2021
Neo4j, Inc. All rights reserved 2021
20
Resources
Graph Resources
● Video: Advantages of Graph Technology
● Code: https://github.com/neo4j/graph-data-science/
● Whitepaper: Financial Fraud Detection with Graph Data Science
● Case Study: Meredith Corporation
Neo4j BookShelf
● Graph Databases For Dummies
● Graph Data Science For Dummies
● O’Reilly Graph Algorithms

More Related Content

PDF
Sharepoint metadata workshop
PPTX
Digital globalization: The new era of global flows
PPTX
Introduction-to-Generative-AI.pptx
PPTX
SIEBEL Customer Relationship Management
PDF
Data Driven Assessment.pdf
PDF
2023 brand & digital marketing Trends
PDF
Accenture Tech Vision 2020 - Trend 3
PDF
Power The Retail Energy Customer Experience
Sharepoint metadata workshop
Digital globalization: The new era of global flows
Introduction-to-Generative-AI.pptx
SIEBEL Customer Relationship Management
Data Driven Assessment.pdf
2023 brand & digital marketing Trends
Accenture Tech Vision 2020 - Trend 3
Power The Retail Energy Customer Experience

What's hot (18)

PDF
leewayhertz.com-How to build a generative AI solution From prototyping to pro...
PPTX
Salesforce integration best practices columbus meetup
PPTX
HuggingFace AI - Hugging Face lets users create interactive, in-browser demos...
PDF
GenAI Risks & Security Meetup 01052024.pdf
PDF
Job Skills Report of 2024.pdf
PDF
The essential elements of a digital transformation strategy
PDF
Fintech New York: Partnerships, Platforms and Open Innovation
PDF
AI for Everyone: Demystifying Large Language Models (LLMs) Like ChatGPT
DOCX
Documentation on salesforce communities
PPTX
Technology Vision 2022: Communications Industry | Accenture
PPTX
Public ripple (payment protocol) for blockchain - Anil Nayak
PPTX
Generative AI, WiDS 2023.pptx
PDF
Changing Consumer Motivations & Expectations | Accenture
PDF
CMMI-DEV 1.3 Tool (checklist)
PDF
A comprehensive guide to Salesforce Org Strategy
PDF
APidays Paris 2019 - API-First vs Data Driven Architecture by Jerome Louvel, ...
PDF
GENERATIVE AI, THE FUTURE OF PRODUCTIVITY
PDF
SEJ_StateofSEO2023.pdf
leewayhertz.com-How to build a generative AI solution From prototyping to pro...
Salesforce integration best practices columbus meetup
HuggingFace AI - Hugging Face lets users create interactive, in-browser demos...
GenAI Risks & Security Meetup 01052024.pdf
Job Skills Report of 2024.pdf
The essential elements of a digital transformation strategy
Fintech New York: Partnerships, Platforms and Open Innovation
AI for Everyone: Demystifying Large Language Models (LLMs) Like ChatGPT
Documentation on salesforce communities
Technology Vision 2022: Communications Industry | Accenture
Public ripple (payment protocol) for blockchain - Anil Nayak
Generative AI, WiDS 2023.pptx
Changing Consumer Motivations & Expectations | Accenture
CMMI-DEV 1.3 Tool (checklist)
A comprehensive guide to Salesforce Org Strategy
APidays Paris 2019 - API-First vs Data Driven Architecture by Jerome Louvel, ...
GENERATIVE AI, THE FUTURE OF PRODUCTIVITY
SEJ_StateofSEO2023.pdf
Ad

Similar to Graph Data Science: The Secret to Accelerating Innovation with AI/ML (20)

PDF
Graphs for Data Science and Machine Learning
PPTX
Using Connected Data and Graph Technology to Enhance Machine Learning and Art...
PDF
Graph Data Science with Neo4j: Nordics Webinar
PDF
Relationships Matter: Using Connected Data for Better Machine Learning
PDF
Einstieg in Neo4j Graph Data Science
PDF
Neo4j – The Fastest Path to Scalable Real-Time Analytics
PDF
Workshop Tel Aviv - Graph Data Science
PDF
Introduction to Neo4j
PDF
The Data Platform for Today's Intelligent Applications.pdf
PDF
ntroducing to the Power of Graph Technology
PPTX
AI, Knowledge Representation and Graph Databases -
 Key Trends in Data Science
PDF
Neo4j : la voie du succès avec les bases de données de graphes et la Graph Da...
PDF
Neo4j Graph Data Science - Webinar
PDF
GraphSummit Toronto: Leveraging Graphs for AI and ML
PDF
How Graph Technology is Changing AI
PPTX
Get Started with the Most Advanced Edition Yet of Neo4j Graph Data Science
PDF
3. Relationships Matter: Using Connected Data for Better Machine Learning
PDF
Leveraging Graphs for Better AI
PDF
Knowledge Graphs for Transformation: Dynamic Context for the Intelligent Ente...
PDF
Are You Underestimating the Value Within Your Data? A conversation about grap...
Graphs for Data Science and Machine Learning
Using Connected Data and Graph Technology to Enhance Machine Learning and Art...
Graph Data Science with Neo4j: Nordics Webinar
Relationships Matter: Using Connected Data for Better Machine Learning
Einstieg in Neo4j Graph Data Science
Neo4j – The Fastest Path to Scalable Real-Time Analytics
Workshop Tel Aviv - Graph Data Science
Introduction to Neo4j
The Data Platform for Today's Intelligent Applications.pdf
ntroducing to the Power of Graph Technology
AI, Knowledge Representation and Graph Databases -
 Key Trends in Data Science
Neo4j : la voie du succès avec les bases de données de graphes et la Graph Da...
Neo4j Graph Data Science - Webinar
GraphSummit Toronto: Leveraging Graphs for AI and ML
How Graph Technology is Changing AI
Get Started with the Most Advanced Edition Yet of Neo4j Graph Data Science
3. Relationships Matter: Using Connected Data for Better Machine Learning
Leveraging Graphs for Better AI
Knowledge Graphs for Transformation: Dynamic Context for the Intelligent Ente...
Are You Underestimating the Value Within Your Data? A conversation about grap...
Ad

More from Neo4j (20)

PDF
MASTERDECK GRAPHSUMMIT SYDNEY (Public).pdf
PDF
Jin Foo - Prospa GraphSummit Sydney Presentation.pdf
PDF
GraphSummit Singapore Master Deck - May 20, 2025
PPTX
Graphs & GraphRAG - Essential Ingredients for GenAI
PPTX
Neo4j Knowledge for Customer Experience.pptx
PPTX
GraphTalk New Zealand - The Art of The Possible.pptx
PDF
Neo4j: The Art of the Possible with Graph
PDF
Smarter Knowledge Graphs For Public Sector
PDF
GraphRAG and Knowledge Graphs Exploring AI's Future
PDF
Matinée GenAI & GraphRAG Paris - Décembre 24
PDF
ANZ Presentation: GraphSummit Melbourne 2024
PDF
Google Cloud Presentation GraphSummit Melbourne 2024: Building Generative AI ...
PDF
Telstra Presentation GraphSummit Melbourne: Optimising Business Outcomes with...
PDF
Hands-On GraphRAG Workshop: GraphSummit Melbourne 2024
PDF
Démonstration Digital Twin Building Wire Management
PDF
Swiss Life - Les graphes au service de la détection de fraude dans le domaine...
PDF
Démonstration Supply Chain - GraphTalk Paris
PDF
The Art of Possible - GraphTalk Paris Opening Session
PPTX
How Siemens bolstered supply chain resilience with graph-powered AI insights ...
PDF
Knowledge Graphs for AI-Ready Data and Enterprise Deployment - Gartner IT Sym...
MASTERDECK GRAPHSUMMIT SYDNEY (Public).pdf
Jin Foo - Prospa GraphSummit Sydney Presentation.pdf
GraphSummit Singapore Master Deck - May 20, 2025
Graphs & GraphRAG - Essential Ingredients for GenAI
Neo4j Knowledge for Customer Experience.pptx
GraphTalk New Zealand - The Art of The Possible.pptx
Neo4j: The Art of the Possible with Graph
Smarter Knowledge Graphs For Public Sector
GraphRAG and Knowledge Graphs Exploring AI's Future
Matinée GenAI & GraphRAG Paris - Décembre 24
ANZ Presentation: GraphSummit Melbourne 2024
Google Cloud Presentation GraphSummit Melbourne 2024: Building Generative AI ...
Telstra Presentation GraphSummit Melbourne: Optimising Business Outcomes with...
Hands-On GraphRAG Workshop: GraphSummit Melbourne 2024
Démonstration Digital Twin Building Wire Management
Swiss Life - Les graphes au service de la détection de fraude dans le domaine...
Démonstration Supply Chain - GraphTalk Paris
The Art of Possible - GraphTalk Paris Opening Session
How Siemens bolstered supply chain resilience with graph-powered AI insights ...
Knowledge Graphs for AI-Ready Data and Enterprise Deployment - Gartner IT Sym...

Recently uploaded (20)

PPTX
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PPTX
Programs and apps: productivity, graphics, security and other tools
PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
KodekX | Application Modernization Development
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PDF
Approach and Philosophy of On baking technology
PDF
NewMind AI Weekly Chronicles - August'25 Week I
DOCX
The AUB Centre for AI in Media Proposal.docx
PDF
Empathic Computing: Creating Shared Understanding
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PPTX
Cloud computing and distributed systems.
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
Programs and apps: productivity, graphics, security and other tools
Unlocking AI with Model Context Protocol (MCP)
KodekX | Application Modernization Development
“AI and Expert System Decision Support & Business Intelligence Systems”
Approach and Philosophy of On baking technology
NewMind AI Weekly Chronicles - August'25 Week I
The AUB Centre for AI in Media Proposal.docx
Empathic Computing: Creating Shared Understanding
Per capita expenditure prediction using model stacking based on satellite ima...
Advanced methodologies resolving dimensionality complications for autism neur...
Cloud computing and distributed systems.
Building Integrated photovoltaic BIPV_UPV.pdf
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Diabetes mellitus diagnosis method based random forest with bat algorithm
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Digital-Transformation-Roadmap-for-Companies.pptx
Mobile App Security Testing_ A Comprehensive Guide.pdf

Graph Data Science: The Secret to Accelerating Innovation with AI/ML

  • 1. Neo4j, Inc. All rights reserved 2021 Neo4j, Inc. All rights reserved 2021 1 Graph Data Science: Accelerating AI & Machine Learning Alicia Frame, PhD Director, Data Science @ Neo4j
  • 2. Neo4j, Inc. All rights reserved 2021 2 Neo4j: The Connected Data Company 20 of the top 25 financial firms 7 of the top 10 retailers 7 of the top 10 software vendors Neo4j is the creator of: • The world’s leading graph database • The first graph data science platform • The most flexible graph data model • The easiest-to-use graph query language Thousands of Organizations Use Neo4j Silicon Valley | London | Munich Paris | Malmö
  • 3. Neo4j, Inc. All rights reserved 2021 3 Node Represents an entity in the graph Relationship Connect nodes to each other Property Describes a node or relationship: e.g. name, age, weight etc What’s a graph? MICA ANDRE Name: “Andre” Born: May 29, 1970 Twitter: “@dan” Name: “Mica” Born: Dec 5, 1975 CAR Brand “Volvo” Model: “V70” Since: Jan 10, 2011 LOVES LOVES LOVES LIVES WITH O W N S D R I V E S
  • 4. Neo4j, Inc. All rights reserved 2021 Networks of People Transaction Networks Bought B ou gh t V i e w e d R e t u r n e d Bought Knowledge Networks Pl ay s Lives_in In_sport Likes F a n _ o f Plays_for Risk management, Supply chain, Orders, Payments, etc. Employees, Customers, Suppliers, Partners, Influencers, etc. Enterprise content, Domain specific content, eCommerce content, etc K n o w s Knows Knows K n o w s 4 Everything is Naturally Connected
  • 5. Neo4j, Inc. All rights reserved 2021 5 Higher Pay and More Promotions • People Near Structural Holes • Organizational Misfits Network Structure is Highly Predictive Photo by Helena Lopes on Unsplash “Organizational Misfits and the Origins of Brokerage in Intrafirm Networks” A. Kleinbaum “Structural Holes and Good Ideas” R. Burt
  • 6. Neo4j, Inc. All rights reserved 2021 Consider What Drives Your Business It’s not the numbers, it’s the relationships behind them Plants Warehouses Suppliers Distributors Competitors Partners Regulations Employees Citizens Customers Products Parts Services Regions
  • 7. Neo4j, Inc. All rights reserved 2021 Neo4j, Inc. All rights reserved 2021 7 Relationships are the strongest predictors of behavior But You Can’t Analyse What You Can’t See ● Most data science techniques ignore relationships ● It’s painful to manually engineer connected features from tabular data ● Graphs are built on relationships, so… ● You don’t have to guess at the correlations: with graphs, relationships are built in James Fowler
  • 8. Neo4j, Inc. All rights reserved 2021 8 8 Top 10 Tech Trends in Data and Analytics, 16 Feb 2021 According to Gartner, “Graphs form the foundation of modern D&A, with capabilities to enhance and improve user collaboration, ML models and explainable AI. The recent Gartner AI in Organizations Survey demonstrates that graph techniques are increasingly prevalent as AI maturity grows, going from 13% adoption when AI maturity is lowest to 48% when maturity is highest.” AI Research Papers Featuring Graph Source: Dimensions Knowledge System 4x Increase in traffic to Neo4j GDS page in 2H-2020 Analytics & Data Science Interest Exploding in Neo4j Community 100k+ Practicing data scientists engaged with Neo4j +210k downloads
  • 9. Neo4j, Inc. All rights reserved 2021 9 Queries Find the patterns you know exist. Machine Learning Uncover trends and make predictions Visualization Explore, collaborate, and explain Graphs & Data Science Analytics Feature Engineering Data Exploration Graph Data Science Queries Machine Learning Visualization
  • 10. Neo4j, Inc. All rights reserved 2021 10 Graphs & Data Science Knowledge Graphs Graph Algorithms Graph Native Machine Learning Find the patterns you’re looking for in connected data Use unsupervised machine learning techniques to identify associations, anomalies, and trends. Use embeddings to learn the features in your graph that you don’t even know are important yet. Train in-graph supervise ML models to predict links, labels, and missing data.
  • 11. Neo4j, Inc. All rights reserved 2021 Better Predictions with Data You Already Have ● Traditional ML ignores network structure because it’s difficult to extract ● Uncover patterns and trends you can’t find any other way ● Easily generate predictive features to incorporate into ML pipelines 11 Machine Learning Pipeline
  • 12. Neo4j, Inc. All rights reserved 2021 Neo4j’s Graph Data Science Framework Neo4j Graph Data Science Library Neo4j Database Neo4j Bloom Scalable Graph Algorithms & Analytics Workspace Native Graph Creation & Persistence Visual Graph Exploration & Prototyping
  • 13. Neo4j, Inc. All rights reserved 2021 Robust Graph Algorithms & ML methods ● Compute metrics about the topology and connectivity ● Build predictive models to enhance your graph ● Highly parallelized and scale to 10’s of billions of nodes 13 The Neo4j GDS Library Mutable In-Memory Workspace Computational Graph Native Graph Store Efficient & Flexible Analytics Workspace ● Automatically reshapes transactional graphs into an in-memory analytics graph ● Optimized for global traversals and aggregation ● Create workflows and layer algorithms ● Store and manage predictive models in the model catalog
  • 14. Neo4j, Inc. All rights reserved 2021 Community Detection 14 Neo4j’s Graph Data Science Library Unsupervised Graph Algorithms Clustering Dimension Reduction (generalization) Association Which parts of my graph are connected to each other? Which nodes are most similar? How important is each node? Supervised Machine Learning Node Classification Link Prediction Where will connections form next? What’s the label for this node? Centrality Embeddings Similarity Pathfinding More Algos than any other vendor ONLY in neo4j
  • 15. Neo4j, Inc. All rights reserved 2021 15 Unsupervised Machine Learning Pathfinding & Search • Shortest Path • Single-Source Shortest Path • All Pairs Shortest Path • A* Shortest Path • Yen’s K Shortest Path • Minimum Weight Spanning Tree • K-Spanning Tree (MST) • Random Walk • Breadth & Depth First Search Centrality & Importance • Degree Centrality • Closeness Centrality • Harmonic Centrality • Betweenness Centrality & Approx. • PageRank • Personalized PageRank • ArticleRank • Eigenvector Centrality • Hyperlink Induced Topic Search (HITS) • Influence Maximization (Greedy, CELF) Community Detection • Triangle Count • Local Clustering Coefficient • Connected Components (Union Find) • Strongly Connected Components • Label Propagation • Louvain Modularity • K-1 Coloring • Modularity Optimization • Speaker Listener Label Propagation Heuristic Link Prediction • Adamic Adar • Common Neighbors • Preferential Attachment • Resource Allocations • Same Community • Total Neighbors Similarity • Node Similarity • K-Nearest Neighbors (KNN) • Jaccard Similarity • Cosine Similarity • Pearson Similarity • Euclidean Distance • Approximate Nearest Neighbors (ANN) Graph Embeddings • Node2Vec • FastRP • FastRPExtended • GraphSAGE
  • 16. Neo4j, Inc. All rights reserved 2021 Graph Features & Graph Models for Predictions Traditional ML problems where relationships between your data points are important predictive features 16 Predictions influenced by graph structure Predictions about graph structure Enhance your graph by predicting missing data or changes to your graph that will occur in the future
  • 17. Neo4j, Inc. All rights reserved 2021 17 Neo4j’s In-Graph ML Models Node classification: “What kind of node is this?” Link prediction: “Should there be a relationship between these nodes?” Labeled data: Pairs of nodes that are either linked or not Features: Pre-existing attributes, algorithms (pageRank), embedding
  • 18. Neo4j, Inc. All rights reserved 2021 18 The Only Completely In-Graph, ML Workflow Graph-Native Feature Engineering Train Predictive Model Queries Algorithms Embeddings 1. Model Type 2. Property Selection 3. Train & Test 4. Model Selection Apply Model to Existing / New Data Use Predictions for Decisions Use Predictions to Enhance the Graph Publish & Share Store Model in Database
  • 19. Neo4j, Inc. All rights reserved 2021 What’s most important and influential in my business? What’s occurring that’s unusual? What’s going to happen next? But traditional approaches to data make it impossible to reveal and effectively use those connections as data sizes become large Predictive signals get lost in big data noise 19 Graph Data Science Answers the BIG Questions Connected Data is Powerful Graph Data Science uses Connections to Answer Critical Questions
  • 20. Neo4j, Inc. All rights reserved 2021 Neo4j, Inc. All rights reserved 2021 20 Resources Graph Resources ● Video: Advantages of Graph Technology ● Code: https://github.com/neo4j/graph-data-science/ ● Whitepaper: Financial Fraud Detection with Graph Data Science ● Case Study: Meredith Corporation Neo4j BookShelf ● Graph Databases For Dummies ● Graph Data Science For Dummies ● O’Reilly Graph Algorithms