excella.com | @excellaco
Utilizing Neo4j with
AI Applications
Patrick D. Smith & Brian Rodrigue
Introduction
About Us
Excella has been implementing successful mission-critical IT solutions in
the commercial, federal, and non-profit sectors since 2002. Our experts
specialize in software development, data and analytics, DevOps,
program management, business analysis, and Agile best practices. We
also deliver Agile and Scrum training to corporations, government
agencies, and associations at our headquarters in Arlington, VA.
About Us
Our data team consists of data scientists, data engineers, and data
visualization professionals working across a range of federal and
commercial clients. We have experts with PhDs and Masters from
universities such as Johns Hopkins, Oxford, Harvard and the University
of Chicago with expertise in natural language processing and computer
vision.
Introduction
We’ll introduce three ways to integrate AI into your graph-based
systems:
• Intelligent Retrieval for Graph Ingest
• Graph Embeddings and Intelligent Graph Reasoning
• Graph Knowledge Reinforcement
The MAKO Project
AI-Based Decision Making
Excella’s AI Research &
Development Effort
1. Advancing the field with
active research and
development into the most
innovative AI methods
2. Developing AI-based
solutions
The MAKO Project
“How can it be that
mathematics, being after all a
product of human thought
independent of experience, is so
admirably adapted to the
objects of reality?”
- Einstein
The MAKO Project
Creating Intelligent Agents
DALE: The Deep Answer Learning Engine
• Intelligent Customer Service System to respond to Tier 1
inquiries
• Utilizes custom embedding and memory unit structures
• On test sets: 87% accuracy classifying a tweet as
originating from one of 10 Twitter accounts using 190-dimensional
LieGr vectors, versus 83% with 50-dimensional GloVe vectors
Our Work: LieGr, GeoNN, and More
• Mathematical Structure of Networks: Discovered that
words, trained end-to-end with RNNs on NLP tasks, tend to
naturally embed into a Lie group structure. This connects
the "black box" of neural nets to mathematics that has been
well understood for over a century.
• LieGr: Leveraging words' natural embedding structure, we
created a basic unsupervised word embedding scheme
using special orthogonal Lie groups and the distributional
hypothesis
Our Work: LieGr, GeoNN, and More
• Geodesic Neural Networks (GeoNN): Generates text
without treating words as discrete units and by modeling
sequences of words as geodesic flow (analogue of straight-
line motion) on a Lie group. This permits the use of a
Generative Adversarial Network (GAN) for training.
Sentences are deterministically generated, but the path
along which they are generated can be randomly seeded.
Aiding Graph with AI
Artificial Intelligence applications with deep neural networks can
help advance a variety of graph computational problems:
• node classification
• node clustering
• node retrieval/recommendation
• link prediction
Graph Based AI for Customer
Service
Intelligent Graph Based Knowledge Retrieval
Business Case
• A complicated forms process leads to costly mistakes with
serious repercussions for applicants
• Confusion and anxiety lead to frequent calls to the agency to
check status or ask questions
• Ultimate goal is to reduce the call volume
Technical Response
• Create an AI using deep learning to provide initial customer
service responses
• Graph provides context on forms, supporting documents,
processing times, costs, eligibility, etc.
The Complement
• User research showed that there was additional value in
opening graph access to the end user
• Allow users to plan their journey and understand the options
that fit their situation
• Add Q&A layers to support interaction
The Stack
• Initial PoC using LOAD CSV to start building the knowledge
graph
• React front end to allow user interaction and support agile
development
• Py2Neo for AI interaction
• AWS Cloud, CI pipeline
• Replacing LOAD CSV with a custom front end for maintenance
• Automated uploads of updates to costs and processing times
Core Inference Engine
• Utilizes a combined CNN/RNN structure to extract sentence
meaning
• Accesses additional structured information from Neo4j that is
relevant to the question via a Neural Variational Answer
Model
• Combines output structure from Neo4j NVAM pipeline with
RNN/CNN output in a fully connected layer
• Utilizes a generative network component for answer
generation
Core Inference Engine
[Diagram: Question, LieGr Embeddings, RNN, CNN, Fully Connected Layer, Answer, Neo4j]
Modeling Customer Service Data
• The customer service knowledge base is modeled on
documents
• (Brian)
Embeddings
• Embeddings are low dimensional
vector representations of
unstructured data
• Embeddings store latent
information and structure on the
data
• Generated by predictive models and count-based
(dimensionality reduction) models
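As a rough illustration of the count-based route, the sketch below builds a word co-occurrence matrix and reduces it with a truncated SVD. The toy corpus, the window size, and the function name `count_based_embeddings` are assumptions made for this example and are not taken from the deck.

```python
import numpy as np

def count_based_embeddings(sentences, dim=2, window=1):
    """Build a co-occurrence matrix and reduce it with a truncated SVD."""
    vocab = sorted({w for s in sentences for w in s})
    index = {w: i for i, w in enumerate(vocab)}
    counts = np.zeros((len(vocab), len(vocab)))
    for s in sentences:
        for i, w in enumerate(s):
            lo, hi = max(0, i - window), min(len(s), i + window + 1)
            for j in range(lo, hi):
                if j != i:
                    counts[index[w], index[s[j]]] += 1.0
    # Keep the top `dim` singular directions as the low-dimensional vectors.
    u, singular_values, _ = np.linalg.svd(counts, full_matrices=False)
    return vocab, u[:, :dim] * singular_values[:dim]

# Hypothetical toy corpus, only to exercise the function.
toy_corpus = [["form", "fee", "due"], ["form", "fee", "paid"], ["audit", "grant", "fraud"]]
vocab, vectors = count_based_embeddings(toy_corpus, dim=2)
```

Predictive models reach a similar low-dimensional space by training a network to predict context words rather than by factorizing counts.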
Graph Embeddings
Graph Embeddings help solve the computational efficiency
problem of graph computing by embedding graph structures on
a compact Manifold. Embedding Structures are broken down
into:
• node embedding
• edge embedding
• hybrid embedding
• whole-graph embedding
Graph Embeddings
• Converting graphs to vector spaces makes
computation easier for artificial neural
networks
• It's hard to find meaningful information after
traversing several edges away from a node
• Easier to discover latent information that is
embedded within the data
Graph Embeddings
The learned representations of graph embeddings are useful for
machine learning tasks such as the labeling of the nodes,
regression, and edge prediction
Features extracted with these sequence based graph
embedding procedures can be used for predicting:
• social network users’ missing age
• the category of scientific papers in citation networks
• the function of proteins in protein-protein interaction networks
Graph Embeddings
Besides supervised learning tasks on nodes the extracted
features can be used for:
• graph visualization
• edge prediction
• community detection
• structural role identification
Graph Embeddings
• First big push in modern graph embedding research was DeepWalk by
Perozzi et al., which uses truncated random walks for modeling sequences.
• More involved sequence sampling methods include the use of second order
random walks, the introduction of skips in random walks and branching
processes
• More sophisticated models encode the structural role of nodes, to get a
representation that is in line with the multi level structure of the graph and
consequently to improve the predictive performance on downstream machine
learning tasks
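The truncated random walks mentioned above can be sketched in a few lines. This is a minimal illustration of the sampling step only, assuming an adjacency-list dictionary; the toy graph, parameter values, and function name are hypothetical. In DeepWalk the resulting node sequences are then fed to a skip-gram model, which is not shown here.

```python
import random

def truncated_random_walks(adjacency, walks_per_node=2, walk_length=5, seed=0):
    """Sample fixed-length uniform random walks starting from every node."""
    rng = random.Random(seed)
    walks = []
    for _ in range(walks_per_node):
        nodes = list(adjacency)
        rng.shuffle(nodes)  # visit start nodes in a fresh random order each pass
        for start in nodes:
            walk = [start]
            while len(walk) < walk_length:
                neighbors = adjacency[walk[-1]]
                if not neighbors:
                    break  # dead end: stop the walk early
                walk.append(rng.choice(neighbors))
            walks.append(walk)
    return walks

# Hypothetical toy graph as an adjacency list.
toy_graph = {"a": ["b", "c"], "b": ["a", "c"], "c": ["a", "b", "d"], "d": ["c"]}
walks = truncated_random_walks(toy_graph)
```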
Data Retrieval
First Step in utilizing deep learning for graph is to extract
features:
• Nodes
• Pairs: connections and number of common neighbors
• Groups: existing cluster assignments
Perozzi et al.
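To make the pair-level features concrete, here is a small sketch that records, for every node pair, whether the two nodes are connected and how many neighbors they share. The adjacency-list input and the names used are assumptions for illustration only.

```python
from itertools import combinations

def pair_features(adjacency):
    """For each node pair, record direct connection and common-neighbor count."""
    neighbor_sets = {node: set(ns) for node, ns in adjacency.items()}
    features = {}
    for u, v in combinations(sorted(neighbor_sets), 2):
        features[(u, v)] = {
            "connected": v in neighbor_sets[u],
            "common_neighbors": len(neighbor_sets[u] & neighbor_sets[v]),
        }
    return features

# Hypothetical toy graph as an undirected adjacency list.
toy_graph = {"a": ["b", "c"], "b": ["a", "c"], "c": ["a", "b", "d"], "d": ["c"]}
features = pair_features(toy_graph)
```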
Embedding Structures: GEMSEC
GEMSEC: Embeddings with Clustering
• Graph embedding scheme that learns embeddings
and latent clusters at the same time
• Similar representations for nodes which have similar
sampled neighborhoods
• Probabilistic model on graphs: minimizing the
negative log likelihood of observed neighborhood
samples
Rozemberczki, et al.
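A simplified sketch of an objective in this spirit is shown below: a softmax negative log likelihood over sampled (node, neighbor) pairs plus a penalty on each node's distance to its nearest cluster center. It is not the authors' implementation; the weighting `gamma`, the full softmax normalization, and all names and toy values are assumptions chosen to keep the example short.

```python
import numpy as np

def gemsec_style_loss(embeddings, centers, neighborhood_pairs, gamma=0.1):
    """Neighborhood negative log likelihood plus a nearest-center clustering cost."""
    scores = embeddings @ embeddings.T
    log_norm = np.log(np.exp(scores).sum(axis=1))  # per-node log partition term
    nll = 0.0
    for source, neighbor in neighborhood_pairs:
        # -log softmax probability of observing `neighbor` near `source`
        nll += log_norm[source] - scores[source, neighbor]
    # Distance from every node embedding to every cluster center.
    dists = np.linalg.norm(embeddings[:, None, :] - centers[None, :, :], axis=2)
    clustering_cost = dists.min(axis=1).sum()
    return nll + gamma * clustering_cost

# Hypothetical toy values: 3 nodes, 2 cluster centers, 2 sampled pairs.
toy_embeddings = np.array([[0.0, 1.0], [1.0, 0.0], [1.0, 1.0]])
toy_centers = np.array([[0.0, 1.0], [1.0, 0.5]])
toy_pairs = [(0, 1), (1, 2)]
loss = gemsec_style_loss(toy_embeddings, toy_centers, toy_pairs)
```

Minimizing both terms together is what lets embeddings and clusters be learned at the same time, as the slide describes.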
Embedding Structures: GEMSEC
GEMSEC: Embeddings with Clustering
• Clusters from GEMSEC provide the basis for
information retrieval
• We use the trained GEMSEC model to determine
which cluster an incoming information point falls in
• Once we have the cluster, we utilize a selection
model to determine which information is most
relevant.
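The two-step lookup described above might look roughly like this: assign the query vector to its nearest cluster center, then rank only the nodes in that cluster. Cosine similarity stands in for the selection model here, which is an assumption for illustration; the deck's selection model is a learned one. All names and toy values are hypothetical.

```python
import numpy as np

def retrieve(query_vec, centers, node_vecs, node_clusters, top_k=2):
    """Pick the nearest cluster, then rank that cluster's nodes by cosine similarity."""
    cluster = int(np.argmin(np.linalg.norm(centers - query_vec, axis=1)))
    candidates = [i for i, c in enumerate(node_clusters) if c == cluster]

    def cosine(a, b):
        return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b) + 1e-12))

    ranked = sorted(candidates, key=lambda i: cosine(query_vec, node_vecs[i]), reverse=True)
    return cluster, ranked[:top_k]

# Hypothetical toy data: two clusters with two nodes each.
centers = np.array([[0.0, 0.0], [10.0, 10.0]])
node_vecs = np.array([[0.1, 0.9], [1.0, 0.1], [9.0, 10.0], [10.0, 9.0]])
node_clusters = [0, 0, 1, 1]
result = retrieve(np.array([9.5, 10.5]), centers, node_vecs, node_clusters)
```

Restricting the ranking to one cluster keeps the candidate set small, which is the practical benefit of clustering before selection.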
Variational Inference Answer Selection
• Proposed as an answer selection model for
question answering tasks
• Employs a latent attention mechanism
• Given a question q, it finds a set of answer
sentences associated with q
• The answer set determines the context vector: the words
in the answer sentences that are most prominent for
predicting whether an answer matches the
current question. This enables the model to learn
subtleties inherent in the questions.
Network Operation
• Fully connected layer identifies need for
information, sends a "query" to the information
retrieval model (NASM)
• Query is vectorized via LieGr and sent to
NASM
• NASM utilizes an embedded graph
representation as input
[Diagram: Fully Connected Layer, Answer, Neo4j]
Utilizing Graph for Intelligent
Fraud Detection
AI-Based Graph Reasoning
Our System
Intelligent Ingest
[Pipeline diagram: Documents with Fraud, Intelligent Retrieval, Graph Reasoning, Neo4j]
Unstructured Data to Graph
PDF to Text
• Python PDFMiner: extracts unstructured text
information
Structured Data Extraction
• OCR techniques for extracting tables and
figures from PDF files
• Utilizes Tesseract OCR for recognition
Unstructured Data to Graph
Speech to Text
• Takes in call center audio data as an additive to the PDF to
text pipeline
• Utilizes Google Cloud Speech to Text API
• 96% accuracy on call center audio files
Unstructured Data to Graph
Extracting Entities: Stanford Named Entity Recognizer
• Conditional Random Fields Model: Discriminative sequence
modeling method
• Entities: Person, Location, Organization
• Trained on both British and American newswire, so robust
across both domains
• Optimized with L-BFGS
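A sequence tagger of this kind labels one token at a time, so a small post-processing step is usually needed before entities can become graph nodes. The sketch below merges consecutive tokens that share a tag into one entity. The (token, tag) input format, the function name, and the sample sentence are assumptions for illustration, not the deck's actual pipeline.

```python
def group_entities(tagged_tokens):
    """Merge consecutive tokens with the same non-"O" tag into one entity."""
    entities, current_tokens, current_tag = [], [], "O"
    # A trailing sentinel flushes the last open entity span.
    for token, tag in list(tagged_tokens) + [("", "O")]:
        if tag != current_tag:
            if current_tag != "O":
                entities.append((" ".join(current_tokens), current_tag))
            current_tokens = []
            current_tag = tag
        if tag != "O":
            current_tokens.append(token)
    return entities

# Hypothetical tagger output using the three entity classes listed above.
tagged = [
    ("Aurum", "ORGANIZATION"), ("Institute", "ORGANIZATION"),
    ("is", "O"), ("audited", "O"), ("in", "O"), ("Atlanta", "LOCATION"),
]
entities = group_entities(tagged)
```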
Neural Variational Inference
• NVDM: Generative model for probabilistic
document modeling
• Combines Unsupervised Variational
Autoencoders with generative approaches
• Unlike traditional models, the NVDM
provides a dynamic, variational model of
the text’s distribution
Unstructured Data to Graph
Intelligent Search
• Crawl and extract further structured and unstructured data from websites
using python-based Selenium API (grants.gov, public information
databases, etc.)
• Searches are handled with respect to nodes; nodes are iterated through
and searches pulled on metadata relating to that entity to enrich the
dataset
• Information pulled from and pushed back to nodes is handled in Py2Neo
Graph Reasoning
[Pipeline diagram: Documents with Fraud, Intelligent Retrieval, Graph Reasoning, Neo4j]
Effective Graph Analytics
What type of machine learning can we do on graph?
• node classification
• node clustering
• node retrieval/recommendation
• link prediction
Knowledge Graph Reinforcement
Graph Based Reasoning
Reasoning over large scale knowledge graphs
One option is to use traditional graph algorithms that are
supported in Neo4j:
• PageRank to determine entity importance
• Path-Finding algorithms for relationship modeling
• Label Propagation for group recognition
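For intuition about the first of these algorithms, here is a plain-Python power-iteration sketch of PageRank. In practice the in-database implementation would be used; this version only illustrates the idea. The damping factor, iteration count, and toy graph are assumptions.

```python
def pagerank(out_links, damping=0.85, iterations=50):
    """Power-iteration PageRank over a dict of node -> list of target nodes."""
    nodes = sorted(set(out_links) | {t for ts in out_links.values() for t in ts})
    rank = {n: 1.0 / len(nodes) for n in nodes}
    for _ in range(iterations):
        new_rank = {n: (1.0 - damping) / len(nodes) for n in nodes}
        for node in nodes:
            targets = out_links.get(node, [])
            if targets:
                share = damping * rank[node] / len(targets)
                for t in targets:
                    new_rank[t] += share
            else:
                # Dangling node: spread its rank evenly over all nodes.
                for t in nodes:
                    new_rank[t] += damping * rank[node] / len(nodes)
        rank = new_rank
    return rank

# Hypothetical toy graph shaped like an audit -> organization -> grant -> agency chain.
toy_links = {
    "audit": ["organization"],
    "organization": ["grant"],
    "second_org": ["grant"],
    "grant": ["agency"],
}
ranks = pagerank(toy_links)
```

Entities that many chains lead to, such as a widely managed grant, end up with higher scores than the nodes that merely point at them.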
Graph Based Reasoning
GEMSEC Embeddings
• Pull nodes + relationships from Neo4j to
feed into our embedding pipeline, output
clusters and embeddings
• Shows an 8.79% improvement over
previous deep methods in predicting a
related grouping
• GEMSEC created with GPU-enabled
TensorFlow
Pulling Data for reasoning
CREATE p = (audit:audit { number: 'A-04-17-01003' })
  -[:AUDITS]->(organization:organization { name: 'AURUM INSTITUTE' })
  -[:manages]->(grant:grant { name: 'PEPFAR' })
  -[:ISSUED_BY]->(agency:agency { name: 'CDC' })
RETURN p
For our downstream predictive tasks, we pull nodes and their
immediate relationships, or chains of relationships
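One way such a pull might look from Python is sketched below: a read-only MATCH over the same chain pattern, run through a Py2Neo `Graph` object, with the returned records flattened into an edge list for the embedding pipeline. The query text, the returned column names, and both function names are assumptions for illustration; the connection line is left as a comment with placeholder credentials.

```python
def fetch_audit_chains(graph):
    """Run a read-only MATCH over the audit chain; `graph` is a py2neo Graph."""
    query = (
        "MATCH (a:audit)-[:AUDITS]->(o:organization)-[:manages]->(g:grant)"
        "-[:ISSUED_BY]->(ag:agency) "
        "RETURN a.number AS audit, o.name AS organization, "
        "g.name AS grant, ag.name AS agency"
    )
    return graph.run(query).data()  # list of dicts, one per matched chain

def chains_to_edges(records):
    """Flatten chain records into (source, relationship, target) triples."""
    edges = []
    for r in records:
        edges.append((r["audit"], "AUDITS", r["organization"]))
        edges.append((r["organization"], "manages", r["grant"]))
        edges.append((r["grant"], "ISSUED_BY", r["agency"]))
    return edges

# Against a live database (placeholder URI and credentials):
#   from py2neo import Graph
#   graph = Graph("bolt://localhost:7687", auth=("neo4j", "password"))
#   records = fetch_audit_chains(graph)
# Offline example using the values from the statement above:
example_records = [{
    "audit": "A-04-17-01003", "organization": "AURUM INSTITUTE",
    "grant": "PEPFAR", "agency": "CDC",
}]
edges = chains_to_edges(example_records)
```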
Graph Based Reasoning
Downstream Prediction Tasks:
• Is this fraud or not? The graph and its embeddings hold all of our latent
information
• We utilized a Recurrent Neural Network with a single softmax output layer,
trained end to end, for our fraud detection predictions
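The shape of such a classifier can be sketched as a forward pass: a simple recurrent cell reads a sequence of node embeddings, and a softmax layer turns the final hidden state into two class probabilities. The weights here are random and untrained, and the dimensions, the tanh cell, and the idea of feeding a chain of node embeddings as the sequence are all assumptions for illustration.

```python
import numpy as np

def rnn_fraud_probabilities(sequence, w_in, w_rec, w_out):
    """Run a vanilla RNN over the sequence, then apply a softmax output layer."""
    hidden = np.zeros(w_rec.shape[0])
    for x in sequence:
        hidden = np.tanh(w_in @ x + w_rec @ hidden)
    logits = w_out @ hidden
    exp = np.exp(logits - logits.max())  # subtract max for numerical stability
    return exp / exp.sum()

rng = np.random.default_rng(0)
embedding_dim, hidden_dim = 4, 8  # hypothetical sizes
w_in = rng.normal(scale=0.1, size=(hidden_dim, embedding_dim))
w_rec = rng.normal(scale=0.1, size=(hidden_dim, hidden_dim))
w_out = rng.normal(scale=0.1, size=(2, hidden_dim))  # two classes: fraud / not fraud
chain = rng.normal(size=(3, embedding_dim))  # stand-in for three node embeddings
probs = rnn_fraud_probabilities(chain, w_in, w_rec, w_out)
```

Training end to end would mean backpropagating a cross-entropy loss on `probs` through both the output layer and the recurrent weights.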
Other Examples of Reasoning
Graph embeddings let us harness the power of graph while still using a
standard suite of machine learning and deep learning methods on
downstream tasks:
• Perozzi and Skiena showed in 2015 that we can use graph embeddings for
downstream age prediction in social networks
• Graph Convolutional Networks: Kipf & Welling introduced a structure for
modeling arbitrarily structured graphs
Other Examples of Reasoning
Graph Convolutional Networks: For these models, the goal is
to learn a function of signals/features on a graph which
takes as input:
• A feature description x_i for every node i, summarized in
an N×D feature matrix X (N: number of nodes, D:
number of input features)
• A representative description of the graph structure in matrix
form, typically an adjacency matrix A (or some
function thereof)
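Given those two inputs, a single graph convolutional layer in the style of Kipf & Welling can be sketched as follows: add self-loops to A, normalize symmetrically by node degree, then apply a weight matrix and a ReLU. The toy graph, the identity feature matrix, and the all-ones weights are assumptions chosen only to make the example runnable.

```python
import numpy as np

def gcn_layer(adjacency, features, weights):
    """One propagation step: ReLU(D^-1/2 (A + I) D^-1/2 X W)."""
    a_hat = adjacency + np.eye(adjacency.shape[0])  # add self-loops
    d_inv_sqrt = 1.0 / np.sqrt(a_hat.sum(axis=1))   # inverse square-root degrees
    a_norm = a_hat * d_inv_sqrt[:, None] * d_inv_sqrt[None, :]
    return np.maximum(a_norm @ features @ weights, 0.0)

# Hypothetical 3-node path graph: N = 3 nodes, D = 3 input features, 2 output features.
A = np.array([[0, 1, 0], [1, 0, 1], [0, 1, 0]], dtype=float)
X = np.eye(3)
W = np.ones((3, 2))
H = gcn_layer(A, X, W)
```

Each output row mixes a node's own features with those of its neighbors, so stacking layers spreads information across more hops.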
Other Examples of Reasoning
Graph Convolutional Networks:
• Reduces the complexity of the training procedures
• Powerful, but the learned structures cannot be transferred to
other graphs
Closing
Why Utilize AI Methods for Graph Analysis?
Closing
• Graph Embeddings are a powerful means of utilizing your
graph-based data for deep learning
• Embedding structures can aid in creating ingest, in-graph, and
downstream post-graph predictive tasks
• Still a long way to go: How can we more closely integrate
graph and deep learning?
Patrick D. Smith
patrick.smith@excella.com
Brian Rodrigue
brian.rodrigue@excella.com

More Related Content

PDF
How Graph Technology is Changing AI
PDF
No Sql On Social And Sematic Web
PDF
Nature Inspired Models And The Semantic Web
PDF
Lecture 2.B: Computer Vision Applications - Full Stack Deep Learning - Spring...
PDF
Improving Machine Learning using Graph Algorithms
PPTX
Deep Learning Enabled Question Answering System to Automate Corporate Helpdesk
PPTX
Transfer Learning and Fine Tuning for Cross Domain Image Classification with ...
PDF
Data! Data! Data! I Can't Make Bricks Without Clay!
How Graph Technology is Changing AI
No Sql On Social And Sematic Web
Nature Inspired Models And The Semantic Web
Lecture 2.B: Computer Vision Applications - Full Stack Deep Learning - Spring...
Improving Machine Learning using Graph Algorithms
Deep Learning Enabled Question Answering System to Automate Corporate Helpdesk
Transfer Learning and Fine Tuning for Cross Domain Image Classification with ...
Data! Data! Data! I Can't Make Bricks Without Clay!

What's hot (9)

PDF
Marvin_Capstone
PPTX
Machine learning with graph
PDF
Test PDF
PDF
Advance deep learning
PDF
Learning to learn unlearned feature for segmentation
PDF
Neo4j Graph Data Science Training - June 9 & 10 - Slides #6 Graph Algorithms
PDF
“Explainability in Computer Vision: A Machine Learning Engineer’s Overview,” ...
PDF
Graph-Powered Machine Learning
PPTX
Natural language processing and transformer models
Marvin_Capstone
Machine learning with graph
Test PDF
Advance deep learning
Learning to learn unlearned feature for segmentation
Neo4j Graph Data Science Training - June 9 & 10 - Slides #6 Graph Algorithms
“Explainability in Computer Vision: A Machine Learning Engineer’s Overview,” ...
Graph-Powered Machine Learning
Natural language processing and transformer models
Ad

Similar to Utilizing Neo4j with AI Applications (20)

PDF
Leveraging Graphs for Better AI
PDF
Leveraging Graphs for Better AI
PDF
How Graphs Enhance AI
PDF
3. Relationships Matter: Using Connected Data for Better Machine Learning
PDF
Graph Data Science with Neo4j: Nordics Webinar
PDF
IA Generativa y Grafos de Neo4j: RAG time
PPTX
GraphSummit Milan & Stockholm - Neo4j: The Art of the Possible with Graph
PDF
YouTube Downloader v3.4.9 APK Download
PDF
Wondershare UniConverter for MacOS Download
PDF
Minitab Free crack Download (Latest 2025)
PDF
TunesKit Video Repair 2.0.0.11 Free Download
PDF
Neo4j y GenAI
PDF
Webinar - IA generativa e grafi Neo4j: RAG time!
PDF
Deep learning 1.0 and Beyond, Part 1
PDF
Transforming AI with Graphs: Real World Examples using Spark and Neo4j
PDF
Transforming AI with Graphs: Real World Examples using Spark and Neo4j
PDF
GPT and Graph Data Science to power your Knowledge Graph
PDF
Large Language Models ❤️ Knowledge Graphs - Michael Hunger
PDF
ntroducing to the Power of Graph Technology
PDF
Graph Data Science: The Secret to Accelerating Innovation with AI/ML
Leveraging Graphs for Better AI
Leveraging Graphs for Better AI
How Graphs Enhance AI
3. Relationships Matter: Using Connected Data for Better Machine Learning
Graph Data Science with Neo4j: Nordics Webinar
IA Generativa y Grafos de Neo4j: RAG time
GraphSummit Milan & Stockholm - Neo4j: The Art of the Possible with Graph
YouTube Downloader v3.4.9 APK Download
Wondershare UniConverter for MacOS Download
Minitab Free crack Download (Latest 2025)
TunesKit Video Repair 2.0.0.11 Free Download
Neo4j y GenAI
Webinar - IA generativa e grafi Neo4j: RAG time!
Deep learning 1.0 and Beyond, Part 1
Transforming AI with Graphs: Real World Examples using Spark and Neo4j
Transforming AI with Graphs: Real World Examples using Spark and Neo4j
GPT and Graph Data Science to power your Knowledge Graph
Large Language Models ❤️ Knowledge Graphs - Michael Hunger
ntroducing to the Power of Graph Technology
Graph Data Science: The Secret to Accelerating Innovation with AI/ML
Ad

More from Neo4j (20)

PDF
MASTERDECK GRAPHSUMMIT SYDNEY (Public).pdf
PDF
Jin Foo - Prospa GraphSummit Sydney Presentation.pdf
PDF
GraphSummit Singapore Master Deck - May 20, 2025
PPTX
Graphs & GraphRAG - Essential Ingredients for GenAI
PPTX
Neo4j Knowledge for Customer Experience.pptx
PPTX
GraphTalk New Zealand - The Art of The Possible.pptx
PDF
Neo4j: The Art of the Possible with Graph
PDF
Smarter Knowledge Graphs For Public Sector
PDF
GraphRAG and Knowledge Graphs Exploring AI's Future
PDF
Matinée GenAI & GraphRAG Paris - Décembre 24
PDF
ANZ Presentation: GraphSummit Melbourne 2024
PDF
Google Cloud Presentation GraphSummit Melbourne 2024: Building Generative AI ...
PDF
Telstra Presentation GraphSummit Melbourne: Optimising Business Outcomes with...
PDF
Hands-On GraphRAG Workshop: GraphSummit Melbourne 2024
PDF
Démonstration Digital Twin Building Wire Management
PDF
Swiss Life - Les graphes au service de la détection de fraude dans le domaine...
PDF
Démonstration Supply Chain - GraphTalk Paris
PDF
The Art of Possible - GraphTalk Paris Opening Session
PPTX
How Siemens bolstered supply chain resilience with graph-powered AI insights ...
PDF
Knowledge Graphs for AI-Ready Data and Enterprise Deployment - Gartner IT Sym...
MASTERDECK GRAPHSUMMIT SYDNEY (Public).pdf
Jin Foo - Prospa GraphSummit Sydney Presentation.pdf
GraphSummit Singapore Master Deck - May 20, 2025
Graphs & GraphRAG - Essential Ingredients for GenAI
Neo4j Knowledge for Customer Experience.pptx
GraphTalk New Zealand - The Art of The Possible.pptx
Neo4j: The Art of the Possible with Graph
Smarter Knowledge Graphs For Public Sector
GraphRAG and Knowledge Graphs Exploring AI's Future
Matinée GenAI & GraphRAG Paris - Décembre 24
ANZ Presentation: GraphSummit Melbourne 2024
Google Cloud Presentation GraphSummit Melbourne 2024: Building Generative AI ...
Telstra Presentation GraphSummit Melbourne: Optimising Business Outcomes with...
Hands-On GraphRAG Workshop: GraphSummit Melbourne 2024
Démonstration Digital Twin Building Wire Management
Swiss Life - Les graphes au service de la détection de fraude dans le domaine...
Démonstration Supply Chain - GraphTalk Paris
The Art of Possible - GraphTalk Paris Opening Session
How Siemens bolstered supply chain resilience with graph-powered AI insights ...
Knowledge Graphs for AI-Ready Data and Enterprise Deployment - Gartner IT Sym...

Recently uploaded (20)

PDF
Network Security Unit 5.pdf for BCA BBA.
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PDF
MIND Revenue Release Quarter 2 2025 Press Release
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PPTX
Cloud computing and distributed systems.
PPTX
sap open course for s4hana steps from ECC to s4
PDF
Approach and Philosophy of On baking technology
PDF
Machine learning based COVID-19 study performance prediction
DOCX
The AUB Centre for AI in Media Proposal.docx
PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
PPTX
Programs and apps: productivity, graphics, security and other tools
PPTX
MYSQL Presentation for SQL database connectivity
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PDF
NewMind AI Weekly Chronicles - August'25-Week II
PDF
Encapsulation theory and applications.pdf
PPT
Teaching material agriculture food technology
Network Security Unit 5.pdf for BCA BBA.
Advanced methodologies resolving dimensionality complications for autism neur...
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Diabetes mellitus diagnosis method based random forest with bat algorithm
MIND Revenue Release Quarter 2 2025 Press Release
“AI and Expert System Decision Support & Business Intelligence Systems”
Cloud computing and distributed systems.
sap open course for s4hana steps from ECC to s4
Approach and Philosophy of On baking technology
Machine learning based COVID-19 study performance prediction
The AUB Centre for AI in Media Proposal.docx
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
Programs and apps: productivity, graphics, security and other tools
MYSQL Presentation for SQL database connectivity
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
Per capita expenditure prediction using model stacking based on satellite ima...
NewMind AI Weekly Chronicles - August'25-Week II
Encapsulation theory and applications.pdf
Teaching material agriculture food technology

Utilizing Neo4j with AI Applications

  • 1. excella.com | @excellaco Utilizing Neo4j with AI Applications Patrick D. Smith & Brian Rodrigue
  • 3. excella.com | @excellaco About Us Excella has been implementing successful IT mission critical solutions in the commercial, federal, and non-profit sectors since 2002. Our experts specialize in software development, data and analytics, DevOps, program management, business analysis, and Agile best practices. We also deliver Agile and Scrum training to corporations, government agencies, and associations at our headquarters in Arlington, VA
  • 4. excella.com | @excellaco About Us Our data team consists of data scientists, data engineers, and data visualization professionals working across a range of federal and commercial clients. We have experts with PhDs and Masters from universities such as Johns Hopkins, Oxford, Harvard and the University of Chicago with expertise in natural language processing and computer vision.
  • 5. excella.com | @excellaco Introduction We’ll introduce three ways in integrate AI into your graph- based systems: • Intelligent Retrieval for Graph Ingest • Graph Embeddings and Intelligent Graph Reasoning • Graph Knowledge Reinforcement
  • 6. excella.com | @excellaco The MAKO Project AI – Based Decision Making
  • 7. excella.com | @excellaco Excella’s AI Research & Development Effort 1. Advancing the field with active research and development into the most innovative AI Methods 2. Developing AI based solutions The MAKO Project
  • 8. excella.com | @excellaco “How can it be that mathematics, being after all a product of human thought independent of experience, is so admirably adapted to the objects of reality” - Einstein The MAKO Project
  • 9. excella.com | @excellaco Creating Intelligent Agents DALE: The Deep Answer Learning Engine • Intelligent Customer Service System to respond to Tier 1 inquiries • Utilizes custom embedding and memory unit structures • On test sets: 87% accuracy on classifying a tweet as originating from one of 10 twitter accounts using 190 dim LieRr vectors. 83% with 50 dimensional glove vectors
  • 10. excella.com | @excellaco Our Work: LieGr, GeoNN, and More • Mathematical Structure of Networks: Discovered that words, trained end-to-end with RNNs on NLP tasks, tend to naturally embed into a Lie group structure. This connects the "black box" of neural nets to mathematics that has been well understood for over a century. • LieGr: Leveraging words' natural embedding structure, we created a basic unsupervised word embedding scheme using special orthogonal Lie groups and the distributional hypothesis
  • 11. excella.com | @excellaco Our Work: LieGr, GeoNN, and More • Geodesic Neural Networks (GeoNN): Generates text without treating words as discrete units and by modeling sequences of words as geodesic flow (analogue of straight- line motion) on a Lie group. This permits the use of a Generative Adversarial Network (GAN) for training. Sentences are deterministically generated, but the path along which they are generated can be randomly seeded.
  • 12. excella.com | @excellaco Aiding Graph with AI Artificial Intelligence applications with deep neural networks can help advance a variety of graph computational problems: • node classification • node clustering • node retrieval/recommendation • link prediction
  • 13. excella.com | @excellaco Graph Based AI for Customer Service Intelligent Graph Based Knowledge Retrieval
  • 14. excella.com | @excellaco Business Case • Complicated process of forms lead to costly mistakes with serious repercussions on applicants​ • Confusion and anxiety leads to frequent calls to agency to check status or ask questions​ • Ultimate goal is to reduce the call volume
  • 15. excella.com | @excellaco Technical Response • Create AI using Deep Learning to provide initial customer service responses​ • Graph to provide context into forms, supporting documents, processing times, costs, eligibility, etc
  • 16. excella.com | @excellaco The Compliment • User research showed that there was additional value in opening graph access to the end user​ • Allow users to plan their journey and understand the options that fit their situation​ • Add Q&A layers to support interaction
  • 17. excella.com | @excellaco The Stack • Initial PoC using Load CSV to start building the knowledge graph​ • React front end to allow user interaction and support agile development​ • Py2Neo for AI interaction​ • AWS Cloud, CI Pipeline​ • Replacing Load CSV with custom front end for maintenance​ • Automated uploads of updates to costs and processing times
  • 18. excella.com | @excellaco Core Inference Engine • Utilizes a combined CNN/RNN structure to extract sentence meaning • Accesses additional structured information from Neo4j that is relevant to the question via a Neural Variational Answer Model • Combines output structure from Neo4j NVAM pipeline with RNN/CNN output in a fully connected layer • Utilizes a generative network component for answer generation
  • 19. excella.com | @excellaco Core Inference Engine Question LieGr Embeddings RNN CNN Fully Connected Layer Answer Neo
  • 20. excella.com | @excellaco Modeling Customer Service Data • Customer Service Knowledge based is modeled based on documents • (Brian)
  • 21. excella.com | @excellaco Embeddings • Embeddings are low dimensional vector representations of unstructured data • Embeddings store latent information and structure on the data • Generated predictive and count based dimensionality reduction) models
  • 22. excella.com | @excellaco Graph Embeddings Graph Embeddings help solve the computational efficiency problem of graph computing by embedding graph structures on a compact Manifold. Embedding Structures are broken down into: • node embedding • edge embedding • hybrid embedding • whole-graph embedding
  • 23. excella.com | @excellaco Graph Embeddings • Converting graphs to vector spaces makes computation easier for artificial neural networks​ • It's hard to find meaningful information after traversing several edges away from a node • Easier to discover latent information that is embedded within the data
  • 24. excella.com | @excellaco Graph Embeddings The learned representations of graph embeddings are useful for machine learning tasks such as the labeling of the nodes, regression, and edge prediction Features extracted with these sequence based graph embedding procedures can be used for predicting: • social network users’ missing age • the category of scientific papers in citation networks • the function of proteins in protein-protein interaction networks
  • 25. excella.com | @excellaco Graph Embeddings Besides supervised learning tasks on nodes the extracted features can be used for: • graph visualization • edge prediction • community detection • structural role identification
  • 26. excella.com | @excellaco Graph Embeddings • First big push in modern graph embedding research was DeepWalk by Perozzi et al, which uses truncated random walks for modeling sequences. • More involved sequence sampling methods include the use of second order random walks, the introduction of skips in random walks and branching processes • More sophisticated models encode the structural role of nodes, to get a representation that is in line with the multi level structure of the graph and consequently to improve the predictive performance on downstream machine learning tasks
  • 27. excella.com | @excellaco Data Retrieval First Step in utilizing deep learning for graph is to extract features: • Nodes • Pairs: connections and number of common neighbors • Groups: existing cluster assignments Perozzi et al.
  • 28. excella.com | @excellaco Embedding Structures: GEMSEC GEMSEC: Embeddings with Clustering • Graph embedding scheme that learns embeddings and latent clusters at the same time • Similar representations for nodes which have similar sampled neighborhoods • Probabilistic model on graphs: minimizing the negative log likelihood of observed neighborhood samples Rozemberczki, et al.
  • 29. excella.com | @excellaco Embedding Structures: GEMSEC GEMSEC: Embeddings with Clustering • Clusters from GEMSEC provide the basis for information retrieval • We utilize the trained GEMSEC model to determine which cluster our sent information point lies in • Once we have the cluster, we utilize a selection model to determine which information is most relevant.
  • 30. excella.com | @excellaco Variational Inference Answer Selection • Proposed as an answer selection model for question answering tasks • Employs a latent attention mechanism • Given a question q, it finds a set of answer sentences associated with q • Answer set determines the context vector, which are the words in the answer sentences that are prominent for predicting the answer matches to the current question. This enables the model to learn subtleties inherent in the questions.
  • 31. excella.com | @excellaco Network Operation • Fully connected layer identifies need for information, sends ”query” to information retrieval model (NASM) • Query is vectorized via LieGr and sent to NASM • NASM utilizes an embedded graph representation as input Fully Connected Layer Answer Neo
  • 32. Utilizing Graph for Intelligent Fraud Detection: AI-Based Graph Reasoning
  • 34. Intelligent Ingest [Pipeline diagram labels: Documents with Fraud; Intelligent Retrieval; Graph Reasoning; Neo]
  • 36. Unstructured Data to Graph PDF to Text • Python PDFMiner: extracts unstructured text information Structured Data Extraction • OCR techniques for extracting tables and figures from PDF files • Uses Tesseract OCR for recognition
  • 37. Unstructured Data to Graph Speech to Text • Takes in call center audio data as a supplement to the PDF-to-text pipeline • Uses the Google Cloud Speech-to-Text API • 96% accuracy on call center audio files
  • 38. Unstructured Data to Graph Extracting Entities: Stanford Named Entity Recognizer • Conditional Random Fields model: a discriminative sequence modeling method • Entities: Person, Location, Organization • Trained on both British and American newswire, so robust across both domains • Optimized with L-BFGS
  • 39. Neural Variational Inference • NVDM (Neural Variational Document Model): generative model for probabilistic document modeling • Combines unsupervised variational autoencoders with generative approaches • Unlike traditional models, the NVDM provides a dynamic, variational model of the text’s distribution
  • 40. Intelligent Ingest [Pipeline diagram labels: Documents with Fraud; Intelligent Retrieval; Graph Reasoning; Neo]
  • 42. Unstructured Data to Graph Intelligent Search • Crawl and extract further structured and unstructured data from websites using the Python-based Selenium API (grants.gov, public information databases, etc.) • Searches are handled per node: we iterate through the nodes and run searches on metadata relating to each entity to enrich the dataset • Information pulled from and pushed back to nodes is handled with Py2Neo
  • 43. Graph Reasoning [Pipeline diagram labels: Documents with Fraud; Intelligent Retrieval; Graph Reasoning; Neo]
  • 44. Effective Graph Analytics What types of machine learning can we do on a graph? • Node classification • Node clustering • Node retrieval/recommendation • Link prediction
  • 45. Knowledge Graph Reinforcement
  • 46. Graph Based Reasoning Reasoning over large-scale knowledge graphs. One option is to use traditional graph algorithms that are supported in Neo4j: • PageRank to determine entity importance • Path-finding algorithms for relationship modeling • Label Propagation for group recognition
  • 47. Graph Based Reasoning GEMSEC Embeddings • Pull nodes and relationships from Neo4j to feed into our embedding pipeline, which outputs clusters and embeddings • Shows an 8.79% improvement over previous deep methods in predicting a related grouping • GEMSEC created with GPU-enabled TensorFlow
  • 48. Pulling Data for Reasoning CREATE p = (audit:audit {number: 'A-04-17-01003'})-[:AUDITS]->(organization:organization {name: 'AURUM INSTITUTE'})-[:manages]->(grant:grant {name: 'PEPFAR'})-[:ISSUED_BY]->(agency:agency {name: 'CDC'}) RETURN p For our downstream predictive tasks, we pull nodes and their immediate relationships, or chains of relationships
  • 49. Graph Based Reasoning Downstream Prediction Tasks: • Is this fraud or not? The graph and its embeddings hold all of our latent information • We used a Recurrent Neural Network with a single softmax output layer, trained end to end, for our fraud detection predictions
  • 50. Other Examples of Reasoning Graph embeddings let us harness the power of the graph while still using a standard suite of machine learning and deep learning methods on downstream tasks: • Perozzi and Skiena showed in 2015 that we can use graph embeddings for downstream age prediction in social networks • Graph Convolutional Networks: Kipf & Welling introduced a structure for modeling arbitrarily structured graphs
  • 51. Other Examples of Reasoning Graph Convolutional Networks: For these models, the goal is to learn a function of signals/features on a graph which takes as input: • A feature description $x_i$ for every node $i$, summarized in an $N \times D$ feature matrix $X$ ($N$: number of nodes, $D$: number of input features) • A representative description of the graph structure in matrix form, typically an adjacency matrix $A$ (or some function thereof)
  • 52. Other Examples of Reasoning Graph Convolutional Networks: • Reduce the complexity of the training procedures • Powerful, but the learned structures cannot be transferred to other graphs
  • 53. Closing: Why Utilize AI Methods for Graph Analysis?
  • 54. Closing • Graph embeddings are a powerful means of utilizing your graph-based data for deep learning • Embedding structures can aid in creating ingest, in-graph, and downstream post-graph predictive tasks • Still a long way to go: how can we more closely integrate graph and deep learning?
  • 56. Patrick D. Smith (patrick.smith@excella.com) • Brian Rodrigue (brian.rodrigue@excella.com)
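
The truncated random walks mentioned on the Graph Embeddings slide can be illustrated with a minimal sketch. This is not the presenters' pipeline: the function name `truncated_random_walks`, its parameters, and the toy graph are assumptions made for illustration, and the graph is a plain Python adjacency dictionary rather than data pulled from Neo4j.

```python
import random


def truncated_random_walks(adjacency, walk_length, walks_per_node, seed=0):
    """Sample fixed-length random walks starting from every node.

    `adjacency` maps each node to a list of its neighbors (hypothetical
    in-memory format; the deck pulls this data from Neo4j instead).
    """
    rng = random.Random(seed)
    walks = []
    for _ in range(walks_per_node):
        nodes = list(adjacency)
        rng.shuffle(nodes)  # visit start nodes in a random order each pass
        for start in nodes:
            walk = [start]
            while len(walk) < walk_length:
                neighbors = adjacency[walk[-1]]
                if not neighbors:  # dead end: stop this walk early
                    break
                walk.append(rng.choice(neighbors))
            walks.append(walk)
    return walks


# Hypothetical toy graph loosely echoing the audit chain shown in the deck.
toy_graph = {
    "audit": ["organization"],
    "organization": ["audit", "grant"],
    "grant": ["organization", "agency"],
    "agency": ["grant"],
}

walks = truncated_random_walks(toy_graph, walk_length=5, walks_per_node=2)
for walk in walks:
    print(" -> ".join(walk))
```

In DeepWalk, walks like these are treated as sentences and passed to a skip-gram model to learn the node embeddings; that training step is omitted here.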

Editor's Notes

  • #39: L-BFGS solves unconstrained nonlinear optimization problems
  • #48: Label Propagation uses labels randomly propagated through the network to identify communities in large-scale knowledge graphs
  • #52: VIDEO
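
The label propagation idea in note #48 can be sketched in a few lines of plain Python. This is an illustrative, simplified version and not Neo4j's implementation: the function `label_propagation`, the `max_passes` parameter, and the toy graph are assumptions. Each node starts with its own label and repeatedly adopts the most common label among its neighbors, with ties broken at random.

```python
import random
from collections import Counter


def label_propagation(adjacency, max_passes=20, seed=0):
    """Assign community labels by repeated majority vote among neighbors."""
    rng = random.Random(seed)
    labels = {node: node for node in adjacency}  # each node starts alone
    nodes = list(adjacency)
    for _ in range(max_passes):
        rng.shuffle(nodes)  # random update order on every pass
        changed = False
        for node in nodes:
            counts = Counter(labels[n] for n in adjacency[node])
            if not counts:
                continue  # isolated node keeps its own label
            top = max(counts.values())
            best = [label for label, count in counts.items() if count == top]
            if labels[node] in best:
                continue  # current label is already a majority label
            labels[node] = rng.choice(best)  # break ties at random
            changed = True
        if not changed:  # converged: no node changed its label
            break
    return labels


# Hypothetical toy graph: two disconnected triangles.
toy_graph = {
    "a1": ["a2", "a3"], "a2": ["a1", "a3"], "a3": ["a1", "a2"],
    "b1": ["b2", "b3"], "b2": ["b1", "b3"], "b3": ["b1", "b2"],
}

communities = label_propagation(toy_graph)
print(communities)
```

Because labels can only travel along edges, the two triangles end up with different labels, which is the "group recognition" behavior the Graph Based Reasoning slide refers to.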