Graph Databases
and Neo4j
Data is getting bigger:
“Every 2 days we create as much
information as we did up to 2003”
– Eric Schmidt, Google
NoSQL
Key Value
Stores
Most are based on Dynamo: Amazon’s Highly Available
Key-Value Store
Data Model:
Global key-value mapping
Big scalable Hash Map
Highly fault tolerant (typically)
Examples:
Redis, Riak, Voldemort
Pros & Cons
Pros:
Simple data model
Scalable
Cons:
Create your own “foreign keys”
Poor for complex data
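The "create your own foreign keys" point can be illustrated with a minimal sketch. It assumes a plain Python dictionary standing in for the store; the `put`/`get` helpers and the `user:<id>` / `order:<id>` key formats are invented for this example and do not belong to any particular product.

```python
# Hypothetical in-memory stand-in for a key-value store: one global mapping.
store = {}

def put(key, value):
    store[key] = value

def get(key):
    return store.get(key)

# Values are opaque to the store. The key formats below are conventions
# made up for this example; the store itself knows nothing about them.
put("user:1", {"name": "Alice", "order_keys": ["order:10", "order:11"]})
put("order:10", {"item": "book"})
put("order:11", {"item": "lamp"})

def orders_for(user_key):
    """Follow the hand-made 'foreign keys' with one extra lookup per order."""
    user = get(user_key)
    if user is None:
        return []
    return [get(key) for key in user["order_keys"]]

items = [order["item"] for order in orders_for("user:1")]
print(items)
```

The application, not the store, is responsible for keeping `order_keys` consistent, which is one reason the model becomes awkward for complex data.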
Column Family
Most are based on Bigtable: Google’s Distributed
Storage System for Structured Data
Data Model:
A big table, with column families
Map Reduce for querying/processing
Examples:
HBase, HyperTable, Cassandra
Pros & Cons
Pros:
Supports Semi-Structured Data
Naturally Indexed (columns)
Scalable
Cons:
Poor for interconnected data
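One way to picture "a big table, with column families" is a nested map: row key, then column family, then column name. The sketch below assumes that simplification; the `put` and `get_family` helpers and the sample rows are hypothetical and do not mirror the API of HBase, HyperTable, or Cassandra.

```python
from collections import defaultdict

# Hypothetical sketch: row key -> column family -> column name -> value.
table = defaultdict(lambda: defaultdict(dict))

def put(row, family, column, value):
    table[row][family][column] = value

def get_family(row, family):
    """Return a copy of all columns in one family (empty dict if absent)."""
    return dict(table.get(row, {}).get(family, {}))

# Rows need not share the same columns, hence "semi-structured".
put("alice", "profile", "email", "alice@example.com")
put("alice", "profile", "city", "Oslo")
put("alice", "activity", "last_login", "2012-01-01")
put("bob", "profile", "email", "bob@example.com")

alice_profile = get_family("alice", "profile")
bob_activity = get_family("bob", "activity")
print(alice_profile, bob_activity)
```

Nothing in this layout links one row to another, which matches the slide's point that the model is poor for interconnected data.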
Document
Databases
Data Model:
A collection of documents
A document is a key-value collection
Index-centric, lots of map-reduce
Examples:
CouchDB, MongoDB
Pros & Cons
Pros:
Simple, powerful data model
Scalable
Cons:
Poor for interconnected data
Query model limited to keys and indexes
Map reduce for larger queries
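A rough sketch of the document model, assuming documents are plain dictionaries keyed by ID. The `build_index` helper and the sample data are hypothetical; real document stores such as CouchDB or MongoDB expose their own indexing mechanisms.

```python
# Hypothetical sketch: a collection maps document IDs to key-value documents.
collection = {
    "p1": {"name": "Alice", "city": "Oslo", "friends": ["p2"]},
    "p2": {"name": "Bob", "city": "Oslo"},
    "p3": {"name": "Carol", "city": "Rome"},
}

def build_index(docs, field):
    """Secondary index: field value -> sorted list of document IDs."""
    index = {}
    for doc_id, doc in docs.items():
        if field in doc:
            index.setdefault(doc[field], []).append(doc_id)
    return {value: sorted(ids) for value, ids in index.items()}

# Queries go through a key or an index such as this one.
city_index = build_index(collection, "city")
in_oslo = city_index.get("Oslo", [])
print(in_oslo)
```

Following the `friends` list from one document to another takes a further lookup per hop, which is where the "poor for interconnected data" limitation shows up.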
Graph
Databases
Data Model:
Nodes and Relationships
Examples:
Neo4j, OrientDB, InfiniteGraph,
AllegroGraph
Pros & Cons
Pros:
Powerful data model, as general as RDBMS
Connected data locally indexed
Easy to query
Cons:
Requires rewiring your brain
[Figure: NoSQL positioning chart. Axes: Size, Complexity. Labels: Key-Value Store, Big Table Clones, Document Databases, Graph Databases, Relational Databases, "90% of Use Cases".]
A Graph Database uses graph structure with nodes, edges
and properties to represent and store data.
By definition, a graph database is any storage system that
provides index-free adjacency. This means that every
element contains direct pointers to its adjacent elements,
so no index lookups are necessary to move between them.
Graph databases focus on the interconnection between
Entities.
Graph Database definition
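Index-free adjacency can be sketched in a few lines, assuming each element simply holds object references to its neighbours. The `Node` class below is a hypothetical illustration, not Neo4j's storage format.

```python
class Node:
    """A graph element that keeps direct references to its neighbours."""

    def __init__(self, name):
        self.name = name
        self.neighbours = []  # direct references, no global index involved

    def connect(self, other):
        # Store the link on both elements so either side can follow it.
        self.neighbours.append(other)
        other.neighbours.append(self)

alice, bob, carol = Node("Alice"), Node("Bob"), Node("Carol")
alice.connect(bob)
bob.connect(carol)

# Traversal follows the stored references hop by hop.
two_hops = sorted(
    {n2.name for n1 in alice.neighbours for n2 in n1.neighbours if n2 is not alice}
)
print(two_hops)
```

The cost of each hop depends on the number of neighbours of the current element rather than on the total size of the data set.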
Compared with RDBMS
• Graph databases are often faster for associative data sets.
• They map more directly to the structure of object-oriented
applications.
• They scale more naturally to large data sets, as they do not typically
require expensive join operations.
• As they depend less on a rigid schema, they are more suitable
for managing ad hoc and changing data with evolving schemas.
Finding Extended Friends
Nodes
Nodes represent Entities such as people, businesses, accounts,
or any other item you might want to keep track of.
Properties
Properties are pertinent pieces of information that relate to nodes.
Edges
Edges are the lines that connect nodes to nodes or nodes to
properties and they represent the Relationship between the
two.
Most of the important information is really stored in the
edges.
Meaningful patterns emerge when one examines the
connections and interconnections of nodes, properties and
edges.
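The three building blocks, and the "extended friends" idea, can be sketched together. The example assumes a tiny in-memory graph; the node IDs, the `FRIEND` and `WORKS_WITH` relationship types, and both helper functions are hypothetical names chosen for illustration.

```python
from collections import deque

# Nodes: node ID -> properties.
nodes = {
    1: {"name": "Alice"},
    2: {"name": "Bob"},
    3: {"name": "Carol"},
    4: {"name": "Dave"},
}
# Edges: (source, relationship type, target, edge properties).
edges = [
    (1, "FRIEND", 2, {"since": 2009}),
    (2, "FRIEND", 3, {"since": 2010}),
    (3, "FRIEND", 4, {"since": 2011}),
    (1, "WORKS_WITH", 4, {}),
]

def neighbours(node_id, rel_type):
    """Nodes joined to node_id by an edge of rel_type, in either direction."""
    found = []
    for src, rel, dst, _props in edges:
        if rel != rel_type:
            continue
        if src == node_id:
            found.append(dst)
        elif dst == node_id:
            found.append(src)
    return found

def extended_friends(start, max_depth):
    """Breadth-first search mapping each friend's name to its hop distance."""
    seen = {start: 0}
    queue = deque([start])
    while queue:
        current = queue.popleft()
        if seen[current] == max_depth:
            continue  # do not expand beyond the requested depth
        for nxt in neighbours(current, "FRIEND"):
            if nxt not in seen:
                seen[nxt] = seen[current] + 1
                queue.append(nxt)
    return {nodes[n]["name"]: d for n, d in seen.items() if n != start}

result = extended_friends(1, 2)
print(result)
```

Here the answer comes entirely from the edges: Dave is a colleague of Alice but only a third-degree friend, so a depth of 2 leaves him out.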
Graph Database and Neo4j
What is Neo4j?
• A Graph Database
• Property Graph
• Full ACID (atomicity, consistency, isolation, durability)
• High Availability (with Enterprise Edition)
• Supports up to 32 Billion Nodes, 32 Billion Relationships,
64 Billion Properties
• Embedded Server
• REST API
Key Features
• Runs on major platforms: Mac | Windows | Unix
• Extensive documentation
• Active community
• Open Source
CYPHER
Cypher is a declarative graph query language that allows for
expressive and efficient querying and updating of the graph
store without having to write traversal through the graph
structure in code.
CYPHER
START: Starting points in the graph, obtained via index lookups or by element IDs.
MATCH: The graph pattern to match, bound to the starting points in START.
WHERE: Filtering criteria.
RETURN: What to return.
CREATE: Creates nodes and relationships.
DELETE: Removes nodes, relationships and properties.
SET: Sets values on properties.
FOREACH: Performs updating actions once per element in a list.
WITH: Divides a query into multiple, distinct parts.
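How MATCH, WHERE, and RETURN fit together can be sketched by pairing a small Cypher query with a hand-written Python equivalent. The query string is illustrative only and is never sent to a database; it assumes a Neo4j version in which MATCH can open a query without START, since START was deprecated in later releases. The sample data and `run_equivalent` are hypothetical.

```python
# Illustrative Cypher, held as a plain string; it is not executed here.
QUERY = """
MATCH (a)-[:FRIEND]->(b)
WHERE a.name = 'Alice'
RETURN b.name
"""

people = {1: {"name": "Alice"}, 2: {"name": "Bob"}, 3: {"name": "Carol"}}
friend_edges = [(1, 2), (1, 3), (2, 3)]  # directed FRIEND relationships

def run_equivalent():
    # MATCH: bind (a, b) to every FRIEND relationship in the graph.
    matches = [(people[src], people[dst]) for src, dst in friend_edges]
    # WHERE: keep only the bindings that satisfy the filter.
    kept = [(a, b) for a, b in matches if a["name"] == "Alice"]
    # RETURN: project the requested value from each remaining binding.
    return [b["name"] for _a, b in kept]

names = run_equivalent()
print(names)
```

The declarative query states the pattern and the filter; the Python version spells out the traversal that Cypher spares the author from writing.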

Editor's Notes

  • #5: Dynamo is a set of techniques. Fault tolerant: the system can continue operating after some of its components fail.
  • #8: Interconnected data: data items that are linked to one another.