JanusGraph
Project status and what's next
August 22, 2017
©2017 IBM Corporation 24 August 2017
Graph Database
§ JanusGraph is a scalable graph database optimized for
storing and querying graphs containing hundreds of
billions of vertices and edges distributed across a multi-machine
cluster.
§ JanusGraph is a transactional database that can support
thousands of concurrent users executing complex graph
traversals in real time.
Project History
§ February 2012
– Titan graph database started
§ February 2015
– Aurelius acquired by DataStax
§ September 2015
– Titan 1.0 released
§ January 2017
– JanusGraph established at the Linux Foundation
– With partners from Expero, Google, GRAKN.AI, Hortonworks, and IBM
Releases
§ JanusGraph 0.1.0 – April 20, 2017
– Version uplifts
§ TinkerPop 3.2.3 + Spark 1.6.1
§ HBase 1.2.4
§ BerkeleyJE 7.3.7 (now Apache licensed!)
§ Google Cloud BigTable 0.9.5.1 (new!)
– Do not use for Titan 1.0 migration
§ JanusGraph 0.1.1 – May 16, 2017
– Critical Titan migration bug fixes
– Titan users should migrate directly from Titan to JanusGraph 0.1.1
Next Release
§ JanusGraph 0.2.0 – expected August/September 2017
– Version uplifts
§ TinkerPop 3.3.0 + Spark 2.2.0 (Scala 2.11)
§ Cassandra 2.1.18 + 3.0.14
§ Elasticsearch 5.4.2
§ Lucene 6.6.0, Solr 6.6.0
– CQL Storage Adapter
§ Improved performance with CQL native transport (Paul Kendall, Samant Maharaj)
§ Thrift will be removed in Cassandra 4.0
§ OLAP support on Cassandra 3.0 (Kedar Mhaswade)
– Indexing improvements
§ Replaced Elasticsearch node client with REST client (sjudeng)
§ Support for all geometry features in Elasticsearch and Solr (David Clement)
Growing Community
§ New Committers
– sjudeng
– Robert Dale
– Paul Kendall
– Samant Maharaj
§ Corporate CLA Signers
– Amazon
– Netflix
– Orchestral Developments
– Seeq
– Uber
Users in Production
Opportunities
§ On-boarding experiences
– Migration from Titan
– Getting started via documentation
§ Product usage documentation
– What advantages do property graphs provide in your domain?
– How did you choose JanusGraph?
§ Tooling
– Schema design
– Indexing performance
– Bulk loading
– Visualization
Programming Language Popularity
[Charts: TIOBE Index, PYPL Popularity, RedMonk Rankings]
Gremlin-Python (GLV)
§ Directly incorporated and tested by Apache TinkerPop
§ Implements core pieces found in Gremlin-Java
– Graph Traversal
– Graph Traversal Source
– __
§ Install via pip package manager
§ Gremlin-Python sugar
– g.V().both()[1:3].toList()
– g.V().both().name.toList()
§ Bindings
– g.V(('id',4)).out('created').name.toList()
http://tinkerpop.apache.org/docs/current/reference/#gremlin-python
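A toy sketch of what the sugar and binding forms above plausibly expand to. `ToyTraversal` is a hypothetical class written for illustration, not the Gremlin-Python API: it only records steps as tuples and never contacts a server. It assumes, following the TinkerPop reference documentation, that slicing corresponds to a `range` step, attribute access to a `values` step, and a `(variable, value)` tuple to a binding.

```python
class ToyTraversal:
    """Hypothetical recorder of Gremlin steps as (name, args) tuples."""

    def __init__(self, steps=None, bindings=None):
        self.steps = list(steps or [])
        self.bindings = dict(bindings or {})

    def _add(self, name, *args):
        bindings = dict(self.bindings)
        plain_args = []
        for arg in args:
            # Assumption: a (variable, value) tuple is a binding, as in
            # g.V(('id', 4)); the step keeps only the variable name.
            if isinstance(arg, tuple) and len(arg) == 2 and isinstance(arg[0], str):
                bindings[arg[0]] = arg[1]
                plain_args.append(arg[0])
            else:
                plain_args.append(arg)
        return ToyTraversal(self.steps + [(name, tuple(plain_args))], bindings)

    def V(self, *ids):
        return self._add("V", *ids)

    def both(self, *labels):
        return self._add("both", *labels)

    def out(self, *labels):
        return self._add("out", *labels)

    def range(self, low, high):
        return self._add("range", low, high)

    def values(self, *keys):
        return self._add("values", *keys)

    def __getitem__(self, index):
        # Sugar: t[1:3] is recorded as range(1, 3); t[2] as range(2, 3).
        if isinstance(index, slice):
            if index.stop is None:
                raise ValueError("open-ended slices are not handled in this sketch")
            return self.range(index.start or 0, index.stop)
        return self.range(index, index + 1)

    def __getattr__(self, key):
        # Sugar: t.name is recorded as values('name'). Only reached when
        # normal attribute lookup fails; private names are rejected.
        if key.startswith("_"):
            raise AttributeError(key)
        return self.values(key)


g = ToyTraversal()
print(g.V().both()[1:3].steps)
print(g.V().both().name.steps)
bound = g.V(("id", 4)).out("created").name
print(bound.steps, bound.bindings)
```

Keeping the variable name in the step and the value in a separate map is one way to picture why bindings let a server recognize repeated traversals that differ only in their parameter values.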
ipython-gremlin: Gremlin Magic
ipython-gremlin: David Michael Brown @davebshow
Line Magic, Cell Magic
ipython-gremlin: Python integrations
ipython-gremlin: David Michael Brown @davebshow
pandas, matplotlib, NetworkX
Flying into the cloud
§ Graphs are most useful when they are large
§ Operating distributed clusters
– Security
– Availability
– Scalability
§ Sharing is caring
Python Application Architecture (Local)
[Diagram: Clients (web browser, data notebook), Application Server, and Graph Database; connections labeled HTTP, Gremlin, and Gremlin Bytecode; components labeled ipython-gremlin and Gremlin-Python]
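A minimal sketch of the application-server tier in this architecture, assuming browser clients reach it over HTTP and it alone talks to the graph database. Every name here (`GraphBackend`, `InMemoryGraph`, `handle_request`, the `/vertices/<id>/neighbors` path) is hypothetical; the in-memory class stands in for the database so the example runs without a server, and a real deployment would put a Gremlin-Python traversal behind the same interface.

```python
import json
from typing import Dict, List, Protocol, Tuple


class GraphBackend(Protocol):
    """Hypothetical interface the application server depends on."""

    def neighbor_names(self, vertex_id: int) -> List[str]:
        ...


class InMemoryGraph:
    """Stand-in for the graph database tier (illustration only)."""

    def __init__(self, names: Dict[int, str], edges: Dict[int, List[int]]):
        self._names = names
        self._edges = edges

    def neighbor_names(self, vertex_id: int) -> List[str]:
        # Names of vertices adjacent to vertex_id, sorted for stable output.
        return sorted(self._names[n] for n in self._edges.get(vertex_id, []))


def handle_request(path: str, graph: GraphBackend) -> Tuple[int, str]:
    """Map a path such as /vertices/1/neighbors to (status, JSON body)."""
    parts = path.strip("/").split("/")
    if (
        len(parts) == 3
        and parts[0] == "vertices"
        and parts[1].isdigit()
        and parts[2] == "neighbors"
    ):
        names = graph.neighbor_names(int(parts[1]))
        return 200, json.dumps({"names": names})
    return 404, json.dumps({"error": "not found"})


graph = InMemoryGraph({1: "marko", 2: "vadas", 3: "lop"}, {1: [2, 3]})
print(handle_request("/vertices/1/neighbors", graph))
print(handle_request("/unknown", graph))
```

Because the handler only sees the `GraphBackend` interface, the same code could serve a local database or a hosted one without changes on the client side.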
Python Application Architecture (IBM Cloud)
[Diagram: Clients (web browser, data notebook), Application Server, and Graph Database; connections labeled HTTP, Gremlin, and Gremlin Bytecode; components labeled ipython-gremlin and Gremlin-Python]
https://www.compose.com/janusgraph
JanusGraph, Jupyter Meetup NYC