The following is an excerpt of a presentation delivered at Gluecon 2010 in
Broomfield, Colorado.

The presentation is not about InfiniteGraph/DB itself; it is an overview of
managing distributed graph data in a graph database.

                 Copyright © InfiniteGraph
Scaling the [Social] Graph in the [Cloud]
Darren Wood
Lead Architect, InfiniteGraph
Graph Databases (Quickly)
• Optimized around data relationships
• Small focused API (typically not SQL)
• Typical Use Cases:
  – Social Graph Analysis
  – Catching Bad Guys (see Booth 16)
  – Fraud / Financial (more bad guys)
  – Data Intensive Science
  – Web / Advertising Analytics

Graph Databases (Almost Done)
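        // Person, Meeting, Call and Payment are vertex and edge classes assumed
        // to be defined by the application; myGraph is an open graph instance.
        Vertex alice = myGraph.addVertex(new Person("Alice"));
        Vertex bob = myGraph.addVertex(new Person("Bob"));
        Vertex carlos = myGraph.addVertex(new Person("Carlos"));
        Vertex charlie = myGraph.addVertex(new Person("Charlie"));

        // Each edge carries its own data; timestamp is assumed to be defined elsewhere.
        alice.addEdge(new Meeting("Denver", "5-27-10"), bob);
        bob.addEdge(new Call(timestamp), carlos);
        carlos.addEdge(new Payment(100000.00), charlie);
        bob.addEdge(new Call(timestamp), charlie);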



[Figure: the resulting graph. Alice -Meets-> Bob, Bob -Calls-> Carlos, Carlos -Pays-> Charlie, Bob -Calls-> Charlie]

What’s So Difficult Then?
• Graphs grow quickly
  – Billions of phone calls / day in US
  – Emails, social media events, IP Traffic
  – Financial transactions
• Some analytics require navigation of large
  sections of the graph
• Each step (often) depends on the last
• Must distribute data and go parallel
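
The point that each step (often) depends on the last can be sketched as a level-by-level breadth-first traversal: the next frontier cannot be computed until the current one is known, which is what makes long navigations hard to spread across machines. This is a minimal in-memory sketch, not InfiniteGraph code; the class name FrontierTraversal, the methods levels and sampleGraph, and the adjacency-map representation are all assumptions made for illustration.

```java
import java.util.ArrayList;
import java.util.HashSet;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;

class FrontierTraversal {
    // Hypothetical sample data: outgoing edges between four people.
    static Map<String, List<String>> sampleGraph() {
        return Map.of(
            "Alice", List.of("Bob"),
            "Bob", List.of("Carlos", "Charlie"),
            "Carlos", List.of("Charlie"));
    }

    // Groups the vertices reachable from start by hop distance.
    // Level k+1 is derived only from level k, so the levels are inherently sequential,
    // while the vertices inside one level could be expanded in parallel.
    static List<Set<String>> levels(Map<String, List<String>> out, String start) {
        List<Set<String>> result = new ArrayList<>();
        Set<String> visited = new HashSet<>();
        visited.add(start);
        Set<String> frontier = new LinkedHashSet<>();
        frontier.add(start);
        while (!frontier.isEmpty()) {
            result.add(frontier);
            Set<String> next = new LinkedHashSet<>();
            for (String v : frontier) {
                for (String n : out.getOrDefault(v, List.of())) {
                    if (visited.add(n)) {
                        next.add(n); // first time this vertex is reached
                    }
                }
            }
            frontier = next;
        }
        return result;
    }

    public static void main(String[] args) {
        // prints [[Alice], [Bob], [Carlos, Charlie]]
        System.out.println(levels(sampleGraph(), "Alice"));
    }
}
```

On a partitioned graph, every vertex in a frontier that lives on another machine turns the inner loop into a remote call, which is why the placement of vertices matters so much.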
First Some Good News…
• Graph algorithms naturally branch
• Can be automated or guided

[Figure: two branches leaving Alice. Alice -Meets-> Bob -Calls-> Carlos -Pays-> Charlie; Alice -Calls-> Chuck -Lives With-> Dave -Meets-> Eve]
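
A rough sketch of how branching might be exploited: each edge leaving the start vertex becomes its own task, and an optional predicate "guides" the walk by deciding which edges to follow. Everything here is hypothetical (the BranchingTraversal class, its Edge type, the sample graph as read from the figure); it uses only the standard java.util.concurrent executor and is not the InfiniteGraph API.

```java
import java.util.HashMap;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.Set;
import java.util.TreeSet;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.function.Predicate;

class BranchingTraversal {
    static final class Edge {
        final String label;
        final String target;
        Edge(String label, String target) { this.label = label; this.target = target; }
    }

    // Hypothetical sample data: the two branches leaving Alice.
    static Map<String, List<Edge>> sampleGraph() {
        Map<String, List<Edge>> g = new HashMap<>();
        g.put("Alice", List.of(new Edge("Meets", "Bob"), new Edge("Calls", "Chuck")));
        g.put("Bob", List.of(new Edge("Calls", "Carlos")));
        g.put("Carlos", List.of(new Edge("Pays", "Charlie")));
        g.put("Chuck", List.of(new Edge("Lives With", "Dave")));
        g.put("Dave", List.of(new Edge("Meets", "Eve")));
        return g;
    }

    // Depth-first walk of one branch; only edges accepted by guide are followed.
    static void walk(Map<String, List<Edge>> g, String v, Predicate<Edge> guide, Set<String> seen) {
        if (!seen.add(v)) return;
        for (Edge e : g.getOrDefault(v, List.of())) {
            if (guide.test(e)) walk(g, e.target, guide, seen);
        }
    }

    // Submits one task per accepted edge leaving start and collects what each branch reached.
    // Each task owns its visited set, so branches that overlap would both report the shared vertices.
    static Map<String, Set<String>> exploreBranches(Map<String, List<Edge>> g, String start, Predicate<Edge> guide) {
        ExecutorService pool = Executors.newFixedThreadPool(4);
        try {
            Map<String, Future<Set<String>>> pending = new LinkedHashMap<>();
            for (Edge e : g.getOrDefault(start, List.of())) {
                if (!guide.test(e)) continue;
                pending.put(e.target, pool.submit(() -> {
                    Set<String> seen = new TreeSet<>();
                    walk(g, e.target, guide, seen);
                    return seen;
                }));
            }
            Map<String, Set<String>> result = new LinkedHashMap<>();
            for (Map.Entry<String, Future<Set<String>>> p : pending.entrySet()) {
                result.put(p.getKey(), p.getValue().get());
            }
            return result;
        } catch (InterruptedException | ExecutionException ex) {
            throw new IllegalStateException("branch exploration failed", ex);
        } finally {
            pool.shutdown();
        }
    }

    public static void main(String[] args) {
        // Automated: follow every edge.
        System.out.println(exploreBranches(sampleGraph(), "Alice", e -> true));
        // Guided: ignore "Lives With" edges, which stops the Chuck branch early.
        System.out.println(exploreBranches(sampleGraph(), "Alice", e -> !e.label.equals("Lives With")));
    }
}
```

In a distributed setting the same idea would hand each branch to the processor that owns the branch's first vertex; the thread pool here only stands in for that.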

Big Distributed Data
(Traditional - Huge Generalization)

[Figure: Application(s) -> Distributed API -> four Processors, each above its own partition (Partition 1, Partition 2, Partition 3, Partition ...n)]



Big Distributed Data
(Graph)

[Figure: Application(s) -> Distributed API -> four Processors, each above its own partition (Partition 1, Partition 2, Partition 3, Partition ...n)]



So What Are The Answers?
Best Effort Partitioning

[Figure: Distributed API -> two Processors, one above Partition 1 and one above Partition 2]
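
The slide names best-effort partitioning without spelling out a policy. One possible reading, sketched here purely as an assumption, is a greedy placement: put each new vertex on the partition that already holds most of its neighbors, unless that partition is full. The BestEffortPlacement class, its methods, the capacity limit and the sample edges are all hypothetical, and a round-robin baseline is included only for comparison.

```java
import java.util.HashMap;
import java.util.HashSet;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.Set;

class BestEffortPlacement {
    // Hypothetical sample data: two clusters joined by the single Alice-Chuck edge.
    static final List<String> ORDER =
        List.of("Alice", "Bob", "Carlos", "Charlie", "Chuck", "Dave", "Eve");
    static final String[][] EDGES = {
        {"Alice", "Bob"}, {"Bob", "Carlos"}, {"Carlos", "Charlie"}, {"Bob", "Charlie"},
        {"Alice", "Chuck"}, {"Chuck", "Dave"}, {"Dave", "Eve"}};

    // Greedy placement in arrival order: prefer the partition with the most
    // already-placed neighbors, break ties toward the lighter partition,
    // and skip partitions that have reached capacity.
    static Map<String, Integer> greedy(List<String> order, String[][] edges, int partitions, int capacity) {
        Map<String, Set<String>> neighbors = new HashMap<>();
        for (String[] e : edges) {
            neighbors.computeIfAbsent(e[0], k -> new HashSet<>()).add(e[1]);
            neighbors.computeIfAbsent(e[1], k -> new HashSet<>()).add(e[0]);
        }
        Map<String, Integer> home = new LinkedHashMap<>();
        int[] load = new int[partitions];
        for (String v : order) {
            int[] score = new int[partitions];
            for (String n : neighbors.getOrDefault(v, Set.of())) {
                Integer p = home.get(n);
                if (p != null) score[p]++;
            }
            int best = -1;
            for (int p = 0; p < partitions; p++) {
                if (load[p] >= capacity) continue;
                if (best < 0 || score[p] > score[best]
                        || (score[p] == score[best] && load[p] < load[best])) {
                    best = p;
                }
            }
            if (best < 0) throw new IllegalStateException("all partitions are full");
            home.put(v, best);
            load[best]++;
        }
        return home;
    }

    // Baseline that ignores relationships entirely.
    static Map<String, Integer> roundRobin(List<String> order, int partitions) {
        Map<String, Integer> home = new LinkedHashMap<>();
        for (int i = 0; i < order.size(); i++) home.put(order.get(i), i % partitions);
        return home;
    }

    // Counts edges whose endpoints live on different partitions.
    static int externalEdges(String[][] edges, Map<String, Integer> home) {
        int cut = 0;
        for (String[] e : edges) {
            if (!home.get(e[0]).equals(home.get(e[1]))) cut++;
        }
        return cut;
    }

    public static void main(String[] args) {
        System.out.println("greedy external edges: " + externalEdges(EDGES, greedy(ORDER, EDGES, 2, 4)));   // 1
        System.out.println("round-robin external edges: " + externalEdges(EDGES, roundRobin(ORDER, 2)));    // 5
    }
}
```

The result depends on arrival order, which is the "best effort" part: a vertex placed before its neighbors arrive may land on the wrong side and need to be moved later.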



So What Are The Answers?
The Look Ahead Example

[Figure: Application -> Distributed API -> two Processors. Vertices A, B, C, D, X and Y are drawn over Partition 1; vertex E is drawn over Partition 2]

Which of These Work?
• A carefully orchestrated combination of
  various options
• Can be tuned (degree of look ahead)
• Healing the graph can be expensive (write cost)
• This can also be tuned/configured (external
  edge thresholds)



Thank you!
darren.wood@infinitegraph.com

   twitter.com/infinitegraph




