SlideShare a Scribd company logo
Graph ( Theory and Databases )


       Pere Urbón Bayes
         Senior Software Engineer
               Independent

            purbon@purbon.com
                                    purbon.com

                                    in/purbon
         December of 2010
                                    @purbon
Graph (Theory and Databases)

●   Graph Theory                           ●    Graph Databases
       –   Definitions                            –   Definitions
       –   Applications                           –   Facts
       –   Analytics
                                                  –   Performance
                                                  –   Vendors




                          Graph ( Theory and Databases )            2
Graph
                         Definitions

●   Graph G(V,E) where V = {v1,v2,...,vN) and E =
    {E1,E2,...,EN)
       –   Directed / Undirected
       –   Mixed
       –   Multigraph
       –   Weighted
       –   ....


                        Graph ( Theory and Databases )   3
Graph
                       Definitions

●   Directed graphs
●   Vertex
●   Edges
●   From V(N) to V(M)




                      Graph ( Theory and Databases )   4
Graph
                   Definitions

    Multigraph                          Labelling
●   More than one edge             ●    The process of
    between two nodes.                  assigning a label to a
●   Loops, edges                        vertex and edges.
    between the same
    node.



                  Graph ( Theory and Databases )                 5
Graph Theory
                    Applications

●   Task planning
●   Scheduling
●   Process assignation
●   Routing
●   Logistics
●   League planning


                    Graph ( Theory and Databases )   6
Graph Theory
                      Applications

●   Pattern Recognition
●   Dependency analysis
●   Impact analysis
●   Network flow
    –   Traffic analysis and optimization
    –   Delivery optimization
●   Optimization of tasks

                      Graph ( Theory and Databases )   7
Graph Theory
                      analytics

●   Clustering (Communities)
●   Social connexions
●   Hubs
●   Graph Mining
●   Centrality measures




                   Graph ( Theory and Databases )   8
Graph Like
                         Applications

●   Recommendations
       –   Heuristics (PageRank)
       –   Local
               ●   Shortest Paths
               ●   Hammock Functions
               ●   Walks
               ●   Search algorithms
               ●   Shooting stars
               ●   K-nearest neighbours

                          Graph ( Theory and Databases )   9
Graph Like
                    Applications

●   Location based services
●   Hubs
●   Spatial databases
●   Logical (multi-)index construction




                     Graph ( Theory and Databases )   10
Web
                       Trending Topics

●   Semantic web
        –   RDF (OWL) Store
        –   RDF-Sail
        –   SPARQL
●   Linked data (Open Data)
●   Link analysis
●   Structure mining

                         Graph ( Theory and Databases )   11
Graph databases

“A graph database is a database that uses graph
 structures with nodes, edges, and properties to
         represent and store information.

  General graph databases that can store any
   graph are distinct from specialized graph
  databases such as triple stores and network
                  databases.”
                                                  Wikipedia


                 Graph ( Theory and Databases )          12
Graph databases
                          Property graph

●   Abstractions
          –   Nodes
          –   Relationships
          –   Properties on both.
    John smith liked http://www.example.com at 01/10/11




                             Graph ( Theory and Databases )   13
Graph databases
                                    Facts

Connectivity
                                                              Everything
                                                              connected


                                      RDF       Ontologies
                                                     Linked Data
                                    Tagging
                            Blogs                Folksonomies
                                Social Networks


               Text files



                        1990's             2010's            2020's        Decades

                            Graph ( Theory and Databases )                       14
Graph databases
                                  Facts

Size of




                     1990's               2010's            2020's   Decades

                           Graph ( Theory and Databases )                      15
          http://www.guardian.co.uk/business/2009/may/18/digital-content-expansion
Graph databases
                            Facts

Performance

          Lists


                                          Graph like structures
                                              Semantic web
                                            Semantic reasoning
                                               Linked data


                                    Performance slowdown



                                                                  Unstructured

                     Graph ( Theory and Databases )                         16
Graph databases
                              Performance
Kernel      DEX      Neo4j     Jena         HyperGraphDB
Scale 15
Load(s)     7,44     697       141          +24h
Scan (s)    0,0010   2,71      0,689
2-Hops(s)   0,0120   0,0260    0,443
BC (s)      14,8     8,24      138
Size (MB)   30       17        207

                              Kernel          DEX               Neo4j    Jena     HyperGraph
                              Scale 20                                            DB
                              Load(s)         317               32.094   4.560    +24h
                              Scan (s)        0,005             751      18,6
                              2-Hops(s)       0,033             0,0230   0,4580
                              BC (s)          617               7027     59512
                              Size (MB)       893               539      6656

                               Graph ( Theory and Databases )                            17
HPC Scalable Graph Analysis Benchmark IWGD 2010
Graph databases
                        Vendors

●   Neo4J: Open source database NoSQL graph.
●   Dex: The high performance graph database.
●   HyperGraphDB: An IA and semantic web graph
    database.
●   Infogrid: The Internet Graph database.
●   Sones: SaaS dot Net Graph database.
●   VertexDB: High performance database server.

                    Graph ( Theory and Databases )   18
Graph ( Theory and Databases )



              Thanks!
           purbon@purbon.com


         December of 2010



           Graph ( Theory and Databases )   19

More Related Content

PDF
Try NoSQL it doesn't hurts and is fun
PDF
Graph Databases introduction to rug-b
PDF
Graph Databases, The Web of Data Storage Engines
PPTX
Exchange and Consumption of Huge RDF Data
PDF
Binary RDF for Scalable Publishing, Exchanging and Consumption in the Web of ...
PPTX
Compact Representation of Large RDF Data Sets for Publishing and Exchange
PDF
Old Maps, Annotations, and Open Data Networks
PDF
The Best of Both Worlds: Unlocking the Power of (big) Knowledge Graphs with S...
Try NoSQL it doesn't hurts and is fun
Graph Databases introduction to rug-b
Graph Databases, The Web of Data Storage Engines
Exchange and Consumption of Huge RDF Data
Binary RDF for Scalable Publishing, Exchanging and Consumption in the Web of ...
Compact Representation of Large RDF Data Sets for Publishing and Exchange
Old Maps, Annotations, and Open Data Networks
The Best of Both Worlds: Unlocking the Power of (big) Knowledge Graphs with S...

What's hot (13)

PDF
Generating Executable Mappings from RDF Data Cube Data Structure Definitions
PPTX
Semantic Web and Related Work at W3C
PPT
Benchmarking graph databases on the problem of community detection
PDF
Efficient Tabling of Structured Data Using Indexing and Program Transformation
PDF
Mapping, Interlinking and Exposing MusicBrainz as Linked Data
PDF
NOSQL overview and intro to graph databases with Neo4j (Geeknight May 2010)
PDF
Data Modeling with Neo4j
KEY
Intro to Neo4j presentation
PDF
[2D3]TurboGraph- Ultrafast graph analystics engine for billion-scale graphs i...
PDF
Using python to analyze spatial data
PPT
BDAS Shark study report 03 v1.1
PPTX
Graph database & neo4j
PDF
Raster data in GeoServer and GeoTools: Achievements, issues and future develo...
Generating Executable Mappings from RDF Data Cube Data Structure Definitions
Semantic Web and Related Work at W3C
Benchmarking graph databases on the problem of community detection
Efficient Tabling of Structured Data Using Indexing and Program Transformation
Mapping, Interlinking and Exposing MusicBrainz as Linked Data
NOSQL overview and intro to graph databases with Neo4j (Geeknight May 2010)
Data Modeling with Neo4j
Intro to Neo4j presentation
[2D3]TurboGraph- Ultrafast graph analystics engine for billion-scale graphs i...
Using python to analyze spatial data
BDAS Shark study report 03 v1.1
Graph database & neo4j
Raster data in GeoServer and GeoTools: Achievements, issues and future develo...
Ad

Viewers also liked (20)

PPTX
Interesting applications of graph theory
PPTX
Graph theory and life
PDF
Graph theory in Practise
PPTX
Application of graph theory in drug design
PDF
CS6702 graph theory and applications notes pdf book
PDF
introduction to graph theory
PDF
the bike map - a look into a practical application of graph theory
PPT
Applications of graphs
PPTX
Introduction to graph class 8
PPTX
Graph theory Application
PDF
Algorithms for Graph Coloring Problem
PPTX
Graph theory short presentation
KEY
How to Leverage the Social Graph with Facebook Platform
PDF
Graph basic
PDF
Introduction to Facebook Graph API and OAuth 2
PPTX
Types of graphs
PDF
2 Graph Theory
PDF
Facebook Open Graph API and How To Use It
PPT
Facebook Open Graph API
PPT
GRAPH COLORING AND ITS APPLICATIONS
Interesting applications of graph theory
Graph theory and life
Graph theory in Practise
Application of graph theory in drug design
CS6702 graph theory and applications notes pdf book
introduction to graph theory
the bike map - a look into a practical application of graph theory
Applications of graphs
Introduction to graph class 8
Graph theory Application
Algorithms for Graph Coloring Problem
Graph theory short presentation
How to Leverage the Social Graph with Facebook Platform
Graph basic
Introduction to Facebook Graph API and OAuth 2
Types of graphs
2 Graph Theory
Facebook Open Graph API and How To Use It
Facebook Open Graph API
GRAPH COLORING AND ITS APPLICATIONS
Ad

Similar to Graph Theory and Databases (20)

PDF
Bcn On Rails May2010 On Graph Databases
PDF
Ted Willke, Senior Principal Engineer & GM, Datacenter Group, Intel at MLconf SF
PPT
Graph database
PPTX
Data Integration at the Ontology Engineering Group
PPTX
Hadoop For Enterprises
PDF
NOSQL Overview Lightning Talk (Scalability Geekcruise 2009)
PDF
An introduction to apache drill presentation
PDF
NOSQL Overview, Neo4j Intro And Production Example (QCon London 2010)
PDF
Introduction to Property Graph Features (AskTOM Office Hours part 1)
PPTX
Drill njhug -19 feb2013
PPTX
What is the "Big Data" version of the Linpack Benchmark? ; What is “Big Data...
PDF
Using postgre sql for 3d cms
PDF
STI Summit 2011 - Digital Worlds
PDF
Linked Open data: CNR
PPT
20130204 graph to-pacer-xml
PDF
Introduction to Neo4j
PPTX
Hadoop World 2011: Big Data Architecture: Integrating Hadoop with Other Enter...
PPTX
Apache Big_Data Europe event: "Integrators at work! Real-life applications of...
PPTX
Apache Big_Data Europe event: "Integrators at work! Real-life applications of...
PPTX
Data Processing over very Large Relational Databases
Bcn On Rails May2010 On Graph Databases
Ted Willke, Senior Principal Engineer & GM, Datacenter Group, Intel at MLconf SF
Graph database
Data Integration at the Ontology Engineering Group
Hadoop For Enterprises
NOSQL Overview Lightning Talk (Scalability Geekcruise 2009)
An introduction to apache drill presentation
NOSQL Overview, Neo4j Intro And Production Example (QCon London 2010)
Introduction to Property Graph Features (AskTOM Office Hours part 1)
Drill njhug -19 feb2013
What is the "Big Data" version of the Linpack Benchmark? ; What is “Big Data...
Using postgre sql for 3d cms
STI Summit 2011 - Digital Worlds
Linked Open data: CNR
20130204 graph to-pacer-xml
Introduction to Neo4j
Hadoop World 2011: Big Data Architecture: Integrating Hadoop with Other Enter...
Apache Big_Data Europe event: "Integrators at work! Real-life applications of...
Apache Big_Data Europe event: "Integrators at work! Real-life applications of...
Data Processing over very Large Relational Databases

Recently uploaded (20)

PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PDF
Empathic Computing: Creating Shared Understanding
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PPTX
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
PDF
Encapsulation_ Review paper, used for researhc scholars
PDF
Machine learning based COVID-19 study performance prediction
PPTX
Big Data Technologies - Introduction.pptx
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
PDF
Unlocking AI with Model Context Protocol (MCP)
PPTX
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
PDF
Electronic commerce courselecture one. Pdf
PPT
Teaching material agriculture food technology
PDF
NewMind AI Weekly Chronicles - August'25 Week I
PPTX
sap open course for s4hana steps from ECC to s4
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PDF
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
PDF
Review of recent advances in non-invasive hemoglobin estimation
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
Reach Out and Touch Someone: Haptics and Empathic Computing
Empathic Computing: Creating Shared Understanding
20250228 LYD VKU AI Blended-Learning.pptx
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
Encapsulation_ Review paper, used for researhc scholars
Machine learning based COVID-19 study performance prediction
Big Data Technologies - Introduction.pptx
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
Unlocking AI with Model Context Protocol (MCP)
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
Electronic commerce courselecture one. Pdf
Teaching material agriculture food technology
NewMind AI Weekly Chronicles - August'25 Week I
sap open course for s4hana steps from ECC to s4
Building Integrated photovoltaic BIPV_UPV.pdf
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
Review of recent advances in non-invasive hemoglobin estimation
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
Advanced methodologies resolving dimensionality complications for autism neur...

Graph Theory and Databases

  • 1. Graph ( Theory and Databases ) Pere Urbón Bayes Senior Software Engineer Independent purbon@purbon.com purbon.com in/purbon December of 2010 @purbon
  • 2. Graph (Theory and Databases) ● Graph Theory ● Graph Databases – Definitions – Definitions – Applications – Facts – Analytics – Performance – Vendors Graph ( Theory and Databases ) 2
  • 3. Graph Definitions ● Graph G(V,E) where V = {v1,v2,...,vN) and E = {E1,E2,...,EN) – Directed / Undirected – Mixed – Multigraph – Weighted – .... Graph ( Theory and Databases ) 3
  • 4. Graph Definitions ● Directed graphs ● Vertex ● Edges ● From V(N) to V(M) Graph ( Theory and Databases ) 4
  • 5. Graph Definitions Multigraph Labelling ● More than one edge ● The process of between two nodes. assigning a label to a ● Loops, edges vertex and edges. between the same node. Graph ( Theory and Databases ) 5
  • 6. Graph Theory Applications ● Task planning ● Scheduling ● Process assignation ● Routing ● Logistics ● League planning Graph ( Theory and Databases ) 6
  • 7. Graph Theory Applications ● Pattern Recognition ● Dependency analysis ● Impact analysis ● Network flow – Traffic analysis and optimization – Delivery optimization ● Optimization of tasks Graph ( Theory and Databases ) 7
  • 8. Graph Theory analytics ● Clustering (Communities) ● Social connexions ● Hubs ● Graph Mining ● Centrality measures Graph ( Theory and Databases ) 8
  • 9. Graph Like Applications ● Recommendations – Heuristics (PageRank) – Local ● Shortest Paths ● Hammock Functions ● Walks ● Search algorithms ● Shooting stars ● K-nearest neighbours Graph ( Theory and Databases ) 9
  • 10. Graph Like Applications ● Location based services ● Hubs ● Spatial databases ● Logical (multi-)index construction Graph ( Theory and Databases ) 10
  • 11. Web Trending Topics ● Semantic web – RDF (OWL) Store – RDF-Sail – SPARQL ● Linked data (Open Data) ● Link analysis ● Structure mining Graph ( Theory and Databases ) 11
  • 12. Graph databases “A graph database is a database that uses graph structures with nodes, edges, and properties to represent and store information. General graph databases that can store any graph are distinct from specialized graph databases such as triple stores and network databases.” Wikipedia Graph ( Theory and Databases ) 12
  • 13. Graph databases Property graph ● Abstractions – Nodes – Relationships – Properties on both. John smith liked http://www.example.com at 01/10/11 Graph ( Theory and Databases ) 13
  • 14. Graph databases Facts Connectivity Everything connected RDF Ontologies Linked Data Tagging Blogs Folksonomies Social Networks Text files 1990's 2010's 2020's Decades Graph ( Theory and Databases ) 14
  • 15. Graph databases Facts Size of 1990's 2010's 2020's Decades Graph ( Theory and Databases ) 15 http://www.guardian.co.uk/business/2009/may/18/digital-content-expansion
  • 16. Graph databases Facts Performance Lists Graph like structures Semantic web Semantic reasoning Linked data Performance slowdown Unstructured Graph ( Theory and Databases ) 16
  • 17. Graph databases Performance Kernel DEX Neo4j Jena HyperGraphDB Scale 15 Load(s) 7,44 697 141 +24h Scan (s) 0,0010 2,71 0,689 2-Hops(s) 0,0120 0,0260 0,443 BC (s) 14,8 8,24 138 Size (MB) 30 17 207 Kernel DEX Neo4j Jena HyperGraph Scale 20 DB Load(s) 317 32.094 4.560 +24h Scan (s) 0,005 751 18,6 2-Hops(s) 0,033 0,0230 0,4580 BC (s) 617 7027 59512 Size (MB) 893 539 6656 Graph ( Theory and Databases ) 17 HPC Scalable Graph Analysis Benchmark IWGD 2010
  • 18. Graph databases Vendors ● Neo4J: Open source database NoSQL graph. ● Dex: The high performance graph database. ● HyperGraphDB: An IA and semantic web graph database. ● Infogrid: The Internet Graph database. ● Sones: SaaS dot Net Graph database. ● VertexDB: High performance database server. Graph ( Theory and Databases ) 18
  • 19. Graph ( Theory and Databases ) Thanks! purbon@purbon.com December of 2010 Graph ( Theory and Databases ) 19