SlideShare a Scribd company logo
Copyright © 2017, Oracle and/or its affiliates. All rights reserved. |
Visual Analysis of Social Media Data from
using Graph Technologies
DATA NATIVES 2018 | Nov 22-23, 2018 | Berlin
Karin Patenge | Principal Solution Engineer | Cloud & Core Technologies
@kpatenge |  karin.patenge@oracle.com
Oracle Deutschland B.V. & Co. KG | Potsdam | Schiffbauergasse 14
Copyright © 2018, Oracle and/or its affiliates. All rights reserved. | @kpatenge @datanativesconf #DN18
Copyright © 2018, Oracle and/or its affiliates. All rights reserved. |
Accessing Data Entities
• Data retrieval via REST API
https://www.meetup.com/meetup_api
• Different API methods & versions
• API Key required
• Sample request
• Data returned as JSON
@kpatenge @datanativesconf #DN18
Copyright © 2018, Oracle and/or its affiliates. All rights reserved. |
Potential Questions of Interest
• Which Meetup groups are most active in terms of:
– # members
– # events
– # event attendees
• Who and where are influencers in the Meetup community?
• Where are connections between the Meetup groups in different locations?
• Which topics are “hot” and where?
• How close/similar are groups?
• …
@kpatenge @datanativesconf #DN18
Copyright © 2018, Oracle and/or its affiliates. All rights reserved. |
Approach: Modeling Data as Graphs
The more connected the data is, the better a Graph fits
Source: http://www.ateam-oracle.com/intro-to-graphs-at-oracle/
@kpatenge @datanativesconf #DN18
Copyright © 2018, Oracle and/or its affiliates. All rights reserved. |
• A set of nodes (aka vertices)
– each vertex has a unique identifier
– each vertex has a set of in/out edges
– each vertex has a collection of key-value
properties
• A set of edges
– each edge has a unique identifier
– each edge has a head/tail vertex
– each edge has a label denoting type of
relationship between two vertices
– each edge has a collection of key-value properties
• Implementations
– Oracle (Spatial and Graph/Big Data Spatial and
Graph), Neo4j, DataStax (Titan), InfiniteGraph, …
What is a Property Graph?
https://github.com/tinkerpop/blueprints/wiki/Property-Graph-Model
@kpatenge @datanativesconf #DN18
Copyright © 2018, Oracle and/or its affiliates. All rights reserved. |
• PageRank, Weighted PageRank
– Find influencers, critical vertices
• Personalized PageRank
– Find important people/products/...
with respect to a given starting point
• Sparsification
– Trim down the graph to make it more
fragmented
• Clustering
– Find communities which can be the
basis of segmentation, and/or
recommendation/anomaly detection,
churn analysis
• Centrality
– Find critical people/devices/...
• Shortest path
– Discover links, find suspect‘s close
collaborators, transportation routing
• Breadth-First-Search (BFS)
– Impact analysis, link analysis
• Matric factorization
– Recommendation
• Reachability
– Connectivity test
• ...
Graph Algorithms and their Applications
@kpatenge @datanativesconf #DN18
Copyright © 2018, Oracle and/or its affiliates. All rights reserved. |
Pathfinding
– fattestPath
– shortestPathBellmanFord
– shortestPathBellmanFordReverse
– shortestPathDijkstra
– shortestPathDijkstraBidirectional
– shortestPathFilteredDijkstra
– shortestPathFilteredDijkstraBidirectional
– shortestPathHopDist
– shortestPathHopDistReverse
Ranking
– closenessCentralityUnitLength
– degreeCentrality
– eigenvectorCentrality
– Hyperlink-Induced Topic Search (HITS)
– inDegreeCentrality
– nodeBetweennessCentrality
– outDegreeCentrality
– PageRank, weighted PageRank
– approximatePagerank
– personalizedPagerank
– randomWalkWithRestart
Social Network Analysis Algorithms (1)
@kpatenge @datanativesconf #DN18
https://tinyurl.com/pgxdocs
Copyright © 2018, Oracle and/or its affiliates. All rights reserved. |
Structure Evaluation
– Conductance
– countTriangles
– inDegreeDistribution
– outDegreeDistribution
– partitionConductance
– partitionModularity
– sparsify
– K-Core computes
Community Detection
– communitiesLabelPropagation
Recommendation
– salsa
– personalizedSalsa
– whomToFollow
Classic - Connected Components
– sccKosaraju
– sccTarjan
– wcc
Social Network Analysis Algorithms (2)
@kpatenge @datanativesconf #DN18
https://tinyurl.com/pgxdocs
Copyright © 2018, Oracle and/or its affiliates. All rights reserved. |
Architecture of Oracle Property Graph Analytics
Property
Graph formats
GraphML
GML
GraphSON
Flat Files
@kpatenge @datanativesconf #DN18
Copyright © 2018, Oracle and/or its affiliates. All rights reserved. |
PGX
• Toolkit for In-Memory, Parallel Graph
Analytics containing
– PGX shell
– Analyst API with a large collection of built-in
algorithms (45+)
– Enhance with user defined algorithms written
in Green-Marl
– Tutorials, JavaDoc, Use Cases, and more
• Developed by Oracle Labs
• https://docs.oracle.com/cd/E56133_01/latest/i
ndex.html
PGQL – Property Graph Query Language
• http://pgql-lang.org/
• Graph Pattern Matching combined with
SQL
• Developed by Oracle Labs
• Proposed for standardization
• Changes in Version 1.1:
http://pgql-lang.org/spec/1.1/#breaking-syntax-
changes-since-pgql-10
Property Graph Analytics Engine
@kpatenge @datanativesconf #DN18
Copyright © 2018, Oracle and/or its affiliates. All rights reserved. |
Data Processing and Analysis Workflow: Overview
Retrieve&Prepare
Prepare
source data
• Using R for data
retrieval via
REST API and
conversion
JSON  CSV 
OPV/OPE
Load&Build
Load
nodes and
edges data
into a graph
• Using Oracle
NoSQL DB as
Graph data
store
Analyze
Analyze
graph data
• Using Graph
Analytics Engine
(PGX) and
Property Graph
Query Language
(PGQL)
Visualize
Visualize
graph data
• Using
Cytoscape
Results
Summarize
results
@kpatenge @datanativesconf #DN18
Copyright © 2018, Oracle and/or its affiliates. All rights reserved. |
Demo
@kpatenge @datanativesconf #DN18
Copyright © 2018, Oracle and/or its affiliates. All rights reserved. | @kpatenge @datanativesconf #DN18
‚Big Data‘ Groups
in relation with
Topics and Cities
Copyright © 2018, Oracle and/or its affiliates. All rights reserved. | @kpatenge @datanativesconf #DN18
‚Big Data‘ Groups
in relation with
Organizers, Cities
and Events
Copyright © 2018, Oracle and/or its affiliates. All rights reserved. | @kpatenge @datanativesconf #DN18
‚Big Data‘ Groups
in relation with
Organizers and Cities
Weakly Connected
Components (WCC)
Copyright © 2018, Oracle and/or its affiliates. All rights reserved. | @kpatenge @datanativesconf #DN18
Ranking via PageRank (Top 10+1)
Copyright © 2018, Oracle and/or its affiliates. All rights reserved. | @kpatenge @datanativesconf #DN18
Ranking via PageRank (Top 10+1)
Copyright © 2018, Oracle and/or its affiliates. All rights reserved. | @kpatenge @datanativesconf #DN18
Ranking via PageRank (Top 10+1)
Copyright © 2018, Oracle and/or its affiliates. All rights reserved. | @kpatenge @datanativesconf #DN18
Ranking via PageRank (Top 10+1)
Copyright © 2018, Oracle and/or its affiliates. All rights reserved. |
✓Which cities are tech hot spots?
✓Who are important people in the
Meetup landscape?
✓Which Meetup groups cover with
topics?
✓Which Meetup groups are relevant in
terms of
#Members, #Participants of events,
#Events
✓Which Meetup groups are related
and how?
✓Which topics are related and how?
• The way you model the graph has
influence on the results of executing
Graph algorithms
• The choice of edge directions does
matter depending on the algorithms
• Attaching weights to edges is useful
for certain algorithms
Some Results
@kpatenge @datanativesconf #DN18
Copyright © 2018, Oracle and/or its affiliates. All rights reserved. |
Key Takeaways
• Graph data model perfect to focus on connectivity
• Graph databases are powerful tools, complementing relational and other
databases
– Especially strong for analysis of graph topology and connectedness
• Visual analysis helps a great deal to understand how data are connected
– New insights, especially with relationships, dependencies and behavioral patterns
• Big variety of analytic tools and frameworks to answer all kind of questions
• Oracle Graph Technologies combined with Open Source or 3rd party tools
@kpatenge @datanativesconf #DN18
Copyright © 2018, Oracle and/or its affiliates. All rights reserved. |
Follow us @kpatenge @SpatialHannes @JeanIhm
karin.patenge@oracle.com
GitHub:
https://github.com/karinpatenge/DN2018
Blogs:
https://blogs.oracle.com/bigdataspatialgraph/
https://blogs.oracle.com/oraclespatial/
AskTom Office Hours for Property Graph:
https://asktom.oracle.com/pls/apex/f?p=100:551
@kpatenge @datanativesconf #DN18
20181123 dn2018 graph_analytics_k_patenge

More Related Content

PDF
20181019 code.talks graph_analytics_k_patenge
PDF
Introduction to Property Graph Features (AskTOM Office Hours part 1)
PDF
How To Model and Construct Graphs with Oracle Database (AskTOM Office Hours p...
PDF
How To Visualize Graphs
PDF
Build Knowledge Graphs with Oracle RDF to Extract More Value from Your Data
PDF
Gain Insights with Graph Analytics
PPTX
Hybrid Enterprise Knowledge Graphs
PDF
PGQL: A Language for Graphs
20181019 code.talks graph_analytics_k_patenge
Introduction to Property Graph Features (AskTOM Office Hours part 1)
How To Model and Construct Graphs with Oracle Database (AskTOM Office Hours p...
How To Visualize Graphs
Build Knowledge Graphs with Oracle RDF to Extract More Value from Your Data
Gain Insights with Graph Analytics
Hybrid Enterprise Knowledge Graphs
PGQL: A Language for Graphs

What's hot (20)

PDF
Visual Ontology Modeling for Domain Experts and Business Users with metaphactory
PDF
Using Spark-Solr at Scale: Productionizing Spark for Search with Apache Solr...
PDF
Transforming AI with Graphs: Real World Examples using Spark and Neo4j
PDF
Oracle Spatial Studio: Fast and Easy Spatial Analytics and Maps
PDF
Neo4j GraphDay Seattle- Sept19- Connected data imperative
PDF
Connected datalondon metadata-driven apps
PPTX
Strata sf - Amundsen presentation
PPTX
How Lyft Drives Data Discovery
PPTX
Data council sf amundsen presentation
PDF
Neo4j: What's Under the Hood
PDF
Neo4j GraphTour Santa Monica 2019 - Amundsen Presentation
PDF
SHACL-based data life cycle management
PDF
Neo4j GraphTalk Basel - Building intelligent Software with Graphs
PDF
Meetup SF - Amundsen
PDF
Graph Algorithms for Developers
PDF
Neo4j GraphDay Seattle- Sept19- in the enterprise
PPSX
In search of database nirvana - The challenges of delivering Hybrid Transacti...
PPTX
Joseph Bradley, Software Engineer, Databricks Inc. at MLconf SEA - 5/01/15
PPTX
GraphTour - Neo4j Platform Overview
PDF
Neo4j GraphDay Seattle- Sept19- graphs are ai
Visual Ontology Modeling for Domain Experts and Business Users with metaphactory
Using Spark-Solr at Scale: Productionizing Spark for Search with Apache Solr...
Transforming AI with Graphs: Real World Examples using Spark and Neo4j
Oracle Spatial Studio: Fast and Easy Spatial Analytics and Maps
Neo4j GraphDay Seattle- Sept19- Connected data imperative
Connected datalondon metadata-driven apps
Strata sf - Amundsen presentation
How Lyft Drives Data Discovery
Data council sf amundsen presentation
Neo4j: What's Under the Hood
Neo4j GraphTour Santa Monica 2019 - Amundsen Presentation
SHACL-based data life cycle management
Neo4j GraphTalk Basel - Building intelligent Software with Graphs
Meetup SF - Amundsen
Graph Algorithms for Developers
Neo4j GraphDay Seattle- Sept19- in the enterprise
In search of database nirvana - The challenges of delivering Hybrid Transacti...
Joseph Bradley, Software Engineer, Databricks Inc. at MLconf SEA - 5/01/15
GraphTour - Neo4j Platform Overview
Neo4j GraphDay Seattle- Sept19- graphs are ai
Ad

Similar to 20181123 dn2018 graph_analytics_k_patenge (20)

PPTX
Graph Analytics on Data from Meetup.com
PDF
An Introduction to Graph: Database, Analytics, and Cloud Services
PPTX
Oracle Data Science Platform
PPTX
Neo4j GraphTalk Oslo - Building Intelligent Solutions with Graphs
PDF
OracleCode_Berlin_Jun2018_AnalyzeBitcoinTransactionDataUsingAsGraph
PDF
Applying large scale text analytics with graph databases
PDF
Domain Specific Languages for Parallel Graph AnalytiX (PGX)
PDF
GraphGen: Conducting Graph Analytics over Relational Databases
PDF
GraphGen: Conducting Graph Analytics over Relational Databases
PPTX
Nodes2020 | Graph of enterprise_metadata | NEO4J Conference
PPTX
Get Started with the Most Advanced Edition Yet of Neo4j Graph Data Science
PDF
Neo4j GraphTalk Düsseldorf - Building intelligent solutions with Graphs
PDF
Data meets AI - AICUG - Santa Clara
PDF
Graph Gurus Episode 1: Enterprise Graph
PDF
Build Deep Learning Applications for Big Data Platforms (CVPR 2018 tutorial)
PDF
Introducing new AIOps innovations in Oracle 19c - San Jose AICUG
PDF
From Developer to Data Scientist - Gaines Kergosien
PDF
Using graphs for recommendations
PDF
Embedded-ml(ai)applications - Bjoern Staender
PPTX
Neo4j GraphTalk Oslo - Introduction to Graphs
Graph Analytics on Data from Meetup.com
An Introduction to Graph: Database, Analytics, and Cloud Services
Oracle Data Science Platform
Neo4j GraphTalk Oslo - Building Intelligent Solutions with Graphs
OracleCode_Berlin_Jun2018_AnalyzeBitcoinTransactionDataUsingAsGraph
Applying large scale text analytics with graph databases
Domain Specific Languages for Parallel Graph AnalytiX (PGX)
GraphGen: Conducting Graph Analytics over Relational Databases
GraphGen: Conducting Graph Analytics over Relational Databases
Nodes2020 | Graph of enterprise_metadata | NEO4J Conference
Get Started with the Most Advanced Edition Yet of Neo4j Graph Data Science
Neo4j GraphTalk Düsseldorf - Building intelligent solutions with Graphs
Data meets AI - AICUG - Santa Clara
Graph Gurus Episode 1: Enterprise Graph
Build Deep Learning Applications for Big Data Platforms (CVPR 2018 tutorial)
Introducing new AIOps innovations in Oracle 19c - San Jose AICUG
From Developer to Data Scientist - Gaines Kergosien
Using graphs for recommendations
Embedded-ml(ai)applications - Bjoern Staender
Neo4j GraphTalk Oslo - Introduction to Graphs
Ad

More from Karin Patenge (17)

PDF
20190704_AGIT_Georaster_ImageryData_KPatenge
PDF
20190703_AGIT_GeoRasterWorkshop_GriddedData_KPatenge
PDF
20190604_DOAGDatabase2019_OracleNoSQLDB_for_DBAs
PDF
Big Data Community Webinar vom 16. Mai 2019: Oracle NoSQL DB im Überblick
PDF
20181210_ITTage2018_OracleNoSQLDB_KPatenge
PDF
20181120_DOAG_OracleNoSQLDB_KPatenge
PDF
20180921_DOAG_BigDataDays_OracleSpatialandPython_kpatenge
PDF
5 Gründe für Oracle Spatial Technologies
PDF
IT-Tage 2017: Visuelle Analyse komplexer Datenbestände am Beispiel der Panama...
PDF
20171106_OracleWebcast_ITTrends_EFavuzzi_KPatenge
PDF
20171121_DOAGKonferenz_JSON_OracleNoSQL_KPatenge
PDF
Oracle NoSQL Database: Integration in den Oracle Enterprise Manager 12
PDF
20160310_ModernApplicationDevelopment_NoSQL_KPatenge
PDF
20160229_ModernApplicationDevelopment_Python_KPatenge
PDF
Datenbank-gestützte Validierung und Geokodierung von Adressdatenbeständen
PPTX
Raster Algebra mit Oracle Spatial und uDig
PPTX
Geodatenmanagement und -Visualisierung mit Oracle Spatial Technologies
20190704_AGIT_Georaster_ImageryData_KPatenge
20190703_AGIT_GeoRasterWorkshop_GriddedData_KPatenge
20190604_DOAGDatabase2019_OracleNoSQLDB_for_DBAs
Big Data Community Webinar vom 16. Mai 2019: Oracle NoSQL DB im Überblick
20181210_ITTage2018_OracleNoSQLDB_KPatenge
20181120_DOAG_OracleNoSQLDB_KPatenge
20180921_DOAG_BigDataDays_OracleSpatialandPython_kpatenge
5 Gründe für Oracle Spatial Technologies
IT-Tage 2017: Visuelle Analyse komplexer Datenbestände am Beispiel der Panama...
20171106_OracleWebcast_ITTrends_EFavuzzi_KPatenge
20171121_DOAGKonferenz_JSON_OracleNoSQL_KPatenge
Oracle NoSQL Database: Integration in den Oracle Enterprise Manager 12
20160310_ModernApplicationDevelopment_NoSQL_KPatenge
20160229_ModernApplicationDevelopment_Python_KPatenge
Datenbank-gestützte Validierung und Geokodierung von Adressdatenbeständen
Raster Algebra mit Oracle Spatial und uDig
Geodatenmanagement und -Visualisierung mit Oracle Spatial Technologies

Recently uploaded (20)

PPTX
Acceptance and paychological effects of mandatory extra coach I classes.pptx
PDF
Data Engineering Interview Questions & Answers Cloud Data Stacks (AWS, Azure,...
PPTX
SAP 2 completion done . PRESENTATION.pptx
PPTX
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
PDF
Galatica Smart Energy Infrastructure Startup Pitch Deck
PPTX
Topic 5 Presentation 5 Lesson 5 Corporate Fin
PPTX
mbdjdhjjodule 5-1 rhfhhfjtjjhafbrhfnfbbfnb
PPTX
modul_python (1).pptx for professional and student
PPTX
QUANTUM_COMPUTING_AND_ITS_POTENTIAL_APPLICATIONS[2].pptx
PPTX
Data_Analytics_and_PowerBI_Presentation.pptx
PDF
Oracle OFSAA_ The Complete Guide to Transforming Financial Risk Management an...
PDF
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
PDF
REAL ILLUMINATI AGENT IN KAMPALA UGANDA CALL ON+256765750853/0705037305
PPTX
Pilar Kemerdekaan dan Identi Bangsa.pptx
PDF
Introduction to the R Programming Language
PPTX
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
PDF
Data Engineering Interview Questions & Answers Batch Processing (Spark, Hadoo...
PDF
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
PPTX
Introduction-to-Cloud-ComputingFinal.pptx
PDF
Lecture1 pattern recognition............
Acceptance and paychological effects of mandatory extra coach I classes.pptx
Data Engineering Interview Questions & Answers Cloud Data Stacks (AWS, Azure,...
SAP 2 completion done . PRESENTATION.pptx
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
Galatica Smart Energy Infrastructure Startup Pitch Deck
Topic 5 Presentation 5 Lesson 5 Corporate Fin
mbdjdhjjodule 5-1 rhfhhfjtjjhafbrhfnfbbfnb
modul_python (1).pptx for professional and student
QUANTUM_COMPUTING_AND_ITS_POTENTIAL_APPLICATIONS[2].pptx
Data_Analytics_and_PowerBI_Presentation.pptx
Oracle OFSAA_ The Complete Guide to Transforming Financial Risk Management an...
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
REAL ILLUMINATI AGENT IN KAMPALA UGANDA CALL ON+256765750853/0705037305
Pilar Kemerdekaan dan Identi Bangsa.pptx
Introduction to the R Programming Language
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
Data Engineering Interview Questions & Answers Batch Processing (Spark, Hadoo...
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
Introduction-to-Cloud-ComputingFinal.pptx
Lecture1 pattern recognition............

20181123 dn2018 graph_analytics_k_patenge

  • 1. Copyright © 2017, Oracle and/or its affiliates. All rights reserved. | Visual Analysis of Social Media Data from using Graph Technologies DATA NATIVES 2018 | Nov 22-23, 2018 | Berlin Karin Patenge | Principal Solution Engineer | Cloud & Core Technologies @kpatenge |  karin.patenge@oracle.com Oracle Deutschland B.V. & Co. KG | Potsdam | Schiffbauergasse 14
  • 2. Copyright © 2018, Oracle and/or its affiliates. All rights reserved. | @kpatenge @datanativesconf #DN18
  • 3. Copyright © 2018, Oracle and/or its affiliates. All rights reserved. | Accessing Data Entities • Data retrieval via REST API https://www.meetup.com/meetup_api • Different API methods & versions • API Key required • Sample request • Data returned as JSON @kpatenge @datanativesconf #DN18
  • 4. Copyright © 2018, Oracle and/or its affiliates. All rights reserved. | Potential Questions of Interest • Which Meetup groups are most active in terms of: – # members – # events – # event attendees • Who and where are influencers in the Meetup community? • Where are connections between the Meetup groups in different locations? • Which topics are “hot” and where? • How close/similar are groups? • … @kpatenge @datanativesconf #DN18
  • 5. Copyright © 2018, Oracle and/or its affiliates. All rights reserved. | Approach: Modeling Data as Graphs The more connected the data is, the better a Graph fits Source: http://www.ateam-oracle.com/intro-to-graphs-at-oracle/ @kpatenge @datanativesconf #DN18
  • 6. Copyright © 2018, Oracle and/or its affiliates. All rights reserved. | • A set of nodes (aka vertices) – each vertex has a unique identifier – each vertex has a set of in/out edges – each vertex has a collection of key-value properties • A set of edges – each edge has a unique identifier – each edge has a head/tail vertex – each edge has a label denoting type of relationship between two vertices – each edge has a collection of key-value properties • Implementations – Oracle (Spatial and Graph/Big Data Spatial and Graph), Neo4j, DataStax (Titan), InfiniteGraph, … What is a Property Graph? https://github.com/tinkerpop/blueprints/wiki/Property-Graph-Model @kpatenge @datanativesconf #DN18
  • 7. Copyright © 2018, Oracle and/or its affiliates. All rights reserved. | • PageRank, Weighted PageRank – Find influencers, critical vertices • Personalized PageRank – Find important people/products/... with respect to a given starting point • Sparsification – Trim down the graph to make it more fragmented • Clustering – Find communities which can be the basis of segmentation, and/or recommendation/anomaly detection, churn analysis • Centrality – Find critical people/devices/... • Shortest path – Discover links, find suspect‘s close collaborators, transportation routing • Breadth-First-Search (BFS) – Impact analysis, link analysis • Matric factorization – Recommendation • Reachability – Connectivity test • ... Graph Algorithms and their Applications @kpatenge @datanativesconf #DN18
  • 8. Copyright © 2018, Oracle and/or its affiliates. All rights reserved. | Pathfinding – fattestPath – shortestPathBellmanFord – shortestPathBellmanFordReverse – shortestPathDijkstra – shortestPathDijkstraBidirectional – shortestPathFilteredDijkstra – shortestPathFilteredDijkstraBidirectional – shortestPathHopDist – shortestPathHopDistReverse Ranking – closenessCentralityUnitLength – degreeCentrality – eigenvectorCentrality – Hyperlink-Induced Topic Search (HITS) – inDegreeCentrality – nodeBetweennessCentrality – outDegreeCentrality – PageRank, weighted PageRank – approximatePagerank – personalizedPagerank – randomWalkWithRestart Social Network Analysis Algorithms (1) @kpatenge @datanativesconf #DN18 https://tinyurl.com/pgxdocs
  • 9. Copyright © 2018, Oracle and/or its affiliates. All rights reserved. | Structure Evaluation – Conductance – countTriangles – inDegreeDistribution – outDegreeDistribution – partitionConductance – partitionModularity – sparsify – K-Core computes Community Detection – communitiesLabelPropagation Recommendation – salsa – personalizedSalsa – whomToFollow Classic - Connected Components – sccKosaraju – sccTarjan – wcc Social Network Analysis Algorithms (2) @kpatenge @datanativesconf #DN18 https://tinyurl.com/pgxdocs
  • 10. Copyright © 2018, Oracle and/or its affiliates. All rights reserved. | Architecture of Oracle Property Graph Analytics Property Graph formats GraphML GML GraphSON Flat Files @kpatenge @datanativesconf #DN18
  • 11. Copyright © 2018, Oracle and/or its affiliates. All rights reserved. | PGX • Toolkit for In-Memory, Parallel Graph Analytics containing – PGX shell – Analyst API with a large collection of built-in algorithms (45+) – Enhance with user defined algorithms written in Green-Marl – Tutorials, JavaDoc, Use Cases, and more • Developed by Oracle Labs • https://docs.oracle.com/cd/E56133_01/latest/i ndex.html PGQL – Property Graph Query Language • http://pgql-lang.org/ • Graph Pattern Matching combined with SQL • Developed by Oracle Labs • Proposed for standardization • Changes in Version 1.1: http://pgql-lang.org/spec/1.1/#breaking-syntax- changes-since-pgql-10 Property Graph Analytics Engine @kpatenge @datanativesconf #DN18
  • 12. Copyright © 2018, Oracle and/or its affiliates. All rights reserved. | Data Processing and Analysis Workflow: Overview Retrieve&Prepare Prepare source data • Using R for data retrieval via REST API and conversion JSON  CSV  OPV/OPE Load&Build Load nodes and edges data into a graph • Using Oracle NoSQL DB as Graph data store Analyze Analyze graph data • Using Graph Analytics Engine (PGX) and Property Graph Query Language (PGQL) Visualize Visualize graph data • Using Cytoscape Results Summarize results @kpatenge @datanativesconf #DN18
  • 13. Copyright © 2018, Oracle and/or its affiliates. All rights reserved. | Demo @kpatenge @datanativesconf #DN18
  • 14. Copyright © 2018, Oracle and/or its affiliates. All rights reserved. | @kpatenge @datanativesconf #DN18 ‚Big Data‘ Groups in relation with Topics and Cities
  • 15. Copyright © 2018, Oracle and/or its affiliates. All rights reserved. | @kpatenge @datanativesconf #DN18 ‚Big Data‘ Groups in relation with Organizers, Cities and Events
  • 16. Copyright © 2018, Oracle and/or its affiliates. All rights reserved. | @kpatenge @datanativesconf #DN18 ‚Big Data‘ Groups in relation with Organizers and Cities Weakly Connected Components (WCC)
  • 17. Copyright © 2018, Oracle and/or its affiliates. All rights reserved. | @kpatenge @datanativesconf #DN18 Ranking via PageRank (Top 10+1)
  • 18. Copyright © 2018, Oracle and/or its affiliates. All rights reserved. | @kpatenge @datanativesconf #DN18 Ranking via PageRank (Top 10+1)
  • 19. Copyright © 2018, Oracle and/or its affiliates. All rights reserved. | @kpatenge @datanativesconf #DN18 Ranking via PageRank (Top 10+1)
  • 20. Copyright © 2018, Oracle and/or its affiliates. All rights reserved. | @kpatenge @datanativesconf #DN18 Ranking via PageRank (Top 10+1)
  • 21. Copyright © 2018, Oracle and/or its affiliates. All rights reserved. | ✓Which cities are tech hot spots? ✓Who are important people in the Meetup landscape? ✓Which Meetup groups cover with topics? ✓Which Meetup groups are relevant in terms of #Members, #Participants of events, #Events ✓Which Meetup groups are related and how? ✓Which topics are related and how? • The way you model the graph has influence on the results of executing Graph algorithms • The choice of edge directions does matter depending on the algorithms • Attaching weights to edges is useful for certain algorithms Some Results @kpatenge @datanativesconf #DN18
  • 22. Copyright © 2018, Oracle and/or its affiliates. All rights reserved. | Key Takeaways • Graph data model perfect to focus on connectivity • Graph databases are powerful tools, complementing relational and other databases – Especially strong for analysis of graph topology and connectedness • Visual analysis helps a great deal to understand how data are connected – New insights, especially with relationships, dependencies and behavioral patterns • Big variety of analytic tools and frameworks to answer all kind of questions • Oracle Graph Technologies combined with Open Source or 3rd party tools @kpatenge @datanativesconf #DN18
  • 23. Copyright © 2018, Oracle and/or its affiliates. All rights reserved. | Follow us @kpatenge @SpatialHannes @JeanIhm karin.patenge@oracle.com GitHub: https://github.com/karinpatenge/DN2018 Blogs: https://blogs.oracle.com/bigdataspatialgraph/ https://blogs.oracle.com/oraclespatial/ AskTom Office Hours for Property Graph: https://asktom.oracle.com/pls/apex/f?p=100:551 @kpatenge @datanativesconf #DN18