SlideShare a Scribd company logo
IBM Research – Industries & Solutions – Business Solutions & Mathematical Sciences
Recent Updates on IBM System G
— GraphBIG and Temporal Data
Yinglong Xia
IBM T.J. Watson Research Center
Yorktown Heights, NY 10598
© 2014 International Business Machines Corporation
IBM Research – Industries & Solutions – Business Solutions & Mathematical Sciences
2
IBM T.J. Watson Research Center
© 2014 International Business Machines Corporation
IBM Research – Industries & Solutions – Business Solutions & Mathematical Sciences
3
Using LDBC-SNB for GraphBIG
• GraphBIG = Graph Benchmark Suite from IBM System G and GaTech HPArch
• A wide selection of workloads from both CPU and GPU
• Workload ranging from graph traversal to Gibbs Sampling on Bayesian Network
• Illustrating processor architecture impact using h/w performance counter
• Fix input data and implementation
• Show performance profiling at processor architecture level
© 2014 International Business Machines Corporation
IBM Research – Industries & Solutions – Business Solutions & Mathematical Sciences
4
Beyond the Benchmarking for Graph DBs
• Graph computing was barely considered in architecture design
• Increasing motivation due to popularity of graph analytics
• Impact of architecture requires fixed input data and analytic implementation
© 2014 International Business Machines Corporation
IBM Research – Industries & Solutions – Business Solutions & Mathematical Sciences
5
© 2014 International Business Machines Corporation
IBM Research – Industries & Solutions – Business Solutions & Mathematical Sciences
6
Demanding Graph
• Interactions of entities in many big data applications are naturally modeled by property graphs
• Evolution of graph structure and properties over time usually provides useful information, which needs
to be maintained for query or analytics
• Graph analytics market grows increasingly fast as well as the graph data size and complexity, but
near real time response is typically required
Xiaoyan Fu, Seok-Hee Hong, Nikola S. Nikolov, Xiaobin Shen, Ying Xin Wu and Kai Xu,
Visualization and Analysis of Email Networks, Proceedings of APVIS 2007, IEEE, pp.1-8, 2007.
© 2014 International Business Machines Corporation
IBM Research – Industries & Solutions – Business Solutions & Mathematical Sciences
7
Use Case: Forensic Analysis on Individual Status
• Recover the dynamics of individual status
• Evaluate status measures, anomalies, etc.
• Propagate known status measures
• Estimate labels for each person at each time stamp
• Aggregate the received measures
Chain Graph: A collection of graphs on 

contiguous time steps
© 2014 International Business Machines Corporation
IBM Research – Industries & Solutions – Business Solutions & Mathematical Sciences
8
Use Case: Bitemporal Data Exploration
• Support the valid dimension and the transaction dimension
• Audit trail of what you know and when did you know
• History of how history from a business perspective was stored in the
database
http://bitemporalmodeling.com/temporal-data-blog/
© 2014 International Business Machines Corporation
IBM Research – Industries & Solutions – Business Solutions & Mathematical Sciences
9
Graph Data Management
SparkseeNeo4j
Titan
© 2014 International Business Machines Corporation
IBM Research – Industries & Solutions – Business Solutions & Mathematical Sciences
10
Organization of Graph Store
© 2014 International Business Machines Corporation
IBM Research – Industries & Solutions – Business Solutions & Mathematical Sciences
11
Organize Temporal Graph Data Name Default Value
vertex_history Disabled
num_vertex_property_bundles 0
edge_history Disabled
num_edge_property_bundles 0
… …
Flag

(uint8)
inEdge

(uint64)
inEdge Count
(uint16)
outEdge

(uint64)
outEdge Count

(uint16)
Property

(uint64)
Property Count

(unit64)
History

(unit64)
…
…
…
Vertex Record Table
inEdge List
Flag Property Property

Count
History …
…
…
Prev Edge_list_buffer<EID,VID,LID>
…
…
Edge Record Table
Accessed Vertex 

Record by VID * 

izeof (VtxRec)
inEdgeCount * sizeof (<EID,VID,LID>)

point to the buffer end
Accessed Edge 

Record by EID * 

izeof (EdgeRec)
Vertex Property Table
Prev property_buffer
…
…
PropertyCount

point to the buffer end
Edge Property Table
Prev property_buffer
…
…
Name Default
Value
min_VID 0
max_VID
min_EID 0
Max_EID
… …
Local Configuration
© 2014 International Business Machines Corporation
IBM Research – Industries & Solutions – Business Solutions & Mathematical Sciences
12
Pointer Jumping in Temporal Graph Inference
• Converting a temporal graph into tridiagonal system
• Forward Gaussian elimination by propagation
• Backward substitution to produce solutions
• A Parallel Solution to Thomas Algorithm
• Apply pointer jumping to Thomas algorithm
• Logarithmic speedup
parallel solution to solve a tridiagonal linear system
• Converting a chain graph into tridiagonal system
• Forward Gaussian elimination by propagation
• Backward substitution to produce solutions 

• A Parallel Solution to Thomas Algorithm
• Apply pointer jumping to Thomas algorithm
• Logarithmic speedup
• Propagate belief among vertices within and cross
time stamps
Speedup wrt Gaussian 

Elimination: T3 / logT
© 2014 International Business Machines Corporation
IBM Research – Industries & Solutions – Business Solutions & Mathematical Sciences
13
Comments and Questions?

More Related Content

PDF
AUTOWARE Neutral Facilities: Reconfigurable Robot Workcell
PDF
Automation of MultiDimensional DB Design (poster)
PPSX
S1000D Users Forum Lessons Learned MBE_A_Smith_17SEP15
PPTX
Laura sebu icstcc_merging
PDF
FITT Toolbox: Cost Approach
PPTX
BIM For Free_28May2013_EdBartlett_BIM and lifecycle management
PDF
Commonly made mistakes in Business Plans
PDF
Ref letter (1)
AUTOWARE Neutral Facilities: Reconfigurable Robot Workcell
Automation of MultiDimensional DB Design (poster)
S1000D Users Forum Lessons Learned MBE_A_Smith_17SEP15
Laura sebu icstcc_merging
FITT Toolbox: Cost Approach
BIM For Free_28May2013_EdBartlett_BIM and lifecycle management
Commonly made mistakes in Business Plans
Ref letter (1)

Viewers also liked (14)

PDF
Dlp phi
PPTX
хто отчет за 9 месяцев 2015 2016 года
PDF
기년회 130101 위승용
PPTX
5 reasons why your content calendar is failing
PPTX
Question 4
PDF
Recruitment Brochure
PDF
(주)아키테리어금빛가람 회사소개
PDF
Mini Projeto Horta Escolar
PPT
Varied Indian attires
PDF
IntellianSystems Digital Signage Brochure
PPTX
2016 회사소개서 통합 ver 01
PDF
라이트브레인 UX 아카데미 4기 오픈프로젝트 - Smart Toy for kidult,Wakee
PDF
2015 1st UX 트렌드 리포트_헬스케어(healthcare)편
PPTX
Analysis of student digipaks
Dlp phi
хто отчет за 9 месяцев 2015 2016 года
기년회 130101 위승용
5 reasons why your content calendar is failing
Question 4
Recruitment Brochure
(주)아키테리어금빛가람 회사소개
Mini Projeto Horta Escolar
Varied Indian attires
IntellianSystems Digital Signage Brochure
2016 회사소개서 통합 ver 01
라이트브레인 UX 아카데미 4기 오픈프로젝트 - Smart Toy for kidult,Wakee
2015 1st UX 트렌드 리포트_헬스케어(healthcare)편
Analysis of student digipaks
Ad

Similar to Recent Updates on IBM System G — GraphBIG and Temporal Data (20)

PDF
DyGraph: A Dynamic Graph Generator and Benchmark Suite : NOTES
PDF
Leverage graph technologies to discover hidden insights in your EHS & Sustain...
PDF
Advanced Analytics: Graph Database Use Cases
PDF
STING: Spatio-Temporal Interaction Networks and Graphs for Intel Platforms
PDF
Scalable Time-Versioning Support for Property Graph Databases
ODP
SC10 project slides
PDF
Graph Analysis over Relational Database. Roberto Franchini - Arcade Analytics
PDF
Time-evolving Graph Processing on Commodity Clusters: Spark Summit East talk ...
PDF
Time-Evolving Graph Processing On Commodity Clusters
PDF
Graph store
PPTX
Family tree of data – provenance and neo4j
PPTX
Follow the money with graphs
PDF
LDBC 8th TUC Meeting: Introduction and status update
PDF
Introduction to the graph technologies landscape
PDF
Introduction to the graph technologies landscape
PPTX
Using Graph Analysis and Fraud Detection in the Fintech Industry
PPTX
Using Graph Analysis and Fraud Detection in the Fintech Industry
PPTX
MathWorks Interview Lecture
PDF
Graph Data Modeling in Four Dimensions – Outline, Differences, Artisanship, A...
PPTX
Neo4j graphs in the real world - graph days d.c. - april 14, 2015
DyGraph: A Dynamic Graph Generator and Benchmark Suite : NOTES
Leverage graph technologies to discover hidden insights in your EHS & Sustain...
Advanced Analytics: Graph Database Use Cases
STING: Spatio-Temporal Interaction Networks and Graphs for Intel Platforms
Scalable Time-Versioning Support for Property Graph Databases
SC10 project slides
Graph Analysis over Relational Database. Roberto Franchini - Arcade Analytics
Time-evolving Graph Processing on Commodity Clusters: Spark Summit East talk ...
Time-Evolving Graph Processing On Commodity Clusters
Graph store
Family tree of data – provenance and neo4j
Follow the money with graphs
LDBC 8th TUC Meeting: Introduction and status update
Introduction to the graph technologies landscape
Introduction to the graph technologies landscape
Using Graph Analysis and Fraud Detection in the Fintech Industry
Using Graph Analysis and Fraud Detection in the Fintech Industry
MathWorks Interview Lecture
Graph Data Modeling in Four Dimensions – Outline, Differences, Artisanship, A...
Neo4j graphs in the real world - graph days d.c. - april 14, 2015
Ad

More from Graph-TA (20)

PDF
Computing on Event-sourced Graphs
PDF
Using Evolutionary Computing for Feature-driven Graph generation
PDF
Reactive Databases for Big Data applications
PDF
The scarcity of crossing dependencies: a direct outcome of a specific constra...
PDF
Holistic Benchmarking of Big Linked Data: HOBBIT
PDF
Identifiability in Dynamic Casual Networks
PDF
Polyglot Graph Databases using OCL as pivot
PDF
Benchmarking Versioning for Big Linked Data
PDF
Synthetic Data Generation using exponential random Graph modeling
PDF
Use of Graphs for Cloud Service Selection in Multi-Cloud Environments
PDF
Graphalytics: A big data benchmark for graph-processing platforms
PDF
Modelling the Clustering Coefficient of a Random graph
PPTX
RDF Graph Data Management in Oracle Database and NoSQL Platforms
PPTX
GRAPHITE — An Extensible Graph Traversal Framework for RDBMS
PPTX
On the Discovery of Novel Drug-Target Interactions from Dense SubGraphs
PDF
Graphalytics: A big data benchmark for graph processing platforms
PDF
Autograph: an evolving lightweight graph tool
PPTX
Understanding Graph Structure in Knowledge Bases
PDF
Finding patterns of chronic disease and medication prescriptions from a large...
PDF
Analysing the degree distribution of real graphs by means of several probabil...
Computing on Event-sourced Graphs
Using Evolutionary Computing for Feature-driven Graph generation
Reactive Databases for Big Data applications
The scarcity of crossing dependencies: a direct outcome of a specific constra...
Holistic Benchmarking of Big Linked Data: HOBBIT
Identifiability in Dynamic Casual Networks
Polyglot Graph Databases using OCL as pivot
Benchmarking Versioning for Big Linked Data
Synthetic Data Generation using exponential random Graph modeling
Use of Graphs for Cloud Service Selection in Multi-Cloud Environments
Graphalytics: A big data benchmark for graph-processing platforms
Modelling the Clustering Coefficient of a Random graph
RDF Graph Data Management in Oracle Database and NoSQL Platforms
GRAPHITE — An Extensible Graph Traversal Framework for RDBMS
On the Discovery of Novel Drug-Target Interactions from Dense SubGraphs
Graphalytics: A big data benchmark for graph processing platforms
Autograph: an evolving lightweight graph tool
Understanding Graph Structure in Knowledge Bases
Finding patterns of chronic disease and medication prescriptions from a large...
Analysing the degree distribution of real graphs by means of several probabil...

Recently uploaded (20)

PPTX
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
PDF
Encapsulation theory and applications.pdf
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PPTX
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PDF
Spectral efficient network and resource selection model in 5G networks
PDF
KodekX | Application Modernization Development
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PPTX
Cloud computing and distributed systems.
PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
PDF
Review of recent advances in non-invasive hemoglobin estimation
PDF
NewMind AI Weekly Chronicles - August'25 Week I
PDF
Empathic Computing: Creating Shared Understanding
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
Encapsulation theory and applications.pdf
Per capita expenditure prediction using model stacking based on satellite ima...
Diabetes mellitus diagnosis method based random forest with bat algorithm
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
Dropbox Q2 2025 Financial Results & Investor Presentation
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
Spectral efficient network and resource selection model in 5G networks
KodekX | Application Modernization Development
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
Cloud computing and distributed systems.
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
Review of recent advances in non-invasive hemoglobin estimation
NewMind AI Weekly Chronicles - August'25 Week I
Empathic Computing: Creating Shared Understanding
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
Understanding_Digital_Forensics_Presentation.pptx
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Reach Out and Touch Someone: Haptics and Empathic Computing

Recent Updates on IBM System G — GraphBIG and Temporal Data

  • 1. IBM Research – Industries & Solutions – Business Solutions & Mathematical Sciences Recent Updates on IBM System G — GraphBIG and Temporal Data Yinglong Xia IBM T.J. Watson Research Center Yorktown Heights, NY 10598
  • 2. © 2014 International Business Machines Corporation IBM Research – Industries & Solutions – Business Solutions & Mathematical Sciences 2 IBM T.J. Watson Research Center
  • 3. © 2014 International Business Machines Corporation IBM Research – Industries & Solutions – Business Solutions & Mathematical Sciences 3 Using LDBC-SNB for GraphBIG • GraphBIG = Graph Benchmark Suite from IBM System G and GaTech HPArch • A wide selection of workloads from both CPU and GPU • Workload ranging from graph traversal to Gibbs Sampling on Bayesian Network • Illustrating processor architecture impact using h/w performance counter • Fix input data and implementation • Show performance profiling at processor architecture level
  • 4. © 2014 International Business Machines Corporation IBM Research – Industries & Solutions – Business Solutions & Mathematical Sciences 4 Beyond the Benchmarking for Graph DBs • Graph computing was barely considered in architecture design • Increasing motivation due to popularity of graph analytics • Impact of architecture requires fixed input data and analytic implementation
  • 5. © 2014 International Business Machines Corporation IBM Research – Industries & Solutions – Business Solutions & Mathematical Sciences 5
  • 6. © 2014 International Business Machines Corporation IBM Research – Industries & Solutions – Business Solutions & Mathematical Sciences 6 Demanding Graph • Interactions of entities in many big data applications are naturally modeled by property graphs • Evolution of graph structure and properties over time usually provides useful information, which needs to be maintained for query or analytics • Graph analytics market grows increasingly fast as well as the graph data size and complexity, but near real time response is typically required Xiaoyan Fu, Seok-Hee Hong, Nikola S. Nikolov, Xiaobin Shen, Ying Xin Wu and Kai Xu, Visualization and Analysis of Email Networks, Proceedings of APVIS 2007, IEEE, pp.1-8, 2007.
  • 7. © 2014 International Business Machines Corporation IBM Research – Industries & Solutions – Business Solutions & Mathematical Sciences 7 Use Case: Forensic Analysis on Individual Status • Recover the dynamics of individual status • Evaluate status measures, anomalies, etc. • Propagate known status measures • Estimate labels for each person at each time stamp • Aggregate the received measures Chain Graph: A collection of graphs on 
 contiguous time steps
  • 8. © 2014 International Business Machines Corporation IBM Research – Industries & Solutions – Business Solutions & Mathematical Sciences 8 Use Case: Bitemporal Data Exploration • Support the valid dimension and the transaction dimension • Audit trail of what you know and when did you know • History of how history from a business perspective was stored in the database http://bitemporalmodeling.com/temporal-data-blog/
  • 9. © 2014 International Business Machines Corporation IBM Research – Industries & Solutions – Business Solutions & Mathematical Sciences 9 Graph Data Management SparkseeNeo4j Titan
  • 10. © 2014 International Business Machines Corporation IBM Research – Industries & Solutions – Business Solutions & Mathematical Sciences 10 Organization of Graph Store
  • 11. © 2014 International Business Machines Corporation IBM Research – Industries & Solutions – Business Solutions & Mathematical Sciences 11 Organize Temporal Graph Data Name Default Value vertex_history Disabled num_vertex_property_bundles 0 edge_history Disabled num_edge_property_bundles 0 … … Flag
 (uint8) inEdge
 (uint64) inEdge Count (uint16) outEdge
 (uint64) outEdge Count
 (uint16) Property
 (uint64) Property Count
 (unit64) History
 (unit64) … … … Vertex Record Table inEdge List Flag Property Property
 Count History … … … Prev Edge_list_buffer<EID,VID,LID> … … Edge Record Table Accessed Vertex 
 Record by VID * 
 izeof (VtxRec) inEdgeCount * sizeof (<EID,VID,LID>)
 point to the buffer end Accessed Edge 
 Record by EID * 
 izeof (EdgeRec) Vertex Property Table Prev property_buffer … … PropertyCount
 point to the buffer end Edge Property Table Prev property_buffer … … Name Default Value min_VID 0 max_VID min_EID 0 Max_EID … … Local Configuration
  • 12. © 2014 International Business Machines Corporation IBM Research – Industries & Solutions – Business Solutions & Mathematical Sciences 12 Pointer Jumping in Temporal Graph Inference • Converting a temporal graph into tridiagonal system • Forward Gaussian elimination by propagation • Backward substitution to produce solutions • A Parallel Solution to Thomas Algorithm • Apply pointer jumping to Thomas algorithm • Logarithmic speedup parallel solution to solve a tridiagonal linear system • Converting a chain graph into tridiagonal system • Forward Gaussian elimination by propagation • Backward substitution to produce solutions 
 • A Parallel Solution to Thomas Algorithm • Apply pointer jumping to Thomas algorithm • Logarithmic speedup • Propagate belief among vertices within and cross time stamps Speedup wrt Gaussian 
 Elimination: T3 / logT
  • 13. © 2014 International Business Machines Corporation IBM Research – Industries & Solutions – Business Solutions & Mathematical Sciences 13 Comments and Questions?