SlideShare a Scribd company logo
Copyright (c) 2016 by STORM Insights Inc. All Rights reserved.
Emerging Data Management Options
Adrian Bowles, PhD

Founder, STORM Insights, Inc.

info@storminsights.com
Copyright (c) 2016 by STORM Insights Inc. All Rights reserved.
Basic Life Advice
“When the map and the terrain disagree, believe the terrain.”
Gause and Weinberg (Exploring Requirements)
It is the pervading law of all things organic, and inorganic, of all things physical
and metaphysical, of all things human and all things superhuman, of all true
manifestations of the head, of the heart, of the soul, that the life is recognizable in
its expression, that form ever follows function. That is the law.
Louis Sullivan: The Tall Office Building Artistically Considered, 1896
Copyright (c) 2016 by STORM Insights Inc. All Rights reserved.
How You Think About a Domain…
…influences your choice of maps and models…

rules and representations…and required operations.
Copyright (c) 2016 by STORM Insights Inc. All Rights reserved.
” To solve really hard problems, we'll have to use several different representations.
This is because each particular kind of data structure has its own virtues and
deficiencies, and none by itself would seem adequate for all the different functions
involved with what we call common sense.”
Marvin Minsky
Copyright (c) 2016 by STORM Insights Inc. All Rights reserved.
Copyright (c) 2016 by STORM Insights Inc. All Rights reserved.
What Do You Want/Need to Store?

How much? How complex? How fast?
What Do You Want/Need to DO With What You Store?
Do you need a graph database?
Options Include…
Files, tables, trees, queues, stacks, lists…

Hierarchical

RDBMS

Object DBMS

NoSQL

Graph
Copyright (c) 2016 by STORM Insights Inc. All Rights reserved.
Perception/
NLP
Problem Solving
& Learning
Simple:
deterministic,
retrieve/calculate
Complex:
probabalistic
hypothesize, test,
rank, select
Creative:
discover, generate
ORGANIZED
Memory*
Input Class/Type
Visual
Text
Image
Aural
Speech
Music
Cues
Noise
Informative
Touch
Temperature
Tactile
Texture
Taste
Smell
Response Types
Visible (to the environment)
Verbal/NL Text
Behavioral (system changes)
Haptics/Touch/Proprioception
Invisible
Memory updates
*Corpus including data in taxonomies, ontologies, trees…
Copyright (c) 2016 by STORM Insights Inc. All Rights reserved.
Graphs 101
A graph is a structure with vertices and edges.
Copyright (c) 2016 by STORM Insights Inc. All Rights reserved.
Graphs 101
A graph is a structure with vertices and edges.
a
e
dc
b
Old Post Road
Cross Highway
Main Street
Shinbone Alley
Elk Road
Copyright (c) 2016 by STORM Insights Inc. All Rights reserved.
Graphs 101
A graph is a structure with vertices and edges.
a
e
dc
b
Old Post Road
Cross Highway
Main Street
Shinbone Alley
Elk Road
Old Post Road Paved
Old Post Road 11 miles
Elk Road Dirt
Elk Road 2 miles
Cross Highway toll road
Cross Highway 250 miles
Main Street 1 mile
Shinbone Alley .5 miles
a bus stop
b gas station
b Shell
c Elementary school
d House
e Office building
May be labeled, edges may be directed, all may
be stored/processed by properties
represented as key/value pairs.
Copyright (c) 2014-2016 by STORM Insights Inc. All Rights reserved.
Obvious structure is easy to process…
but most of the interesting stuff isn’t obvious to a computer.
Vertices,
edges,
properties
should
represent data
with higher-
level structure.
Copyright (c) 2016 by STORM Insights Inc. All Rights reserved.
You Probably Already Think In Graphs if…
You watch detective shows
You know trivia about movies
You remember relationships between people
You took a biology class
Copyright (c) 2016 by STORM Insights Inc. All Rights reserved.
You Probably Already Think In Graphs if…
You took a biology class or played 20 questions (“animal, mineral or vegetable?”)
Wikipedia contributors. "Taxonomy (biology)." Wikipedia,
The Free Encyclopedia. Wikipedia, The Free Encyclopedia,
11 May. 2016. Web. 12 May. 2016.
Copyright (c) 2016 by STORM Insights Inc. All Rights reserved.
You Probably Already Think In Graphs if…
You watch detective shows
Typical crazy wall whiteboard - from Fargo.
A screen from IBM I2 Coplink
Copyright (c) 2016 by STORM Insights Inc. All Rights reserved.
You Probably Already Think In Graphs if…
You know trivia about movies
IMDB
Copyright (c) 2016 by STORM Insights Inc. All Rights reserved.
You Probably Already Think In Graphs if…
You remember relationships between people
Family Tree
LinkedIn Tree
Copyright (c) 2016 by STORM Insights Inc. All Rights reserved.
Anonymized look at my
desk/wall on a typical day.
Copyright (c) 2016 by STORM Insights Inc. All Rights reserved.
Processes Can Be Represented As Graphs
Copyright (c) 2016 by STORM Insights Inc. All Rights reserved.
A taxonomy represents the formal structure of classes or types of objects within a domain. Taxonomies are generally hierarchical and provide names
for each class in the domain. They may also capture the membership properties of each object in relation to the other objects. The rules of a specific
taxonomy are used to classify or categorize any object in the domain, so they must be complete, consistent, and unambiguous. This rigor in
specification should ensure that any newly discovered object must fit into one, and only one, category or object class.
Copyright (c) 2016 by STORM Insights Inc. All Rights reserved.
1952 DSM I
1968 DSM II
Pervasive Developmental Disorder (PDD)
Childhood onset PDD Infantile Autism Atypical Autism
1980 DSM III
Taxonomies Evolve
The History of Autism in the Diagnostic & Statistical Manual of the American Psychiatric Association
Pervasive Developmental Disorder (PDD)
PDD-NOS Autistic Disorder
(Not Otherwise Specified)
1987 DSM III-R
Pervasive Developmental Disorder (PDD)
PDD-NOS Autistic Disorder Asperger Disorder Childhood Disintegrative Disorder Rett Syndrome
1994 DSM IV
2000 DSM IV-TR
Autism Spectrum Disorder (ASD)
2013 DSMV
Copyright (c) 2016 by STORM Insights Inc. All Rights reserved.
1952 DSM I
1968 DSM II
Pervasive Developmental Disorder (PDD)
Childhood onset PDD Infantile Autism Atypical Autism
1980 DSM III
Pervasive Developmental Disorder (PDD)
PDD-NOS Autistic Disorder
(Not Otherwise Specified)
1987 DSM III-R
Pervasive Developmental Disorder (PDD)
PDD-NOS Autistic Disorder Asperger Disorder Childhood Disintegrative Disorder Rett Syndrome
1994 DSM IV
2000 DSM IV-TR
Autism Spectrum Disorder (ASD)
2013 DSMV
An ontology provides more detail than a taxonomy, although the boundary between them in practice is somewhat fuzzy. An ontology should
comprehensively capture the common understanding – vocabulary, definitions, rules - of a community as it applies to a specific domain.
Copyright (c) 2016 by STORM Insights Inc. All Rights reserved.
Key Concept… Graphs have well known mathematical properties:
e.g. If you represent a graph as a matrix M, then values in Mn
represent the number of paths of length n in the original graph.
a
e
dc
b
a b c d e
a 1
b 1
c 1
d 1
e 1
M =
Copyright (c) 2016 by STORM Insights Inc. All Rights reserved.
a
e
dc
b
a b c d e
a 1
b 1
c 1
d 1
e 1
M2 =
Key Concept… Graphs have well known mathematical properties:
e.g. If you represent a graph as a matrix M, then values in Mn
represent the number of paths of length n in the original graph.
Copyright (c) 2016 by STORM Insights Inc. All Rights reserved.
a
e
dc
b
a b c d e
a 1
b 1
c 1
d 1
e 1
M3 =
Key Concept… Graphs have well known mathematical properties:
e.g. If you represent a graph as a matrix M, then values in Mn
represent the number of paths of length n in the original graph.
Copyright (c) 2016 by STORM Insights Inc. All Rights reserved.
The Market is Ready for You Now With Options
Commercial
Open Source
As a Service
Copyright (c) 2016 by STORM Insights Inc. All Rights reserved.Wikipedia contributors. "Graph database." Wikipedia, The Free Encyclopedia. Wikipedia, The Free Encyclopedia, 11 May. 2016. Web. 12 May. 2016.
Copyright (c) 2016 by STORM Insights Inc. All Rights reserved.Wikipedia contributors. "Graph database." Wikipedia, The Free Encyclopedia. Wikipedia, The Free Encyclopedia, 11 May. 2016. Web. 12 May. 2016.
Property graph
RDF
RDF - Resource Description Framework, W3C specs for
metadata modeling, now used in knowledge management
Copyright (c) 2016 by STORM Insights Inc. All Rights reserved.
No SQL? (as opposed to NoSQL) No problem
Gremlin - Open source (Apache2 license)
- a graph traversal language, supported by Titan, Neo4j,
HadoopGiraph, Hadoop Spark, IBM…
Cypher - Neo4j, Objectivity…
Emerging graph query/traversal languages
SPARQL - Open source (SPARQL Protocol and RDF Query Language)
Copyright (c) 2016 by STORM Insights Inc. All Rights reserved.Wikipedia contributors. "Graph database." Wikipedia, The Free Encyclopedia. Wikipedia, The Free Encyclopedia, 11 May. 2016. Web. 12 May. 2016.
This chart is representative of the
market, but incomplete.
Copyright (c) 2016 by STORM Insights Inc. All Rights reserved.
Apache TinkerPop, TinkerPop, Apache, Apache feather logo, and Apache TinkerPop project logo are
either registered trademarks or trademarks of The Apache Software Foundation in the United States and other countries.
Apache TinkerPop™ is a graph computing framework for both
graph databases (OLTP) and graph analytic systems (OLAP).
“A graph is a structure composed of vertices and edges. Both vertices and edges
can have an arbitrary number of key/value-pairs called properties. Vertices denote
discrete objects such as a person, a place, or an event. Edges denote relationships
between vertices. For instance, a person may know another person, have been
involved in an event, and/or was recently at a particular place. Properties express
non-relational information about the vertices and edges. Example properties include
a vertex having a name, an age and an edge having a timestamp and/or a weight.
Together, the aforementioned graph is known as a property graph and it is the
foundational data structure of Apache TinkerPop.”
Apache TinkerPop™ is an open source, vendor-agnostic, graph computing
framework distributed under the commercial friendly Apache2 license. When a data
system is TinkerPop-enabled, its users are able to model their domain as a graph
and analyze that graph using the Gremlin graph traversal language.
Copyright (c) 2016 by STORM Insights Inc. All Rights reserved.
Copyright (c) 2016 by STORM Insights Inc. All Rights reserved.
Getting Started…
Why choose a graph database?
Speed to delivery when the data is naturally modeled as a graph

Simplifies multi-hop queries

Visualization? Baked-in
Ask Yourself
Do you need an on-premise solution, or to manage your own database?

Lots of options, Neo4J is the market leader

Do you want graphs as a service?

IBM offering graph as a service through BlueMix (in Beta now)
Copyright (c) 2016 by STORM Insights Inc. All Rights reserved.
Upcoming Webinar Dates & Topics
June 9 Advances in Natural Language Processing (NLP) 

July 13 Modern AI and The Future of Work (With Steve Ardire) 

adrian@storminsights.com Twitter @ajbowles Skype ajbowles
A hat-tip to Kamille Nixon…
Copyright (c) 2016 by STORM Insights Inc. All Rights reserved.
Upcoming Webinar Dates & Topics
June 9 Advances in Natural Language Processing (NLP) 

July 13 Modern AI and The Future of Work (With Steve Ardire) 

adrian@storminsights.com Twitter @ajbowles Skype ajbowles

More Related Content

PDF
Sense and Sensors - From Perception to Personality
PDF
R, Data Wrangling & Kaggle Data Science Competitions
PDF
Pandas, Data Wrangling & Data Science
PDF
Data-Ed Webinar: Best Practices with the DMM
PDF
Best Practices with the DMM
PDF
Data-Ed Webinar: Data-centric Strategy & Roadmap
PDF
How Ally Financial Achieved Regulatory Compliance with the Data Management Ma...
PDF
Presentation for Catalyst workshop, with notes
Sense and Sensors - From Perception to Personality
R, Data Wrangling & Kaggle Data Science Competitions
Pandas, Data Wrangling & Data Science
Data-Ed Webinar: Best Practices with the DMM
Best Practices with the DMM
Data-Ed Webinar: Data-centric Strategy & Roadmap
How Ally Financial Achieved Regulatory Compliance with the Data Management Ma...
Presentation for Catalyst workshop, with notes

Viewers also liked (19)

PDF
University Talks #2 | Анастасия Чекрыжова — Свежий взгляд на современное иску...
PPTX
Somar com o outono I
PDF
University Talks #1 | Екатерина Мамонтова - Счастье не поддается инфляции
PPTX
Presentación sobre el hiv
PDF
Ημερολόγιο Δραστηριοτήτων Π.Ο. 2015-16
ODP
How to bake reactive behavior into your Java EE applications
PDF
Social metadata on the web
PPT
PDF
University Talks #2 | Елена Шилова — Свет и безопасность
PDF
Bozza della Legge di Bilancio 2017
PPTX
STORYTELLING
PPTX
Social Media Trends 2014
PPT
Becoming a Systematic Entrepreneur?
PDF
Keynote 4: Leadership and Education for Sustainable Development, Philip Vaughter
PPS
Meilleures photos national geo 2015
DOCX
Proyecto "Song for a change"
PDF
Media, Technology and Consumer Trends to Watch in 2016
PDF
Exakat for PHP : smart code reviewing engine
PDF
University Talks #1 | Никита Булгаков и Алексей Ушаков - Дзен и искусство пои...
University Talks #2 | Анастасия Чекрыжова — Свежий взгляд на современное иску...
Somar com o outono I
University Talks #1 | Екатерина Мамонтова - Счастье не поддается инфляции
Presentación sobre el hiv
Ημερολόγιο Δραστηριοτήτων Π.Ο. 2015-16
How to bake reactive behavior into your Java EE applications
Social metadata on the web
University Talks #2 | Елена Шилова — Свет и безопасность
Bozza della Legge di Bilancio 2017
STORYTELLING
Social Media Trends 2014
Becoming a Systematic Entrepreneur?
Keynote 4: Leadership and Education for Sustainable Development, Philip Vaughter
Meilleures photos national geo 2015
Proyecto "Song for a change"
Media, Technology and Consumer Trends to Watch in 2016
Exakat for PHP : smart code reviewing engine
University Talks #1 | Никита Булгаков и Алексей Ушаков - Дзен и искусство пои...
Ad

Similar to Smart Data Webinar: Emerging Data Management Options (20)

PDF
Smart Data Webinar: Advances in Natural Language Processing
PDF
Smart Data Webinar: Choosing the Right Data Management Architecture for Cogni...
PDF
Smart Data Webinar: Deep QA (Question/Answer) - Lessons From Watson and Jeopa...
PDF
When data journalism meets science | Erice, June 10th, 2014
PPTX
Spohrer GAMP 20230628 v17.pptx
PDF
Smart Data Webinar: Organizing Data and Knowledge - The Role of Taxonomies an...
PDF
Big, Open, Data and Semantics for Real-World Application Near You
PDF
Smart Data Webinar: Knowledge as a Service
PDF
UseR 2017
PDF
History of AI, Current Trends, Prospective Trajectories
PPTX
From Turing To Humanoid Robots - Ramón López de Mántaras
PDF
"Methods for Understanding How Deep Neural Networks Work," a Presentation fro...
PDF
Just the basics_strata_2013
PDF
人工智慧與物聯網的創新與服務模式
PDF
PLOTCON NYC: Get Your Point Across: The Art of Choosing the Right Visualizati...
PDF
Find Review About EssayWriters.Us DissertationHQHelp.Com
PPT
The Potential of Web 3.0
PDF
Developmental Psychology Theoretical Approaches Essay
PPTX
I want to know more about compuerized text analysis
PPTX
Ben Kunz Mediassociates speech to Boise Ad Federation
Smart Data Webinar: Advances in Natural Language Processing
Smart Data Webinar: Choosing the Right Data Management Architecture for Cogni...
Smart Data Webinar: Deep QA (Question/Answer) - Lessons From Watson and Jeopa...
When data journalism meets science | Erice, June 10th, 2014
Spohrer GAMP 20230628 v17.pptx
Smart Data Webinar: Organizing Data and Knowledge - The Role of Taxonomies an...
Big, Open, Data and Semantics for Real-World Application Near You
Smart Data Webinar: Knowledge as a Service
UseR 2017
History of AI, Current Trends, Prospective Trajectories
From Turing To Humanoid Robots - Ramón López de Mántaras
"Methods for Understanding How Deep Neural Networks Work," a Presentation fro...
Just the basics_strata_2013
人工智慧與物聯網的創新與服務模式
PLOTCON NYC: Get Your Point Across: The Art of Choosing the Right Visualizati...
Find Review About EssayWriters.Us DissertationHQHelp.Com
The Potential of Web 3.0
Developmental Psychology Theoretical Approaches Essay
I want to know more about compuerized text analysis
Ben Kunz Mediassociates speech to Boise Ad Federation
Ad

More from DATAVERSITY (20)

PDF
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
PDF
Data at the Speed of Business with Data Mastering and Governance
PDF
Exploring Levels of Data Literacy
PDF
Building a Data Strategy – Practical Steps for Aligning with Business Goals
PDF
Make Data Work for You
PDF
Data Catalogs Are the Answer – What is the Question?
PDF
Data Catalogs Are the Answer – What Is the Question?
PDF
Data Modeling Fundamentals
PDF
Showing ROI for Your Analytic Project
PDF
How a Semantic Layer Makes Data Mesh Work at Scale
PDF
Is Enterprise Data Literacy Possible?
PDF
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
PDF
Emerging Trends in Data Architecture – What’s the Next Big Thing?
PDF
Data Governance Trends - A Look Backwards and Forwards
PDF
Data Governance Trends and Best Practices To Implement Today
PDF
2023 Trends in Enterprise Analytics
PDF
Data Strategy Best Practices
PDF
Who Should Own Data Governance – IT or Business?
PDF
Data Management Best Practices
PDF
MLOps – Applying DevOps to Competitive Advantage
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
Data at the Speed of Business with Data Mastering and Governance
Exploring Levels of Data Literacy
Building a Data Strategy – Practical Steps for Aligning with Business Goals
Make Data Work for You
Data Catalogs Are the Answer – What is the Question?
Data Catalogs Are the Answer – What Is the Question?
Data Modeling Fundamentals
Showing ROI for Your Analytic Project
How a Semantic Layer Makes Data Mesh Work at Scale
Is Enterprise Data Literacy Possible?
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
Emerging Trends in Data Architecture – What’s the Next Big Thing?
Data Governance Trends - A Look Backwards and Forwards
Data Governance Trends and Best Practices To Implement Today
2023 Trends in Enterprise Analytics
Data Strategy Best Practices
Who Should Own Data Governance – IT or Business?
Data Management Best Practices
MLOps – Applying DevOps to Competitive Advantage

Recently uploaded (20)

PDF
DASA ADMISSION 2024_FirstRound_FirstRank_LastRank.pdf
PDF
2021 HotChips TSMC Packaging Technologies for Chiplets and 3D_0819 publish_pu...
PPTX
OMC Textile Division Presentation 2021.pptx
PPTX
MicrosoftCybserSecurityReferenceArchitecture-April-2025.pptx
PPTX
Chapter 5: Probability Theory and Statistics
PDF
gpt5_lecture_notes_comprehensive_20250812015547.pdf
PDF
WOOl fibre morphology and structure.pdf for textiles
PDF
1 - Historical Antecedents, Social Consideration.pdf
PPT
What is a Computer? Input Devices /output devices
PPTX
Programs and apps: productivity, graphics, security and other tools
PPTX
Modernising the Digital Integration Hub
PDF
A comparative study of natural language inference in Swahili using monolingua...
PDF
Enhancing emotion recognition model for a student engagement use case through...
PDF
DP Operators-handbook-extract for the Mautical Institute
PDF
Transform Your ITIL® 4 & ITSM Strategy with AI in 2025.pdf
PDF
Getting Started with Data Integration: FME Form 101
PDF
Hybrid model detection and classification of lung cancer
PDF
Video forgery: An extensive analysis of inter-and intra-frame manipulation al...
PDF
Developing a website for English-speaking practice to English as a foreign la...
PDF
TrustArc Webinar - Click, Consent, Trust: Winning the Privacy Game
DASA ADMISSION 2024_FirstRound_FirstRank_LastRank.pdf
2021 HotChips TSMC Packaging Technologies for Chiplets and 3D_0819 publish_pu...
OMC Textile Division Presentation 2021.pptx
MicrosoftCybserSecurityReferenceArchitecture-April-2025.pptx
Chapter 5: Probability Theory and Statistics
gpt5_lecture_notes_comprehensive_20250812015547.pdf
WOOl fibre morphology and structure.pdf for textiles
1 - Historical Antecedents, Social Consideration.pdf
What is a Computer? Input Devices /output devices
Programs and apps: productivity, graphics, security and other tools
Modernising the Digital Integration Hub
A comparative study of natural language inference in Swahili using monolingua...
Enhancing emotion recognition model for a student engagement use case through...
DP Operators-handbook-extract for the Mautical Institute
Transform Your ITIL® 4 & ITSM Strategy with AI in 2025.pdf
Getting Started with Data Integration: FME Form 101
Hybrid model detection and classification of lung cancer
Video forgery: An extensive analysis of inter-and intra-frame manipulation al...
Developing a website for English-speaking practice to English as a foreign la...
TrustArc Webinar - Click, Consent, Trust: Winning the Privacy Game

Smart Data Webinar: Emerging Data Management Options

  • 1. Copyright (c) 2016 by STORM Insights Inc. All Rights reserved. Emerging Data Management Options Adrian Bowles, PhD Founder, STORM Insights, Inc. info@storminsights.com
  • 2. Copyright (c) 2016 by STORM Insights Inc. All Rights reserved. Basic Life Advice “When the map and the terrain disagree, believe the terrain.” Gause and Weinberg (Exploring Requirements) It is the pervading law of all things organic, and inorganic, of all things physical and metaphysical, of all things human and all things superhuman, of all true manifestations of the head, of the heart, of the soul, that the life is recognizable in its expression, that form ever follows function. That is the law. Louis Sullivan: The Tall Office Building Artistically Considered, 1896
  • 3. Copyright (c) 2016 by STORM Insights Inc. All Rights reserved. How You Think About a Domain… …influences your choice of maps and models… rules and representations…and required operations.
  • 4. Copyright (c) 2016 by STORM Insights Inc. All Rights reserved. ” To solve really hard problems, we'll have to use several different representations. This is because each particular kind of data structure has its own virtues and deficiencies, and none by itself would seem adequate for all the different functions involved with what we call common sense.” Marvin Minsky
  • 5. Copyright (c) 2016 by STORM Insights Inc. All Rights reserved.
  • 6. Copyright (c) 2016 by STORM Insights Inc. All Rights reserved. What Do You Want/Need to Store? How much? How complex? How fast? What Do You Want/Need to DO With What You Store? Do you need a graph database? Options Include… Files, tables, trees, queues, stacks, lists… Hierarchical RDBMS Object DBMS NoSQL Graph
  • 7. Copyright (c) 2016 by STORM Insights Inc. All Rights reserved. Perception/ NLP Problem Solving & Learning Simple: deterministic, retrieve/calculate Complex: probabalistic hypothesize, test, rank, select Creative: discover, generate ORGANIZED Memory* Input Class/Type Visual Text Image Aural Speech Music Cues Noise Informative Touch Temperature Tactile Texture Taste Smell Response Types Visible (to the environment) Verbal/NL Text Behavioral (system changes) Haptics/Touch/Proprioception Invisible Memory updates *Corpus including data in taxonomies, ontologies, trees…
  • 8. Copyright (c) 2016 by STORM Insights Inc. All Rights reserved. Graphs 101 A graph is a structure with vertices and edges.
  • 9. Copyright (c) 2016 by STORM Insights Inc. All Rights reserved. Graphs 101 A graph is a structure with vertices and edges. a e dc b Old Post Road Cross Highway Main Street Shinbone Alley Elk Road
  • 10. Copyright (c) 2016 by STORM Insights Inc. All Rights reserved. Graphs 101 A graph is a structure with vertices and edges. a e dc b Old Post Road Cross Highway Main Street Shinbone Alley Elk Road Old Post Road Paved Old Post Road 11 miles Elk Road Dirt Elk Road 2 miles Cross Highway toll road Cross Highway 250 miles Main Street 1 mile Shinbone Alley .5 miles a bus stop b gas station b Shell c Elementary school d House e Office building May be labeled, edges may be directed, all may be stored/processed by properties represented as key/value pairs.
  • 11. Copyright (c) 2014-2016 by STORM Insights Inc. All Rights reserved. Obvious structure is easy to process… but most of the interesting stuff isn’t obvious to a computer. Vertices, edges, properties should represent data with higher- level structure.
  • 12. Copyright (c) 2016 by STORM Insights Inc. All Rights reserved. You Probably Already Think In Graphs if… You watch detective shows You know trivia about movies You remember relationships between people You took a biology class
  • 13. Copyright (c) 2016 by STORM Insights Inc. All Rights reserved. You Probably Already Think In Graphs if… You took a biology class or played 20 questions (“animal, mineral or vegetable?”) Wikipedia contributors. "Taxonomy (biology)." Wikipedia, The Free Encyclopedia. Wikipedia, The Free Encyclopedia, 11 May. 2016. Web. 12 May. 2016.
  • 14. Copyright (c) 2016 by STORM Insights Inc. All Rights reserved. You Probably Already Think In Graphs if… You watch detective shows Typical crazy wall whiteboard - from Fargo. A screen from IBM I2 Coplink
  • 15. Copyright (c) 2016 by STORM Insights Inc. All Rights reserved. You Probably Already Think In Graphs if… You know trivia about movies IMDB
  • 16. Copyright (c) 2016 by STORM Insights Inc. All Rights reserved. You Probably Already Think In Graphs if… You remember relationships between people Family Tree LinkedIn Tree
  • 17. Copyright (c) 2016 by STORM Insights Inc. All Rights reserved. Anonymized look at my desk/wall on a typical day.
  • 18. Copyright (c) 2016 by STORM Insights Inc. All Rights reserved. Processes Can Be Represented As Graphs
  • 19. Copyright (c) 2016 by STORM Insights Inc. All Rights reserved. A taxonomy represents the formal structure of classes or types of objects within a domain. Taxonomies are generally hierarchical and provide names for each class in the domain. They may also capture the membership properties of each object in relation to the other objects. The rules of a specific taxonomy are used to classify or categorize any object in the domain, so they must be complete, consistent, and unambiguous. This rigor in specification should ensure that any newly discovered object must fit into one, and only one, category or object class.
  • 20. Copyright (c) 2016 by STORM Insights Inc. All Rights reserved. 1952 DSM I 1968 DSM II Pervasive Developmental Disorder (PDD) Childhood onset PDD Infantile Autism Atypical Autism 1980 DSM III Taxonomies Evolve The History of Autism in the Diagnostic & Statistical Manual of the American Psychiatric Association Pervasive Developmental Disorder (PDD) PDD-NOS Autistic Disorder (Not Otherwise Specified) 1987 DSM III-R Pervasive Developmental Disorder (PDD) PDD-NOS Autistic Disorder Asperger Disorder Childhood Disintegrative Disorder Rett Syndrome 1994 DSM IV 2000 DSM IV-TR Autism Spectrum Disorder (ASD) 2013 DSMV
  • 21. Copyright (c) 2016 by STORM Insights Inc. All Rights reserved. 1952 DSM I 1968 DSM II Pervasive Developmental Disorder (PDD) Childhood onset PDD Infantile Autism Atypical Autism 1980 DSM III Pervasive Developmental Disorder (PDD) PDD-NOS Autistic Disorder (Not Otherwise Specified) 1987 DSM III-R Pervasive Developmental Disorder (PDD) PDD-NOS Autistic Disorder Asperger Disorder Childhood Disintegrative Disorder Rett Syndrome 1994 DSM IV 2000 DSM IV-TR Autism Spectrum Disorder (ASD) 2013 DSMV An ontology provides more detail than a taxonomy, although the boundary between them in practice is somewhat fuzzy. An ontology should comprehensively capture the common understanding – vocabulary, definitions, rules - of a community as it applies to a specific domain.
  • 22. Copyright (c) 2016 by STORM Insights Inc. All Rights reserved. Key Concept… Graphs have well known mathematical properties: e.g. If you represent a graph as a matrix M, then values in Mn represent the number of paths of length n in the original graph. a e dc b a b c d e a 1 b 1 c 1 d 1 e 1 M =
  • 23. Copyright (c) 2016 by STORM Insights Inc. All Rights reserved. a e dc b a b c d e a 1 b 1 c 1 d 1 e 1 M2 = Key Concept… Graphs have well known mathematical properties: e.g. If you represent a graph as a matrix M, then values in Mn represent the number of paths of length n in the original graph.
  • 24. Copyright (c) 2016 by STORM Insights Inc. All Rights reserved. a e dc b a b c d e a 1 b 1 c 1 d 1 e 1 M3 = Key Concept… Graphs have well known mathematical properties: e.g. If you represent a graph as a matrix M, then values in Mn represent the number of paths of length n in the original graph.
  • 25. Copyright (c) 2016 by STORM Insights Inc. All Rights reserved. The Market is Ready for You Now With Options Commercial Open Source As a Service
  • 26. Copyright (c) 2016 by STORM Insights Inc. All Rights reserved.Wikipedia contributors. "Graph database." Wikipedia, The Free Encyclopedia. Wikipedia, The Free Encyclopedia, 11 May. 2016. Web. 12 May. 2016.
  • 27. Copyright (c) 2016 by STORM Insights Inc. All Rights reserved.Wikipedia contributors. "Graph database." Wikipedia, The Free Encyclopedia. Wikipedia, The Free Encyclopedia, 11 May. 2016. Web. 12 May. 2016. Property graph RDF RDF - Resource Description Framework, W3C specs for metadata modeling, now used in knowledge management
  • 28. Copyright (c) 2016 by STORM Insights Inc. All Rights reserved. No SQL? (as opposed to NoSQL) No problem Gremlin - Open source (Apache2 license) - a graph traversal language, supported by Titan, Neo4j, HadoopGiraph, Hadoop Spark, IBM… Cypher - Neo4j, Objectivity… Emerging graph query/traversal languages SPARQL - Open source (SPARQL Protocol and RDF Query Language)
  • 29. Copyright (c) 2016 by STORM Insights Inc. All Rights reserved.Wikipedia contributors. "Graph database." Wikipedia, The Free Encyclopedia. Wikipedia, The Free Encyclopedia, 11 May. 2016. Web. 12 May. 2016. This chart is representative of the market, but incomplete.
  • 30. Copyright (c) 2016 by STORM Insights Inc. All Rights reserved. Apache TinkerPop, TinkerPop, Apache, Apache feather logo, and Apache TinkerPop project logo are either registered trademarks or trademarks of The Apache Software Foundation in the United States and other countries. Apache TinkerPop™ is a graph computing framework for both graph databases (OLTP) and graph analytic systems (OLAP). “A graph is a structure composed of vertices and edges. Both vertices and edges can have an arbitrary number of key/value-pairs called properties. Vertices denote discrete objects such as a person, a place, or an event. Edges denote relationships between vertices. For instance, a person may know another person, have been involved in an event, and/or was recently at a particular place. Properties express non-relational information about the vertices and edges. Example properties include a vertex having a name, an age and an edge having a timestamp and/or a weight. Together, the aforementioned graph is known as a property graph and it is the foundational data structure of Apache TinkerPop.” Apache TinkerPop™ is an open source, vendor-agnostic, graph computing framework distributed under the commercial friendly Apache2 license. When a data system is TinkerPop-enabled, its users are able to model their domain as a graph and analyze that graph using the Gremlin graph traversal language.
  • 31. Copyright (c) 2016 by STORM Insights Inc. All Rights reserved.
  • 32. Copyright (c) 2016 by STORM Insights Inc. All Rights reserved. Getting Started… Why choose a graph database? Speed to delivery when the data is naturally modeled as a graph Simplifies multi-hop queries Visualization? Baked-in Ask Yourself Do you need an on-premise solution, or to manage your own database? Lots of options, Neo4J is the market leader Do you want graphs as a service? IBM offering graph as a service through BlueMix (in Beta now)
  • 33. Copyright (c) 2016 by STORM Insights Inc. All Rights reserved. Upcoming Webinar Dates & Topics June 9 Advances in Natural Language Processing (NLP) 
 July 13 Modern AI and The Future of Work (With Steve Ardire) 
 adrian@storminsights.com Twitter @ajbowles Skype ajbowles A hat-tip to Kamille Nixon…
  • 34. Copyright (c) 2016 by STORM Insights Inc. All Rights reserved. Upcoming Webinar Dates & Topics June 9 Advances in Natural Language Processing (NLP) 
 July 13 Modern AI and The Future of Work (With Steve Ardire) 
 adrian@storminsights.com Twitter @ajbowles Skype ajbowles