SlideShare a Scribd company logo
Selecting the right database for
your semantic storage needs
Taxonomy Boot Camp London 2017
Jim Sweeney, Senior Product Manager
Taxonomy and Ontology Solutions
Database Types
There are a number of different
database types available. We will look
at three key designs as they relate to
storing semantic information.
• Relational Database Management
Systems (RDBMS)
• Property Graph Databases
• RDF Databases (a specialized form
of Graph Database)
A Brief History
• The relational database did not appear until the late
1970’s replacing Navigational database designs of the
previous decade.
• The first commercial versions started to appear in the
early 1980’s with IBM’s DB2 and Oracle’s products
leading the way.
• Microsoft released their SQL Server platform in 1989.
• These three providers continue to be among the leaders
as far as market share across all database types.
A Brief History (Continued)
• Graph database systems begin
to appear in the late 1990’s,
coinciding with the adoption of
RDF as a W3C recommendation.
• Leading Property Graph and
RDF database providers such as
Neo4J, GraphDB, and Jena (to
name just a few) continue to
grow in market share but still
represent just a fraction of the
total market share.
Rank DBMS
Database
Model Score
Oct Oct Oct Sep Oct
2017 2016 2017 2017 2016
1 1 Oracle Relational DBMS 1348.8 -10.29 -68.3
2 2 MySQL Relational DBMS 1298.83 -13.78 -63.82
3 3 Microsoft SQL Server Relational DBMS 1210.32 -2.23 -3.86
4 5 PostgreSQL Relational DBMS 373.27 0.91 54.58
5 4 MongoDB Document store 329.4 -3.33 10.6
6 6 DB2 Relational DBMS 194.59 -3.75 14.03
7 8 Microsoft Access Relational DBMS 129.45 0.64 4.78
8 7 Cassandra Wide column store 124.79 -1.41 -10.27
9 9 Redis Key-value store 122.05 1.65 12.51
10 11 Elasticsearch Search engine 120.23 0.23 21.12
21 21 Neo4j Graph DBMS 37.95 -0.48 1.5
89 86 Jena RDF store 2.31 0.04 0.5
Relational Databases
Management Systems (RDBMS)
• RDBMS are designed using multiple
tables to store data.
• Each table is made up of headers,
rows, and columns, much like a
spreadsheet.
• Each row contains different
elements, including a primary key
and zero to many foreign keys.
• These keys are used to link
information in one table to another
table, which will contain distinct but
related information. Image from neo4j.com
RDBMS Design
• The use of a key in each
table means that there is
no “relationship” as we
would find in natural
language or a graph
structure.
• Table values themselves
are what link the data
together by creating a
join between data
elements using keys. Image from neo4j.com
Pros and Cons of
RDBMS
Pros:
• RDBMS are tried and tested over many
decades, and generally have more
features available than newer
platforms.
• RDBMS are very efficient at storing
indexable information that is described
well in “tabular” structures, such as
accounting or inventory data.
• Data is often indexed at the time of
entry, making keyword searches and
those with limited table joins relatively
fast.
Pros and Cons of
RDBMS
Cons:
• RDBMS require a foreknowledge of all
the data, and consequently data tables,
that you will be storing. (Graph
structures provide more ad hoc design
flexibility.)
• RDBMS don’t perform as well in pattern
matching searches as the number of
relationships to be traversed increases.
• SQL language, while widely known, is
slightly more complex than those used
with Property Graph or RDF systems.
RDBMS
Deconstructed
• As we have seen,
Relational Databases use
tables and key values to
link information
together.
• Let’s focus on just the
highlighted components
to compare this design
with that of a Graph
database.
Image from neo4j.com
The Graph
Model
• With a graph database, such as
Neo4j, each node is a standalone
item that describes a fully-formed
member of the database.
• To link the nodes together, one uses
relationships. These are referred to
as edges.
• The nodes and edges form uniform
structures which are repeated to
relate all of the data elements,
completely describing all necessary
aspects of the information in the
database. Image from neo4j.com
The Graph Model
(continued)
• If we look at our example with
Alice, we see that she belongs to
several departments. The nature of
the link from the person, Alice, to
the Departments is described by
the relationship itself, in this case
“:BELONGS_TO”.
• The node-edge-node combination
that is formed has Alice belonging
to the department “4FUTURE”.
• No longer are we linking items via
table joins. We are now describing
the links is a more human
understandable format.
Image from neo4j.com
RDF – Standardizing the Triple
• RDF or Resource Description
Framework provides standards that
we use to uniformly describe the
relationships between nodes, as well
as describe other attributes that we
want to assign to our data as
metadata.
• The standard establishes a subject-
predicate-object unit called a triple.
• One such RDF standard, FOAF
(Friend of a Friend), is a collection of
relationships and attributes
describing people, their relationships
to one another, and their activities.
• In this example, we see Alice is not
just identified as a member of
departments, but also as a family
member based near a location, and
with a descriptive home page.
RDF – Standardizing the Triple
• To extend the previous
example, Denver might be
linked to the state of Colorado,
in the United States, in North
America using RDF OWL.
• Using Graph database design,
and employing RDF descriptive
language collections, one can
create links between entities
that may virtually go on and on.
• What we create becomes a
dynamic, three dimensional
database model capable of
establishing true semantic
Ontologies.
Linked Data and
the Semantic Web
• In 1991 Tim Berners-Lee described what he
called the World Wide Web, which was
meant to, “allow links to be made to any
information anywhere.”1
• In the 2010’s we are seeing the concept of
Linked Data realizing this goal, utilizing RDF
standards and other technologies such as
SPARQL (Protocol and RDF Query Language)
and JSON (JavaScript Object Notation).
• With Linked Data protocols, one may send
queries across the web to retrieve specific
information from any source published in this
format.
• While this is possible using RDBMS
technologies as well, Graph databases speak
the native language of Linked Data and start
to introduce some of the most exciting
potential of the World Wide Web.
Linking Open Data cloud diagram 2017, by Andrejs Abele, John P. McCrae, Paul Buitelaar,
Anja Jentzsch and Richard Cyganiak. http://lod-cloud.net/
1 https://www.w3.org/People/Berners-Lee/1991/08/art-6484.txt
Pros and Cons of Graph
and RDF Databases
Pros:
• Graph databases are very well
suited to storing information that
contains many relationships and
data points, leading to fast
performance for pattern based
searches.
• SPARQL, Cypher, and other Graph
query languages are generally
simple and powerful.
• RDF data stores can connect easily
and natively to Linked Data sources.
Pros and Cons of Graph
and RDF Databases
Cons:
• Because they are not indexed as
RDBMS are, they are not as effective
in handing very large numbers of
transactions or queries that need to
capture linked information across an
entire database at once.
• Newer technology, less tested, not
as many providers, and not as well
supported, yet.
• May not integrate well with legacy
systems that will be consuming
semantic information.
Conclusions
• RDMBS offer a tried and true technology that is more
than capable of providing the database layer for
most semantic technology applications and services.
• Their wide adoption in nearly all data management
environments make them easy to support and
integrate with external systems.
• When properly indexed and designed, they are very
good at handling many, simultaneous transactions.
• And, they are able to handle some advanced
functionality enabled by RDF and Linked Data with a
little design expertise.
Conclusions
• Graph and RDF databases are newer to the market and have
yet to come close to the saturation levels that RDBMS enjoy,
but they offer a design that is intrinsically effective at
managing semantic information and the technologies around
RDF and Linked Data.
• Graph and RDF databases don’t require foreknowledge or pre-
definition necessary for RDMBS design, making them able to
evolve rapidly and organically.
• Because Graph and RDF databases use basic, semantic
structure as a key design characteristic they are very good at
answering questions posed by real-life queries that we find in
everyday language.
Resources
• Explanation of Graph Structure: http://www.zdnet.com/pictures/a-
graph-is-a-graph-is-graph-rdf-vs-lpg/
• Data Modeling with Graph Databases: https://youtu.be/r4IGWNovzes
• Designing and Building Semantic Applications with Linked Data:
https://youtu.be/ToALLwM7KIw
Thank You!
Jim Sweeney, Senior Product Manager
jim.sweeney@synaptica.com
www.synaptica.com
Twitter: @synaptica

More Related Content

PPTX
RELATIONAL MODEL OF DATABASES AND OTHER CONCEPTS OF DATABASES​
PDF
the rising no sql technology
PPTX
Improvement of no sql technology for relational databases v2
ODP
PDF
Jarrar: Data Schema Integration
PPTX
Interaction with Linked Data
PPTX
Big Linked Data - Creating Training Curricula
RELATIONAL MODEL OF DATABASES AND OTHER CONCEPTS OF DATABASES​
the rising no sql technology
Improvement of no sql technology for relational databases v2
Jarrar: Data Schema Integration
Interaction with Linked Data
Big Linked Data - Creating Training Curricula

What's hot (20)

PPT
NoSQL Basics - a quick tour
PDF
Wed roman tut_open_datapub
PPTX
Chapter 4 terminolgy of keyvalue databses from nosql for mere mortals
PDF
Link Sets And Why They Are Important (EDF2012)
PPTX
NoSQL Basics and MongDB
PPTX
Providing Linked Data
PPTX
Big data analytics: Technology's bleeding edge
PPTX
introduction to NOSQL Database
PDF
What flavor of linked data is best for your collection?
PPTX
Scaling up Linked Data
PPT
The Power of Semantic Technologies to Explore Linked Open Data
PPT
Graph db
PDF
Type of Database Management System
PDF
Build Narratives, Connect Artifacts: Linked Open Data for Cultural Heritage
PPTX
Semantic Web (Web 3.0)
PDF
Linked Data Notifications Distributed Update Notification and Propagation on ...
PDF
What_do_Knowledge_Graph_Embeddings_Learn.pdf
PPTX
Presentation1
PDF
iLastic: Linked Data Generation Workflow and User Interface for iMinds Schola...
PDF
Oslo baksia2014
NoSQL Basics - a quick tour
Wed roman tut_open_datapub
Chapter 4 terminolgy of keyvalue databses from nosql for mere mortals
Link Sets And Why They Are Important (EDF2012)
NoSQL Basics and MongDB
Providing Linked Data
Big data analytics: Technology's bleeding edge
introduction to NOSQL Database
What flavor of linked data is best for your collection?
Scaling up Linked Data
The Power of Semantic Technologies to Explore Linked Open Data
Graph db
Type of Database Management System
Build Narratives, Connect Artifacts: Linked Open Data for Cultural Heritage
Semantic Web (Web 3.0)
Linked Data Notifications Distributed Update Notification and Propagation on ...
What_do_Knowledge_Graph_Embeddings_Learn.pdf
Presentation1
iLastic: Linked Data Generation Workflow and User Interface for iMinds Schola...
Oslo baksia2014
Ad

Similar to Selecting the right database type for your knowledge management needs. (20)

PPTX
NOSQL Databases types and Uses
PPT
NoSQL Databases
PDF
NOSQL- Presentation on NoSQL
PDF
NOSQL in big data is the not only structure langua.pdf
PPTX
Modern database
PPTX
UNIT-2.pptx
PPTX
Big Data Analytics (Collection of Huge Data 3)
PPTX
No SQL- The Future Of Data Storage
DOCX
data base system to new data science lerne
PDF
Presentation On NoSQL Databases
PPTX
NoSQL and MongoDB
PDF
No sql – rise of the clusters
PPTX
Unit-10.pptx
PPTX
Choosing your NoSQL storage
PPTX
DSA _ASSIGNMENT_1(MONGODB VS RDBMS).pptx
PPTX
PPTX
kfddnloiujhfsgklllmnbfhigldktktktkykydlhjjclj
DOCX
Unit II -BIG DATA ANALYTICS.docx
PPTX
nosqldatabnjxjdjases-240121150542-d4ec9e23.pptx
DOCX
Report 2.0.docx
NOSQL Databases types and Uses
NoSQL Databases
NOSQL- Presentation on NoSQL
NOSQL in big data is the not only structure langua.pdf
Modern database
UNIT-2.pptx
Big Data Analytics (Collection of Huge Data 3)
No SQL- The Future Of Data Storage
data base system to new data science lerne
Presentation On NoSQL Databases
NoSQL and MongoDB
No sql – rise of the clusters
Unit-10.pptx
Choosing your NoSQL storage
DSA _ASSIGNMENT_1(MONGODB VS RDBMS).pptx
kfddnloiujhfsgklllmnbfhigldktktktkykydlhjjclj
Unit II -BIG DATA ANALYTICS.docx
nosqldatabnjxjdjases-240121150542-d4ec9e23.pptx
Report 2.0.docx
Ad

More from Synaptica, LLC (10)

PPTX
Using ontologies for more than information categorization
PPTX
Text Analytics for Non-Experts
PPTX
Linked data 20171106
PPTX
Taxonomies Crossing Boundaries: Thomson Reuters Life Sciences Taxonomy Use Cases
PPTX
SKOS-XL vs. Traditional Term Based Taxonomy Management
PPTX
Successfully Managing Multilingual Taxonomies: 3 Methods
PPTX
Enterprise vs. Federated Taxonomy Management - Taxonomy Boot Camp 2012
PPTX
ProQuest Taxonomy Boot Camp Presentation 2008
PPT
Re-engineering Taxonomy Warehouse as an Ontology
PPT
Synaptica Proquest Talk Taxonomy Boot Camp 2009
Using ontologies for more than information categorization
Text Analytics for Non-Experts
Linked data 20171106
Taxonomies Crossing Boundaries: Thomson Reuters Life Sciences Taxonomy Use Cases
SKOS-XL vs. Traditional Term Based Taxonomy Management
Successfully Managing Multilingual Taxonomies: 3 Methods
Enterprise vs. Federated Taxonomy Management - Taxonomy Boot Camp 2012
ProQuest Taxonomy Boot Camp Presentation 2008
Re-engineering Taxonomy Warehouse as an Ontology
Synaptica Proquest Talk Taxonomy Boot Camp 2009

Recently uploaded (20)

PDF
Upgrade and Innovation Strategies for SAP ERP Customers
PPTX
Agentic AI : A Practical Guide. Undersating, Implementing and Scaling Autono...
PPTX
VVF-Customer-Presentation2025-Ver1.9.pptx
PPTX
Lecture 3: Operating Systems Introduction to Computer Hardware Systems
PPTX
Oracle E-Business Suite: A Comprehensive Guide for Modern Enterprises
PDF
Navsoft: AI-Powered Business Solutions & Custom Software Development
PDF
PTS Company Brochure 2025 (1).pdf.......
PDF
Raksha Bandhan Grocery Pricing Trends in India 2025.pdf
PDF
Claude Code: Everyone is a 10x Developer - A Comprehensive AI-Powered CLI Tool
PDF
Softaken Excel to vCard Converter Software.pdf
PDF
Why TechBuilder is the Future of Pickup and Delivery App Development (1).pdf
PDF
Adobe Premiere Pro 2025 (v24.5.0.057) Crack free
PDF
Adobe Illustrator 28.6 Crack My Vision of Vector Design
PDF
Design an Analysis of Algorithms II-SECS-1021-03
PDF
Audit Checklist Design Aligning with ISO, IATF, and Industry Standards — Omne...
PPTX
Introduction to Artificial Intelligence
PDF
AI in Product Development-omnex systems
PDF
medical staffing services at VALiNTRY
PPTX
Essential Infomation Tech presentation.pptx
PDF
Internet Downloader Manager (IDM) Crack 6.42 Build 41
Upgrade and Innovation Strategies for SAP ERP Customers
Agentic AI : A Practical Guide. Undersating, Implementing and Scaling Autono...
VVF-Customer-Presentation2025-Ver1.9.pptx
Lecture 3: Operating Systems Introduction to Computer Hardware Systems
Oracle E-Business Suite: A Comprehensive Guide for Modern Enterprises
Navsoft: AI-Powered Business Solutions & Custom Software Development
PTS Company Brochure 2025 (1).pdf.......
Raksha Bandhan Grocery Pricing Trends in India 2025.pdf
Claude Code: Everyone is a 10x Developer - A Comprehensive AI-Powered CLI Tool
Softaken Excel to vCard Converter Software.pdf
Why TechBuilder is the Future of Pickup and Delivery App Development (1).pdf
Adobe Premiere Pro 2025 (v24.5.0.057) Crack free
Adobe Illustrator 28.6 Crack My Vision of Vector Design
Design an Analysis of Algorithms II-SECS-1021-03
Audit Checklist Design Aligning with ISO, IATF, and Industry Standards — Omne...
Introduction to Artificial Intelligence
AI in Product Development-omnex systems
medical staffing services at VALiNTRY
Essential Infomation Tech presentation.pptx
Internet Downloader Manager (IDM) Crack 6.42 Build 41

Selecting the right database type for your knowledge management needs.

  • 1. Selecting the right database for your semantic storage needs Taxonomy Boot Camp London 2017 Jim Sweeney, Senior Product Manager Taxonomy and Ontology Solutions
  • 2. Database Types There are a number of different database types available. We will look at three key designs as they relate to storing semantic information. • Relational Database Management Systems (RDBMS) • Property Graph Databases • RDF Databases (a specialized form of Graph Database)
  • 3. A Brief History • The relational database did not appear until the late 1970’s replacing Navigational database designs of the previous decade. • The first commercial versions started to appear in the early 1980’s with IBM’s DB2 and Oracle’s products leading the way. • Microsoft released their SQL Server platform in 1989. • These three providers continue to be among the leaders as far as market share across all database types.
  • 4. A Brief History (Continued) • Graph database systems begin to appear in the late 1990’s, coinciding with the adoption of RDF as a W3C recommendation. • Leading Property Graph and RDF database providers such as Neo4J, GraphDB, and Jena (to name just a few) continue to grow in market share but still represent just a fraction of the total market share. Rank DBMS Database Model Score Oct Oct Oct Sep Oct 2017 2016 2017 2017 2016 1 1 Oracle Relational DBMS 1348.8 -10.29 -68.3 2 2 MySQL Relational DBMS 1298.83 -13.78 -63.82 3 3 Microsoft SQL Server Relational DBMS 1210.32 -2.23 -3.86 4 5 PostgreSQL Relational DBMS 373.27 0.91 54.58 5 4 MongoDB Document store 329.4 -3.33 10.6 6 6 DB2 Relational DBMS 194.59 -3.75 14.03 7 8 Microsoft Access Relational DBMS 129.45 0.64 4.78 8 7 Cassandra Wide column store 124.79 -1.41 -10.27 9 9 Redis Key-value store 122.05 1.65 12.51 10 11 Elasticsearch Search engine 120.23 0.23 21.12 21 21 Neo4j Graph DBMS 37.95 -0.48 1.5 89 86 Jena RDF store 2.31 0.04 0.5
  • 5. Relational Databases Management Systems (RDBMS) • RDBMS are designed using multiple tables to store data. • Each table is made up of headers, rows, and columns, much like a spreadsheet. • Each row contains different elements, including a primary key and zero to many foreign keys. • These keys are used to link information in one table to another table, which will contain distinct but related information. Image from neo4j.com
  • 6. RDBMS Design • The use of a key in each table means that there is no “relationship” as we would find in natural language or a graph structure. • Table values themselves are what link the data together by creating a join between data elements using keys. Image from neo4j.com
  • 7. Pros and Cons of RDBMS Pros: • RDBMS are tried and tested over many decades, and generally have more features available than newer platforms. • RDBMS are very efficient at storing indexable information that is described well in “tabular” structures, such as accounting or inventory data. • Data is often indexed at the time of entry, making keyword searches and those with limited table joins relatively fast.
  • 8. Pros and Cons of RDBMS Cons: • RDBMS require a foreknowledge of all the data, and consequently data tables, that you will be storing. (Graph structures provide more ad hoc design flexibility.) • RDBMS don’t perform as well in pattern matching searches as the number of relationships to be traversed increases. • SQL language, while widely known, is slightly more complex than those used with Property Graph or RDF systems.
  • 9. RDBMS Deconstructed • As we have seen, Relational Databases use tables and key values to link information together. • Let’s focus on just the highlighted components to compare this design with that of a Graph database. Image from neo4j.com
  • 10. The Graph Model • With a graph database, such as Neo4j, each node is a standalone item that describes a fully-formed member of the database. • To link the nodes together, one uses relationships. These are referred to as edges. • The nodes and edges form uniform structures which are repeated to relate all of the data elements, completely describing all necessary aspects of the information in the database. Image from neo4j.com
  • 11. The Graph Model (continued) • If we look at our example with Alice, we see that she belongs to several departments. The nature of the link from the person, Alice, to the Departments is described by the relationship itself, in this case “:BELONGS_TO”. • The node-edge-node combination that is formed has Alice belonging to the department “4FUTURE”. • No longer are we linking items via table joins. We are now describing the links is a more human understandable format. Image from neo4j.com
  • 12. RDF – Standardizing the Triple • RDF or Resource Description Framework provides standards that we use to uniformly describe the relationships between nodes, as well as describe other attributes that we want to assign to our data as metadata. • The standard establishes a subject- predicate-object unit called a triple. • One such RDF standard, FOAF (Friend of a Friend), is a collection of relationships and attributes describing people, their relationships to one another, and their activities. • In this example, we see Alice is not just identified as a member of departments, but also as a family member based near a location, and with a descriptive home page.
  • 13. RDF – Standardizing the Triple • To extend the previous example, Denver might be linked to the state of Colorado, in the United States, in North America using RDF OWL. • Using Graph database design, and employing RDF descriptive language collections, one can create links between entities that may virtually go on and on. • What we create becomes a dynamic, three dimensional database model capable of establishing true semantic Ontologies.
  • 14. Linked Data and the Semantic Web • In 1991 Tim Berners-Lee described what he called the World Wide Web, which was meant to, “allow links to be made to any information anywhere.”1 • In the 2010’s we are seeing the concept of Linked Data realizing this goal, utilizing RDF standards and other technologies such as SPARQL (Protocol and RDF Query Language) and JSON (JavaScript Object Notation). • With Linked Data protocols, one may send queries across the web to retrieve specific information from any source published in this format. • While this is possible using RDBMS technologies as well, Graph databases speak the native language of Linked Data and start to introduce some of the most exciting potential of the World Wide Web. Linking Open Data cloud diagram 2017, by Andrejs Abele, John P. McCrae, Paul Buitelaar, Anja Jentzsch and Richard Cyganiak. http://lod-cloud.net/ 1 https://www.w3.org/People/Berners-Lee/1991/08/art-6484.txt
  • 15. Pros and Cons of Graph and RDF Databases Pros: • Graph databases are very well suited to storing information that contains many relationships and data points, leading to fast performance for pattern based searches. • SPARQL, Cypher, and other Graph query languages are generally simple and powerful. • RDF data stores can connect easily and natively to Linked Data sources.
  • 16. Pros and Cons of Graph and RDF Databases Cons: • Because they are not indexed as RDBMS are, they are not as effective in handing very large numbers of transactions or queries that need to capture linked information across an entire database at once. • Newer technology, less tested, not as many providers, and not as well supported, yet. • May not integrate well with legacy systems that will be consuming semantic information.
  • 17. Conclusions • RDMBS offer a tried and true technology that is more than capable of providing the database layer for most semantic technology applications and services. • Their wide adoption in nearly all data management environments make them easy to support and integrate with external systems. • When properly indexed and designed, they are very good at handling many, simultaneous transactions. • And, they are able to handle some advanced functionality enabled by RDF and Linked Data with a little design expertise.
  • 18. Conclusions • Graph and RDF databases are newer to the market and have yet to come close to the saturation levels that RDBMS enjoy, but they offer a design that is intrinsically effective at managing semantic information and the technologies around RDF and Linked Data. • Graph and RDF databases don’t require foreknowledge or pre- definition necessary for RDMBS design, making them able to evolve rapidly and organically. • Because Graph and RDF databases use basic, semantic structure as a key design characteristic they are very good at answering questions posed by real-life queries that we find in everyday language.
  • 19. Resources • Explanation of Graph Structure: http://www.zdnet.com/pictures/a- graph-is-a-graph-is-graph-rdf-vs-lpg/ • Data Modeling with Graph Databases: https://youtu.be/r4IGWNovzes • Designing and Building Semantic Applications with Linked Data: https://youtu.be/ToALLwM7KIw
  • 20. Thank You! Jim Sweeney, Senior Product Manager jim.sweeney@synaptica.com www.synaptica.com Twitter: @synaptica