SlideShare a Scribd company logo
Database Awareness
09 Mar 2021
@irensaltali
Who am I?
İren Saltalı
.NET Consultant @kloia
Blog : irensaltali.medium.com
Tweet : @irensaltali
LinkedIn : /in/irensaltali
GitHub : github.com/irensaltali
2
Agenda
• Why we need Database Awareness?
• Document vs Relational
• Row-based vs Column-based
• In-memory Database vs In-memory Data grids
• Graph
• Time-series
• Solr vs ElasticSearch
• Event Store
Why we need Database Awareness?
Databases directly affect our system performance, scalability, durability, consistency,
cost, and even how we code. We need to choose the database that meets our demands best. To
do that, we have to know two main topics.
• How database works (Database Awareness)
• How our system works (System Awareness)
4
Document
Unstructured
Frequent updates to the data structure
Application-level joins
Horizontal scaling
Document based data modeling
MongoDB, Apache CouchDB, Couchbase
Table
Schema
No/less updates to the data structure
Server-level joins
Vertical scaling
Relational data modeling
MSSQL, MySQL, PostgreSQL
vs
5
Document vs Relational
6
Document vs Relational – Use Cases
Document
• Content management
• Logging
• Storing third party system’s data
• Web analytics
RDMS
• Banking/Finance
• Booking
• ERP
7
Row-based vs Column-based
Name City Age
İren Ankara 34
Seren Yalova 31
Bilgehan İstanbul 25
İren Ankara 34 Seren Yalova 31 Bilgehan İstanbul 25
İren Seren Bilgehan Anlara Yalova İstanbul 34 31 25
Row-based
Column-based
Row-based vs Column-based - Write
İren Ankara 34 Seren Yalova 31 Bilgehan İstanbul 25 Doğa Ankara 2
İren Seren Bilgehan Doğa Ankara Yalova İstanbul Ankara 34 31 25 2
Row-based
Column-based
Doğa Ankara 2
New data
Row-based vs Column-based - Read
İren Ankara 34 Seren Yalova 31 Bilgehan İstanbul 25 Doğa Ankara 2
İren Seren Bilgehan Doğa Ankara Yalova İstanbul Ankara 34 31 25 2
Row-based
Column-based
Select * İren Seren Bilgehan Doğa Ankara Yalova İstanbul Ankara 34 31 25 2
Select Sum(Age) İren Seren Bilgehan Doğa Ankara Yalova İstanbul Ankara 34 31 25 2
10
IMDB vs IMDG
SQL support
No MPP
Replace RDBMS
Network Latency
Redis
No/less SQL support
Massively Parallel Processing
Can’t replace RDBMS
On same server with application
Hazelcast
vs
Graph Databases
A graph is composed of two elements: a node and a relationship.
• Nodes represent entities.
• Edges (graphs, relationships), are the lines that connect nodes
to other nodes.
• Edges can be directed or undirected.
• Edges can store properties represented by key/value pairs.
• High performance on graph-like queries.
Some graph databases: Amazon Neptune, Neo4j, OrientDB
11
image from https://aws.amazon.com/nosql/graph/
Times Series Database (TSDB)
A time series database (TSDB) is a database optimized for time-
stamped or time series data.
• Built specifically for handling metrics and events or
measurements that are time-stamped.
• Discrete characteristics from its continuous values.
• Best for server metrics, application performance monitoring,
network data, sensor data, events, clicks, trades in a market.
Some times series databases: Prometheus, Graphite, InfluxDB, Amazon Timestream
12
Solr vs ElasticSearch
13
XML, CSV, JSON, DB, Word, Pdf
DBC, CSV, XML, Tika, URL, Flat File
REST, Schemaless
Lucene Query
Span queries, Autocomplete, Faceting, Spatial/geo search
Visualisation: Banana (Port of Kibana)
Hard to manage scaling
JSON
ActiveMQ, Amazon SQS, CouchDB, DynamoDB, FileSystem, Git,
GitHub, Hazelcast, JDBC, JMS, Kafka, LDAP, MongoDB, neo4j,
OAI, RabbitMQ, Redis, RSS, Sofa, Solr, St9, Subversion, Twitter,
Wikipedia
Schemaless
Lucene Query, Query DSL
Span queries, Autocomplete, Faceting, Spatial/geo search
Visualisation: Kibana
Built for horizontal scaling
vs
Event Store
An event store databases optimized for storage of events.
• Event are not allowed to be changed.
• Optimized for writes
• Reproducibility
• Snapshots
Some event stores: IBM Db2 Event Store, EventStoreDB, NEventStore
14
Q & A
Thank you for listening.
blog.kloia.com @kloia_com
kloia.com
@irensaltali
RESOURCES MENTIONED IN
THIS SESSION WILL BE
AVAILABLE ON MY TWITTER
Sources
• https://www.digitalocean.com/community/tutorials/a-comparison-of-nosql-database-
management-systems-and-models
• https://www.flydata.com/blog/whats-unique-about-a-columnar-database/
• https://www.mongodb.com/blog/post/6-rules-of-thumb-for-mongodb-schema-design-part-1
• https://dataschool.com/data-modeling-101/row-vs-column-oriented-databases/
• https://www.youtube.com/watch?v=Vw1fCeD06YI
• https://en.wikipedia.org/wiki/Time_series_database
• https://www.influxdata.com/time-series-database/
• https://logz.io/blog/solr-vs-elasticsearch/
• https://solr-vs-elasticsearch.com/
• https://en.wikipedia.org/wiki/Event_store
• https://docs.microsoft.com/en-us/dotnet/architecture/cloud-native/relational-vs-nosql-data

More Related Content

PPTX
Eugene Polonichko "Azure Data Lake: what is it? why is it? where is it?"
PPTX
Schema-on-Read vs Schema-on-Write
PDF
What is an Open Data Lake? - Data Sheets | Whitepaper
PPTX
Azure data factory
PDF
Introduction to Azure Data Lake
PPTX
Azure Synapse Analytics Overview (r1)
PPTX
Data Lake Overview
PPTX
Anatomy of a data driven architecture - Tamir Dresher
Eugene Polonichko "Azure Data Lake: what is it? why is it? where is it?"
Schema-on-Read vs Schema-on-Write
What is an Open Data Lake? - Data Sheets | Whitepaper
Azure data factory
Introduction to Azure Data Lake
Azure Synapse Analytics Overview (r1)
Data Lake Overview
Anatomy of a data driven architecture - Tamir Dresher

What's hot (20)

PDF
The Warranty Data Lake – After, Inc.
PPTX
Azure Lowlands: An intro to Azure Data Lake
PPTX
Big Data with SQL Server
PPSX
How to use Big Data and Data Lake concept in business using Hadoop and Spark...
PPTX
Hadoop and Hive in Enterprises
PPTX
Microsoft Data Platform - What's included
PPTX
Introduction to PolyBase
PPTX
Demystifying data engineering
PPTX
Power BI for Big Data and the New Look of Big Data Solutions
PPTX
SQL vs NoSQL
PDF
Dipping Your Toes: Azure Data Lake for DBAs
PPTX
Hadoop vs. RDBMS for Advanced Analytics
PPTX
Integrating hadoop - Big Data TechCon 2013
PPTX
Data lake-itweekend-sharif university-vahid amiry
PDF
Data Engineering Basics
PPTX
Big data architectures and the data lake
PPTX
Microsoft Azure Big Data Analytics
PPTX
Should I move my database to the cloud?
PPTX
Delta Lake with Azure Databricks
PDF
Modern data warehouse
The Warranty Data Lake – After, Inc.
Azure Lowlands: An intro to Azure Data Lake
Big Data with SQL Server
How to use Big Data and Data Lake concept in business using Hadoop and Spark...
Hadoop and Hive in Enterprises
Microsoft Data Platform - What's included
Introduction to PolyBase
Demystifying data engineering
Power BI for Big Data and the New Look of Big Data Solutions
SQL vs NoSQL
Dipping Your Toes: Azure Data Lake for DBAs
Hadoop vs. RDBMS for Advanced Analytics
Integrating hadoop - Big Data TechCon 2013
Data lake-itweekend-sharif university-vahid amiry
Data Engineering Basics
Big data architectures and the data lake
Microsoft Azure Big Data Analytics
Should I move my database to the cloud?
Delta Lake with Azure Databricks
Modern data warehouse
Ad

Similar to Database awareness (20)

PPTX
Relational databases vs Non-relational databases
PDF
NOSQL- Presentation on NoSQL
PDF
NOSQL in big data is the not only structure langua.pdf
PPTX
Navigating NoSQL in cloudy skies
PPTX
Transform your DBMS to drive engagement innovation with Big Data
PPTX
PPTX
Nosql
DOCX
PDF
the rising no sql technology
PDF
NoSQL BIg Data Analytics Mongo DB and Cassandra .pdf
PPTX
NoSQLDatabases
PDF
Nosql Presentation.pdf for DBMS understanding
PPTX
No sql database
PPTX
NoSQL Architecture Overview
PPTX
Introduction to asdfghjkln b vfgh n v
PPTX
Presentation1
PPT
Graph Database and Neo4j
PDF
One Size Doesn't Fit All: The New Database Revolution
PPT
No sql
PPTX
MinneBar 2013 - Scaling with Cassandra
Relational databases vs Non-relational databases
NOSQL- Presentation on NoSQL
NOSQL in big data is the not only structure langua.pdf
Navigating NoSQL in cloudy skies
Transform your DBMS to drive engagement innovation with Big Data
Nosql
the rising no sql technology
NoSQL BIg Data Analytics Mongo DB and Cassandra .pdf
NoSQLDatabases
Nosql Presentation.pdf for DBMS understanding
No sql database
NoSQL Architecture Overview
Introduction to asdfghjkln b vfgh n v
Presentation1
Graph Database and Neo4j
One Size Doesn't Fit All: The New Database Revolution
No sql
MinneBar 2013 - Scaling with Cassandra
Ad

More from kloia (20)

PPTX
Converged Infrastructures on Kubernetes with Kubevirt
PPTX
Kloia AWS IBM Hashicorp Day Presentation
PPTX
AWS reInvent recap 2024 - Dorian/Derya SEZEN
PPTX
re:Invent recap - Application Modernization
PDF
Isovalent-kloia Cilium Workshop
PPTX
Kloia - Why Microsoft Modernisation Matters
PDF
DotNetKonf23 - NET Modernization Problems & Solutions.pdf
PPTX
AWS User Group Meetup Feb2023.pptx
PDF
re:Invent Recap
PPTX
The New era in QA: k6
PPTX
Etkili Blog Yazım Teknikleri - Tuğba Sertkaya
PPTX
AWS re:Invent 2021 Recap by APN Ambassador
PPTX
Camunda BPM - Said Mengi
PPTX
AlOps - Yetişkan Eliaçık
PPTX
Zaman Yönetimi - Aras Bilgen
PDF
Gravitee API Management - Ahmet AYDIN
PPTX
React Bootcamp Day 2 - Yunus Demirpolat
PPTX
React Bootcamp Day 1 - Yunus Demirpolat
PDF
Contract testing - Baran Gayretli
PDF
Contract Testing
Converged Infrastructures on Kubernetes with Kubevirt
Kloia AWS IBM Hashicorp Day Presentation
AWS reInvent recap 2024 - Dorian/Derya SEZEN
re:Invent recap - Application Modernization
Isovalent-kloia Cilium Workshop
Kloia - Why Microsoft Modernisation Matters
DotNetKonf23 - NET Modernization Problems & Solutions.pdf
AWS User Group Meetup Feb2023.pptx
re:Invent Recap
The New era in QA: k6
Etkili Blog Yazım Teknikleri - Tuğba Sertkaya
AWS re:Invent 2021 Recap by APN Ambassador
Camunda BPM - Said Mengi
AlOps - Yetişkan Eliaçık
Zaman Yönetimi - Aras Bilgen
Gravitee API Management - Ahmet AYDIN
React Bootcamp Day 2 - Yunus Demirpolat
React Bootcamp Day 1 - Yunus Demirpolat
Contract testing - Baran Gayretli
Contract Testing

Recently uploaded (20)

PDF
Odoo Companies in India – Driving Business Transformation.pdf
PDF
Claude Code: Everyone is a 10x Developer - A Comprehensive AI-Powered CLI Tool
PDF
Design an Analysis of Algorithms II-SECS-1021-03
PDF
Audit Checklist Design Aligning with ISO, IATF, and Industry Standards — Omne...
PPTX
ai tools demonstartion for schools and inter college
PPTX
Oracle E-Business Suite: A Comprehensive Guide for Modern Enterprises
PDF
PTS Company Brochure 2025 (1).pdf.......
PPTX
Essential Infomation Tech presentation.pptx
PDF
System and Network Administraation Chapter 3
PPTX
Reimagine Home Health with the Power of Agentic AI​
PPTX
Odoo POS Development Services by CandidRoot Solutions
PPTX
Introduction to Artificial Intelligence
PPTX
CHAPTER 2 - PM Management and IT Context
PDF
Design an Analysis of Algorithms I-SECS-1021-03
PPTX
Lecture 3: Operating Systems Introduction to Computer Hardware Systems
PDF
Addressing The Cult of Project Management Tools-Why Disconnected Work is Hold...
PDF
Internet Downloader Manager (IDM) Crack 6.42 Build 42 Updates Latest 2025
PDF
How to Choose the Right IT Partner for Your Business in Malaysia
PDF
Understanding Forklifts - TECH EHS Solution
PDF
How Creative Agencies Leverage Project Management Software.pdf
Odoo Companies in India – Driving Business Transformation.pdf
Claude Code: Everyone is a 10x Developer - A Comprehensive AI-Powered CLI Tool
Design an Analysis of Algorithms II-SECS-1021-03
Audit Checklist Design Aligning with ISO, IATF, and Industry Standards — Omne...
ai tools demonstartion for schools and inter college
Oracle E-Business Suite: A Comprehensive Guide for Modern Enterprises
PTS Company Brochure 2025 (1).pdf.......
Essential Infomation Tech presentation.pptx
System and Network Administraation Chapter 3
Reimagine Home Health with the Power of Agentic AI​
Odoo POS Development Services by CandidRoot Solutions
Introduction to Artificial Intelligence
CHAPTER 2 - PM Management and IT Context
Design an Analysis of Algorithms I-SECS-1021-03
Lecture 3: Operating Systems Introduction to Computer Hardware Systems
Addressing The Cult of Project Management Tools-Why Disconnected Work is Hold...
Internet Downloader Manager (IDM) Crack 6.42 Build 42 Updates Latest 2025
How to Choose the Right IT Partner for Your Business in Malaysia
Understanding Forklifts - TECH EHS Solution
How Creative Agencies Leverage Project Management Software.pdf

Database awareness

  • 1. Database Awareness 09 Mar 2021 @irensaltali
  • 2. Who am I? İren Saltalı .NET Consultant @kloia Blog : irensaltali.medium.com Tweet : @irensaltali LinkedIn : /in/irensaltali GitHub : github.com/irensaltali 2
  • 3. Agenda • Why we need Database Awareness? • Document vs Relational • Row-based vs Column-based • In-memory Database vs In-memory Data grids • Graph • Time-series • Solr vs ElasticSearch • Event Store
  • 4. Why we need Database Awareness? Databases directly affect our system performance, scalability, durability, consistency, cost, and even how we code. We need to choose the database that meets our demands best. To do that, we have to know two main topics. • How database works (Database Awareness) • How our system works (System Awareness) 4
  • 5. Document Unstructured Frequent updates to the data structure Application-level joins Horizontal scaling Document based data modeling MongoDB, Apache CouchDB, Couchbase Table Schema No/less updates to the data structure Server-level joins Vertical scaling Relational data modeling MSSQL, MySQL, PostgreSQL vs 5 Document vs Relational
  • 6. 6 Document vs Relational – Use Cases Document • Content management • Logging • Storing third party system’s data • Web analytics RDMS • Banking/Finance • Booking • ERP
  • 7. 7 Row-based vs Column-based Name City Age İren Ankara 34 Seren Yalova 31 Bilgehan İstanbul 25 İren Ankara 34 Seren Yalova 31 Bilgehan İstanbul 25 İren Seren Bilgehan Anlara Yalova İstanbul 34 31 25 Row-based Column-based
  • 8. Row-based vs Column-based - Write İren Ankara 34 Seren Yalova 31 Bilgehan İstanbul 25 Doğa Ankara 2 İren Seren Bilgehan Doğa Ankara Yalova İstanbul Ankara 34 31 25 2 Row-based Column-based Doğa Ankara 2 New data
  • 9. Row-based vs Column-based - Read İren Ankara 34 Seren Yalova 31 Bilgehan İstanbul 25 Doğa Ankara 2 İren Seren Bilgehan Doğa Ankara Yalova İstanbul Ankara 34 31 25 2 Row-based Column-based Select * İren Seren Bilgehan Doğa Ankara Yalova İstanbul Ankara 34 31 25 2 Select Sum(Age) İren Seren Bilgehan Doğa Ankara Yalova İstanbul Ankara 34 31 25 2
  • 10. 10 IMDB vs IMDG SQL support No MPP Replace RDBMS Network Latency Redis No/less SQL support Massively Parallel Processing Can’t replace RDBMS On same server with application Hazelcast vs
  • 11. Graph Databases A graph is composed of two elements: a node and a relationship. • Nodes represent entities. • Edges (graphs, relationships), are the lines that connect nodes to other nodes. • Edges can be directed or undirected. • Edges can store properties represented by key/value pairs. • High performance on graph-like queries. Some graph databases: Amazon Neptune, Neo4j, OrientDB 11 image from https://aws.amazon.com/nosql/graph/
  • 12. Times Series Database (TSDB) A time series database (TSDB) is a database optimized for time- stamped or time series data. • Built specifically for handling metrics and events or measurements that are time-stamped. • Discrete characteristics from its continuous values. • Best for server metrics, application performance monitoring, network data, sensor data, events, clicks, trades in a market. Some times series databases: Prometheus, Graphite, InfluxDB, Amazon Timestream 12
  • 13. Solr vs ElasticSearch 13 XML, CSV, JSON, DB, Word, Pdf DBC, CSV, XML, Tika, URL, Flat File REST, Schemaless Lucene Query Span queries, Autocomplete, Faceting, Spatial/geo search Visualisation: Banana (Port of Kibana) Hard to manage scaling JSON ActiveMQ, Amazon SQS, CouchDB, DynamoDB, FileSystem, Git, GitHub, Hazelcast, JDBC, JMS, Kafka, LDAP, MongoDB, neo4j, OAI, RabbitMQ, Redis, RSS, Sofa, Solr, St9, Subversion, Twitter, Wikipedia Schemaless Lucene Query, Query DSL Span queries, Autocomplete, Faceting, Spatial/geo search Visualisation: Kibana Built for horizontal scaling vs
  • 14. Event Store An event store databases optimized for storage of events. • Event are not allowed to be changed. • Optimized for writes • Reproducibility • Snapshots Some event stores: IBM Db2 Event Store, EventStoreDB, NEventStore 14
  • 15. Q & A Thank you for listening. blog.kloia.com @kloia_com kloia.com @irensaltali RESOURCES MENTIONED IN THIS SESSION WILL BE AVAILABLE ON MY TWITTER
  • 16. Sources • https://www.digitalocean.com/community/tutorials/a-comparison-of-nosql-database- management-systems-and-models • https://www.flydata.com/blog/whats-unique-about-a-columnar-database/ • https://www.mongodb.com/blog/post/6-rules-of-thumb-for-mongodb-schema-design-part-1 • https://dataschool.com/data-modeling-101/row-vs-column-oriented-databases/ • https://www.youtube.com/watch?v=Vw1fCeD06YI • https://en.wikipedia.org/wiki/Time_series_database • https://www.influxdata.com/time-series-database/ • https://logz.io/blog/solr-vs-elasticsearch/ • https://solr-vs-elasticsearch.com/ • https://en.wikipedia.org/wiki/Event_store • https://docs.microsoft.com/en-us/dotnet/architecture/cloud-native/relational-vs-nosql-data