SlideShare a Scribd company logo
NosqL
 SQL has Ruledfor two decades
 Store persistent data
 Application Integration
 Concurrency Control
 Mostly Standard
 Reporting
Whatis NoSql?
 don’t use the relational data model,
 tend to be designed to run on a cluster
 don’t have a fixed schema,
 allow tostore any data in any record
 seeks to solve the scalabilityandbig data performance issues
 5 Considerations When EvaluatingNoSQL
 Data Model
 Querymodel
 Consistencymodel
 API
 Commercialsupport
 Types of NoSQL databases
 Document: CouchDB, Couchbase, MongoDB
 Key-value:Riak,MemcacheDB, Redis
 Graph: Neo4J
 Column: Cassandra, HBase
Document Based
{ _id:”5466b7c22”,
company:{
_id:”5466b7c22”, company:
“google”, symbol :
“GOOG”, stock_price:
“858”, stock_volume:10M
}}
Nosql Introduction, Basics
Nosql Introduction, Basics
Nosql Introduction, Basics
 Query Model
 ThebiggestdifferencebetweenNoSQLsystemsliesintheability
toquerydata efficiently.
 Document databases provide the richest query
 Key-value stores providea singlemeansofaccessingdata:by
key.
 Consistency Model
 CONSISTENT SYSTEMS
 EVENTUALLY CONSISTENT SYSTEMS
 Reasons to use NoSQL databases
 Scale (horizontal)
 Simpler datamodel (less joins)
 Redundancy/Reliability
 Schema less (no modelling or prototyping)
 Rapid Development/coder friendly
 Flexibility
 semi-structured/ unstructured/Structured
 Cheaperthanrelational/ commodity
 Creatinga cachingLayer
 Environment data/logs
 Graphs/relationships
 Distributedstorage
 Realtime analysis
 Challenges against NoSQL
 ACID
 Maturity
 Analytics/ BI / Reporting
 Ecosystem/tools/addons
 search
 Security
 Data loss
 duplicatedata(Normalisation)
 Administration
 Expertiseavailability
 Relationalis not dead
 Thisleads us to a world of Polyglot Persistence
Nosql Introduction, Basics
 How Twitter UsesNoSQL
 is heavily dependent on MySQL
 users generate 12terrabytes of data a day - about four petabytes peryear
 Twitter uses Scribe to write logs to Hadoop
 Hbase is built on top of Hadoop
 using Cassandra for atomic counting.
 Does Stack Overflow use caching?
 Redis for site cache and global cache
 around 1,300,000 keys in our Redis cache
 a few hundred read/writes a second to Redis
 CPU usage on their dedicated Redis machine is0 – 1 percent
 Memory usage is around 6G
 Instagram loves Redis for caching complex objects
 Pinterest uses Redis to store the graph of followers and
caching, HBase
 LinkedIn uses Hadoop, Voldemort and Espresso.
 Can we use NoSql in our project?

More Related Content

PPTX
Real World NoSQL (by Chris Yuen)
PDF
Introduction of Redis as NoSQL Database
PPTX
No sqlpresentation
PDF
Au cœur de la roadmap de la Suite Elastic
PPT
Nosql Introduction
PPTX
Mongodb vs mysql
PPTX
Your data layer - Choosing the right database solutions for the future
PPTX
Philly Code Camp 2013 Mark Kromer Big Data with SQL Server
Real World NoSQL (by Chris Yuen)
Introduction of Redis as NoSQL Database
No sqlpresentation
Au cœur de la roadmap de la Suite Elastic
Nosql Introduction
Mongodb vs mysql
Your data layer - Choosing the right database solutions for the future
Philly Code Camp 2013 Mark Kromer Big Data with SQL Server

What's hot (20)

PPTX
Azure Big Data Story
PPTX
Big data in Azure
PDF
Bi with apache hadoop(en)
PDF
NoSQL Databases Introduction - UTN 2013
PPTX
Visualizing Austin's data with Elasticsearch and Kibana
PPTX
Introduction to NoSQL
PPTX
Build 2017 - P4010 - A lap around Azure HDInsight and Cosmos DB Open Source A...
PDF
Serverless data lake architecture
PDF
Mesos by zigi
PPTX
Database Choices
PPTX
PPTX
Database and types of database
PDF
Securing data and preventing data breaches
PDF
How to visualize Cosmos DB graph data
PDF
Análisis del roadmap del Elastic Stack
PPTX
Intro to bigdata on gcp (1)
PPTX
Advanced Databases: Introduction to NoSQL, Big Data and Google's Big Table
PPTX
Azure DocumentDB for Healthcare Integration
PPTX
Neo4j_allHands_04112013
PPTX
Move your on prem data to a lake in a Lake in Cloud
Azure Big Data Story
Big data in Azure
Bi with apache hadoop(en)
NoSQL Databases Introduction - UTN 2013
Visualizing Austin's data with Elasticsearch and Kibana
Introduction to NoSQL
Build 2017 - P4010 - A lap around Azure HDInsight and Cosmos DB Open Source A...
Serverless data lake architecture
Mesos by zigi
Database Choices
Database and types of database
Securing data and preventing data breaches
How to visualize Cosmos DB graph data
Análisis del roadmap del Elastic Stack
Intro to bigdata on gcp (1)
Advanced Databases: Introduction to NoSQL, Big Data and Google's Big Table
Azure DocumentDB for Healthcare Integration
Neo4j_allHands_04112013
Move your on prem data to a lake in a Lake in Cloud
Ad

Similar to Nosql Introduction, Basics (20)

PDF
NoSql and it's introduction features-Unit-1.pdf
PPTX
No sq lv2
PPTX
Introduction to Data Science NoSQL.pptx
PDF
NoSQL Basics - A Quick Tour
PDF
Bill howe 4_bigdatasystems
PPT
No sql
PPTX
NoSql evaluation
PPT
Indic threads pune12-nosql now and path ahead
PPTX
NoSQL(NOT ONLY SQL)
PPTX
cours database pour etudiant NoSQL (1).pptx
PPT
No sql (1)
PPTX
Presentation on NOSQL and mongodb .pptx
PPTX
NoSQL.pptx
PPTX
PPTX
PDF
No sql databases
PPT
NoSql Databases
PPTX
PPTX
Relational databases vs Non-relational databases
PPTX
nosql.pptx
NoSql and it's introduction features-Unit-1.pdf
No sq lv2
Introduction to Data Science NoSQL.pptx
NoSQL Basics - A Quick Tour
Bill howe 4_bigdatasystems
No sql
NoSql evaluation
Indic threads pune12-nosql now and path ahead
NoSQL(NOT ONLY SQL)
cours database pour etudiant NoSQL (1).pptx
No sql (1)
Presentation on NOSQL and mongodb .pptx
NoSQL.pptx
No sql databases
NoSql Databases
Relational databases vs Non-relational databases
nosql.pptx
Ad

Recently uploaded (20)

PDF
Approach and Philosophy of On baking technology
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PPTX
Group 1 Presentation -Planning and Decision Making .pptx
PPTX
Big Data Technologies - Introduction.pptx
PDF
NewMind AI Weekly Chronicles - August'25-Week II
PDF
Accuracy of neural networks in brain wave diagnosis of schizophrenia
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PPTX
Machine Learning_overview_presentation.pptx
PPTX
A Presentation on Artificial Intelligence
PDF
Empathic Computing: Creating Shared Understanding
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PDF
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
Video forgery: An extensive analysis of inter-and intra-frame manipulation al...
PDF
Machine learning based COVID-19 study performance prediction
PDF
Getting Started with Data Integration: FME Form 101
PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
Approach and Philosophy of On baking technology
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Per capita expenditure prediction using model stacking based on satellite ima...
Group 1 Presentation -Planning and Decision Making .pptx
Big Data Technologies - Introduction.pptx
NewMind AI Weekly Chronicles - August'25-Week II
Accuracy of neural networks in brain wave diagnosis of schizophrenia
“AI and Expert System Decision Support & Business Intelligence Systems”
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
Machine Learning_overview_presentation.pptx
A Presentation on Artificial Intelligence
Empathic Computing: Creating Shared Understanding
Building Integrated photovoltaic BIPV_UPV.pdf
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
Reach Out and Touch Someone: Haptics and Empathic Computing
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Video forgery: An extensive analysis of inter-and intra-frame manipulation al...
Machine learning based COVID-19 study performance prediction
Getting Started with Data Integration: FME Form 101
Profit Center Accounting in SAP S/4HANA, S4F28 Col11

Nosql Introduction, Basics

  • 2.  SQL has Ruledfor two decades  Store persistent data  Application Integration  Concurrency Control  Mostly Standard  Reporting
  • 4.  don’t use the relational data model,  tend to be designed to run on a cluster  don’t have a fixed schema,  allow tostore any data in any record  seeks to solve the scalabilityandbig data performance issues
  • 5.  5 Considerations When EvaluatingNoSQL  Data Model  Querymodel  Consistencymodel  API  Commercialsupport
  • 6.  Types of NoSQL databases  Document: CouchDB, Couchbase, MongoDB  Key-value:Riak,MemcacheDB, Redis  Graph: Neo4J  Column: Cassandra, HBase
  • 7. Document Based { _id:”5466b7c22”, company:{ _id:”5466b7c22”, company: “google”, symbol : “GOOG”, stock_price: “858”, stock_volume:10M }}
  • 11.  Query Model  ThebiggestdifferencebetweenNoSQLsystemsliesintheability toquerydata efficiently.  Document databases provide the richest query  Key-value stores providea singlemeansofaccessingdata:by key.
  • 12.  Consistency Model  CONSISTENT SYSTEMS  EVENTUALLY CONSISTENT SYSTEMS
  • 13.  Reasons to use NoSQL databases  Scale (horizontal)  Simpler datamodel (less joins)  Redundancy/Reliability  Schema less (no modelling or prototyping)  Rapid Development/coder friendly  Flexibility  semi-structured/ unstructured/Structured  Cheaperthanrelational/ commodity  Creatinga cachingLayer  Environment data/logs  Graphs/relationships  Distributedstorage  Realtime analysis
  • 14.  Challenges against NoSQL  ACID  Maturity  Analytics/ BI / Reporting  Ecosystem/tools/addons  search  Security  Data loss  duplicatedata(Normalisation)  Administration  Expertiseavailability
  • 15.  Relationalis not dead  Thisleads us to a world of Polyglot Persistence
  • 17.  How Twitter UsesNoSQL  is heavily dependent on MySQL  users generate 12terrabytes of data a day - about four petabytes peryear  Twitter uses Scribe to write logs to Hadoop  Hbase is built on top of Hadoop  using Cassandra for atomic counting.
  • 18.  Does Stack Overflow use caching?  Redis for site cache and global cache  around 1,300,000 keys in our Redis cache  a few hundred read/writes a second to Redis  CPU usage on their dedicated Redis machine is0 – 1 percent  Memory usage is around 6G
  • 19.  Instagram loves Redis for caching complex objects  Pinterest uses Redis to store the graph of followers and caching, HBase  LinkedIn uses Hadoop, Voldemort and Espresso.
  • 20.  Can we use NoSql in our project?

Editor's Notes

  • #4: Not Only Sql There is no standard definition of what NoSQL means. The term began with a workshop organized in 2009,
  • #18: Twitter uses Pig, a high-level language running on top of Hadoop. Yahoo created Pig for rapid Hadoop development, Hbase is built on top of Hadoop and is designed for low-latency and data mutability. Twitter uses it to power its people search. FlockDB is a real-time, distributed database. As mentioned above, it was created and open-sourced by Twitter. The company uses it for social graph analysis. It's still MySQL underneath Cassandra, an open-source NoSQL database created by Facebook.