SlideShare a Scribd company logo
NoSQL Database: New Era of Databases for Big data
Analytics - Classification, Characteristics and
Comparison
A B M Moniruzzaman and Syed Akhter Hossain

03/04/14

CSC 8710

1
Contents
•
•
•
•
•
•
•
•
•
•

NoSQL databases definition
Why NoSQL databases?
Characteristics of NoSQL Databases
Primary Uses of NoSQL Database
Key-Value databases
Documents databases
Column-Family databases
Graph databases
Adoption of NoSQL Database
Conclusion

03/04/14

CSC 8710

2
NoSQL Database
• NoSQL for Not Only SQL, refers to an eclectic and
increasingly familiar group of non-relational data
management system
• databases are not built primarily on tables, and generally
don't use SQL for data manipulation.
• NoSQL systems are distributed, non-relational database,
designed for large-scale data storage and for massiveparallel data processing across a large number of
commodity servers.

03/04/14

CSC 8710

3
NoSQL Database
• They also use non-SQL languages and mechanisms to
interact with data.
• NoSQL database systems arose alongside major Internet
companies, such as Google, Amazon, and Facebook
which had challenges in dealing with huge quantities of
data
• These systems are designed to scale thousands or
millions of users doing updates as well as reads, in
contrast to traditional DBMSs and data warehouses

03/04/14

CSC 8710

4
Why NoSQL?
• Relational DBMSs have been a successful
technology for many years, providing
persistence, concurrency control and integration
mechanisms.
• The need of processing large amount of data
changes the direction from scaling vertically to
scaling horizontally on clusters.

03/04/14

CSC 8710

5
Why NoSQL?
• NoSQL databases focus on analytical processing
of large scale datasets, offering increased
scalability over commodity hardware
• Organizations that collect large amounts of
unstructured data are increasingly turning to nonrelational databases (NoSQL databases).

03/04/14

CSC 8710

6
Big Data

03/04/14

CSC 8710

7
Characteristics of NoSQL Databases
• Strong Consistency: all clients see the same version of
data.
• High Availability: Data always available, at least one
copy of the requested data even if one of the nodes is
down.
• Partition-tolerance: the total system keeps its
characteristic even when being deployed on different
servers

03/04/14

CSC 8710

8
Characteristics of NoSQL Databases

03/04/14

CSC 8710

9
Primary Uses of NoSQL Database
1. Large-scale data processing
2. Exploratory analytics on semi-structured
data (expert level)
3. Large volume data storage.

03/04/14

CSC 8710

10
Classification of NoSQL Databases
• Key-Value databases
• Documents databases
• Column Family databases
• Graphics databases

03/04/14

CSC 8710

11
Key-Value Databases
• These DMS store items as alpha-numeric identifiers that
refer to the keys. Each key has associated values.
• The values could be simple text strings or more complex
lists and sets
• Search only performed against keys, and limited to exact
matches.
• Search cannot be performed against values

03/04/14

CSC 8710

12
Key-Value Databases

03/04/14

CSC 8710

13
Key-Value characterstics
• The simplicity of Key-Value Store makes them very quick
and light.
• Highly scalable retrieval of the values needed for
application tasks such as retrieving product names.
• This is why Amazon use K-V system, Dynamo, in its
shopping cart. Dynamo is a highly available key-value
storage system.
• Example: Dynamo (Amazon), Voldemort (LinkedIn)
Redis, BerkeleyDB, Riak
03/04/14

CSC 8710

14
Pros and Cons

• pros: anything can be stored in an
aggregate
• cons: only key lookup to access the entire
aggregate is allowed (no query and part of
aggregate retrieval mechanisms)

03/04/14

CSC 8710

15
Document Database

• Designed to manage and store
documents.
• These documents are encoded in a
standard data exchange format such as
XML, JSON (Javascript Option Notation)
or BSON (Binary JSON).
03/04/14

CSC 8710

16
Document Database

03/04/14

CSC 8710

17
Primary Uses
• Document databases are good for storing
and managing Big Data-size collections of
literal documents such as text documents,
email messages.

03/04/14

CSC 8710

18
Pros And Cons
• pros: allow structured queries and partial
aggregate retrieval based on the fields in
the aggregate
• cons: imposes a limit on what can be
placed in a database

03/04/14

CSC 8710

19
Column-Family Databases
• It consists of a Key-Value pair where the value
consists of set of columns.
• The column family databases are represented in
tables, each key-value pair being a row.
• All the related data can be grouped as one family

03/04/14

CSC 8710

20
Primary Uses
1. Large-scale, batch-oriented data processing:
sorting, parsing, conversion :
- conversions between hexadecimal, binary and
decimal code values.

2. Exploratory and predictive analytics performed
by expert statisticians and programmers.

03/04/14

CSC 8710

21
Column-Family

03/04/14

CSC 8710

22
Graph Databases
• Graph databases replace relational tables with structured
relational graphs of interconnected key-value pairings.
• Graph databases are useful when you are more
interested in relationships between data than the data
itself and it works perfectly for the social network.
• It is optimized for relationship traversing not for querying
• Examples: Neo4j, InfoGrid, Sones GraphDB,
AllegroGraph, InfiniteGraph
03/04/14

CSC 8710

23
Graph Databases

03/04/14

CSC 8710

24
Adoption of NoSQL Database
• Organizations that have massive data storage
are looking seriously at NoSQL.
• NoSQL Database expert are highly demanded
for most of the developing organizations.
• The next graph shows job trends of five NoSQL
Databases from Indeed.com

03/04/14

CSC 8710

25
Job Trends of Five NoSQL Databases

03/04/14

CSC 8710

26
Adoption of NoSQL Database
• MongoDB‘s growth means that it has cemented
its place as the most popular NoSQL database.
• According to LinkedIn profile mentions, The
mentions of NoSQL technologies form 45% in
LinkedIn profiles.

03/04/14

CSC 8710

27
LinkedIn statistics

03/04/14

CSC 8710

28
Conclusion
• Computational and storage requirements of applications
such as for Big Data analytics, Business Intelligence and
social networking over peta-byte datasets led us to the
change from SQL to NoSQL DBs.
• This led to the development of horizontally scalable,
distributed non-relational No-SQL databases.
• MongoDB‘s is the most demanded one.

03/04/14

CSC 8710

29
Resources
•

http://arxiv.org/ftp/arxiv/papers/1307/1307.0191.pdf

•

http://en.wikipedia.org/wiki/Column_family

•

http://en.wikipedia.org/wiki/NoSQL

03/04/14

CSC 8710

30
03/04/14

CSC 8710

31
03/04/14

CSC 8710

32

More Related Content

PDF
NOSQL- Presentation on NoSQL
PPTX
Introduction to NOSQL databases
PPT
Introduction to mongodb
PPTX
Nosql databases
PPTX
PPTX
Introduction to MongoDB
PPTX
NOSQL Databases types and Uses
PPTX
Mongodb - NoSql Database
NOSQL- Presentation on NoSQL
Introduction to NOSQL databases
Introduction to mongodb
Nosql databases
Introduction to MongoDB
NOSQL Databases types and Uses
Mongodb - NoSql Database

What's hot (20)

PPT
6 Data Modeling for NoSQL 2/2
PPTX
NOSQL vs SQL
PPTX
MongoDB
PPTX
Introduction to NoSQL Databases
PDF
Data-Ed: Data Warehousing Strategies
PDF
Non Relational Databases
PPTX
Array operations
PPTX
The Data Warehouse Lifecycle
PPTX
Copy of MongoDB .pptx
PPS
Oracle Database Overview
PDF
Data Migration Between MongoDB and Oracle
PPT
Introduction to MongoDB
PPTX
Key-Value NoSQL Database
PPTX
NoSql Data Management
PPTX
Introduction to NoSQL
PPTX
NoSQL databases
PPTX
Mongo DB Presentation
PPTX
Data models in NoSQL
6 Data Modeling for NoSQL 2/2
NOSQL vs SQL
MongoDB
Introduction to NoSQL Databases
Data-Ed: Data Warehousing Strategies
Non Relational Databases
Array operations
The Data Warehouse Lifecycle
Copy of MongoDB .pptx
Oracle Database Overview
Data Migration Between MongoDB and Oracle
Introduction to MongoDB
Key-Value NoSQL Database
NoSql Data Management
Introduction to NoSQL
NoSQL databases
Mongo DB Presentation
Data models in NoSQL
Ad

Similar to NoSQL databases (20)

PPTX
No SQL- The Future Of Data Storage
PPSX
A Seminar on NoSQL Databases.
PPTX
NOSQL IN BIGDATA FOR PG STUDENTS FOR COL
PPTX
NoSQL in Big Data Analytics Tools .pptx
PPTX
NoSQL databases - An introduction
DOCX
Report 1.0.docx
PDF
Business Intelligence & NoSQL Databases
PDF
NOSQL in big data is the not only structure langua.pdf
DOCX
Report 2.0.docx
PDF
NoSQL Databases Introduction - UTN 2013
PDF
Couchbase 3.0.2 d1
PPT
NoSQL Seminer
PPTX
NoSQL Basics and MongDB
PPTX
unit2-ppt1.pptx
PDF
Nosql part1 8th December
PPTX
2.Introduction to NOSQL (Core concepts).pptx
PPTX
Introduction to asdfghjkln b vfgh n v
DOCX
Unit II -BIG DATA ANALYTICS.docx
PDF
NOsql Presentation.pdf
PPTX
UNIT-2.pptx
No SQL- The Future Of Data Storage
A Seminar on NoSQL Databases.
NOSQL IN BIGDATA FOR PG STUDENTS FOR COL
NoSQL in Big Data Analytics Tools .pptx
NoSQL databases - An introduction
Report 1.0.docx
Business Intelligence & NoSQL Databases
NOSQL in big data is the not only structure langua.pdf
Report 2.0.docx
NoSQL Databases Introduction - UTN 2013
Couchbase 3.0.2 d1
NoSQL Seminer
NoSQL Basics and MongDB
unit2-ppt1.pptx
Nosql part1 8th December
2.Introduction to NOSQL (Core concepts).pptx
Introduction to asdfghjkln b vfgh n v
Unit II -BIG DATA ANALYTICS.docx
NOsql Presentation.pdf
UNIT-2.pptx
Ad

Recently uploaded (20)

PPTX
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
PPTX
A Complete Guide to Streamlining Business Processes
PDF
Business Analytics and business intelligence.pdf
PPTX
climate analysis of Dhaka ,Banglades.pptx
PPTX
CEE 2 REPORT G7.pptxbdbshjdgsgjgsjfiuhsd
PPTX
01_intro xxxxxxxxxxfffffffffffaaaaaaaaaaafg
PPTX
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
PPTX
(Ali Hamza) Roll No: (F24-BSCS-1103).pptx
PDF
Galatica Smart Energy Infrastructure Startup Pitch Deck
PPTX
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
PDF
Mega Projects Data Mega Projects Data
PPTX
Qualitative Qantitative and Mixed Methods.pptx
PDF
Transcultural that can help you someday.
PPTX
mbdjdhjjodule 5-1 rhfhhfjtjjhafbrhfnfbbfnb
PDF
Introduction to the R Programming Language
PDF
Introduction to Data Science and Data Analysis
PDF
REAL ILLUMINATI AGENT IN KAMPALA UGANDA CALL ON+256765750853/0705037305
PPTX
Data_Analytics_and_PowerBI_Presentation.pptx
PDF
Optimise Shopper Experiences with a Strong Data Estate.pdf
PDF
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
A Complete Guide to Streamlining Business Processes
Business Analytics and business intelligence.pdf
climate analysis of Dhaka ,Banglades.pptx
CEE 2 REPORT G7.pptxbdbshjdgsgjgsjfiuhsd
01_intro xxxxxxxxxxfffffffffffaaaaaaaaaaafg
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
(Ali Hamza) Roll No: (F24-BSCS-1103).pptx
Galatica Smart Energy Infrastructure Startup Pitch Deck
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
Mega Projects Data Mega Projects Data
Qualitative Qantitative and Mixed Methods.pptx
Transcultural that can help you someday.
mbdjdhjjodule 5-1 rhfhhfjtjjhafbrhfnfbbfnb
Introduction to the R Programming Language
Introduction to Data Science and Data Analysis
REAL ILLUMINATI AGENT IN KAMPALA UGANDA CALL ON+256765750853/0705037305
Data_Analytics_and_PowerBI_Presentation.pptx
Optimise Shopper Experiences with a Strong Data Estate.pdf
168300704-gasification-ppt.pdfhghhhsjsjhsuxush

NoSQL databases

  • 1. NoSQL Database: New Era of Databases for Big data Analytics - Classification, Characteristics and Comparison A B M Moniruzzaman and Syed Akhter Hossain 03/04/14 CSC 8710 1
  • 2. Contents • • • • • • • • • • NoSQL databases definition Why NoSQL databases? Characteristics of NoSQL Databases Primary Uses of NoSQL Database Key-Value databases Documents databases Column-Family databases Graph databases Adoption of NoSQL Database Conclusion 03/04/14 CSC 8710 2
  • 3. NoSQL Database • NoSQL for Not Only SQL, refers to an eclectic and increasingly familiar group of non-relational data management system • databases are not built primarily on tables, and generally don't use SQL for data manipulation. • NoSQL systems are distributed, non-relational database, designed for large-scale data storage and for massiveparallel data processing across a large number of commodity servers. 03/04/14 CSC 8710 3
  • 4. NoSQL Database • They also use non-SQL languages and mechanisms to interact with data. • NoSQL database systems arose alongside major Internet companies, such as Google, Amazon, and Facebook which had challenges in dealing with huge quantities of data • These systems are designed to scale thousands or millions of users doing updates as well as reads, in contrast to traditional DBMSs and data warehouses 03/04/14 CSC 8710 4
  • 5. Why NoSQL? • Relational DBMSs have been a successful technology for many years, providing persistence, concurrency control and integration mechanisms. • The need of processing large amount of data changes the direction from scaling vertically to scaling horizontally on clusters. 03/04/14 CSC 8710 5
  • 6. Why NoSQL? • NoSQL databases focus on analytical processing of large scale datasets, offering increased scalability over commodity hardware • Organizations that collect large amounts of unstructured data are increasingly turning to nonrelational databases (NoSQL databases). 03/04/14 CSC 8710 6
  • 8. Characteristics of NoSQL Databases • Strong Consistency: all clients see the same version of data. • High Availability: Data always available, at least one copy of the requested data even if one of the nodes is down. • Partition-tolerance: the total system keeps its characteristic even when being deployed on different servers 03/04/14 CSC 8710 8
  • 9. Characteristics of NoSQL Databases 03/04/14 CSC 8710 9
  • 10. Primary Uses of NoSQL Database 1. Large-scale data processing 2. Exploratory analytics on semi-structured data (expert level) 3. Large volume data storage. 03/04/14 CSC 8710 10
  • 11. Classification of NoSQL Databases • Key-Value databases • Documents databases • Column Family databases • Graphics databases 03/04/14 CSC 8710 11
  • 12. Key-Value Databases • These DMS store items as alpha-numeric identifiers that refer to the keys. Each key has associated values. • The values could be simple text strings or more complex lists and sets • Search only performed against keys, and limited to exact matches. • Search cannot be performed against values 03/04/14 CSC 8710 12
  • 14. Key-Value characterstics • The simplicity of Key-Value Store makes them very quick and light. • Highly scalable retrieval of the values needed for application tasks such as retrieving product names. • This is why Amazon use K-V system, Dynamo, in its shopping cart. Dynamo is a highly available key-value storage system. • Example: Dynamo (Amazon), Voldemort (LinkedIn) Redis, BerkeleyDB, Riak 03/04/14 CSC 8710 14
  • 15. Pros and Cons • pros: anything can be stored in an aggregate • cons: only key lookup to access the entire aggregate is allowed (no query and part of aggregate retrieval mechanisms) 03/04/14 CSC 8710 15
  • 16. Document Database • Designed to manage and store documents. • These documents are encoded in a standard data exchange format such as XML, JSON (Javascript Option Notation) or BSON (Binary JSON). 03/04/14 CSC 8710 16
  • 18. Primary Uses • Document databases are good for storing and managing Big Data-size collections of literal documents such as text documents, email messages. 03/04/14 CSC 8710 18
  • 19. Pros And Cons • pros: allow structured queries and partial aggregate retrieval based on the fields in the aggregate • cons: imposes a limit on what can be placed in a database 03/04/14 CSC 8710 19
  • 20. Column-Family Databases • It consists of a Key-Value pair where the value consists of set of columns. • The column family databases are represented in tables, each key-value pair being a row. • All the related data can be grouped as one family 03/04/14 CSC 8710 20
  • 21. Primary Uses 1. Large-scale, batch-oriented data processing: sorting, parsing, conversion : - conversions between hexadecimal, binary and decimal code values. 2. Exploratory and predictive analytics performed by expert statisticians and programmers. 03/04/14 CSC 8710 21
  • 23. Graph Databases • Graph databases replace relational tables with structured relational graphs of interconnected key-value pairings. • Graph databases are useful when you are more interested in relationships between data than the data itself and it works perfectly for the social network. • It is optimized for relationship traversing not for querying • Examples: Neo4j, InfoGrid, Sones GraphDB, AllegroGraph, InfiniteGraph 03/04/14 CSC 8710 23
  • 25. Adoption of NoSQL Database • Organizations that have massive data storage are looking seriously at NoSQL. • NoSQL Database expert are highly demanded for most of the developing organizations. • The next graph shows job trends of five NoSQL Databases from Indeed.com 03/04/14 CSC 8710 25
  • 26. Job Trends of Five NoSQL Databases 03/04/14 CSC 8710 26
  • 27. Adoption of NoSQL Database • MongoDB‘s growth means that it has cemented its place as the most popular NoSQL database. • According to LinkedIn profile mentions, The mentions of NoSQL technologies form 45% in LinkedIn profiles. 03/04/14 CSC 8710 27
  • 29. Conclusion • Computational and storage requirements of applications such as for Big Data analytics, Business Intelligence and social networking over peta-byte datasets led us to the change from SQL to NoSQL DBs. • This led to the development of horizontally scalable, distributed non-relational No-SQL databases. • MongoDB‘s is the most demanded one. 03/04/14 CSC 8710 29

Editor's Notes

  • #8: Enterprise Resourse planning: Customer Relationship management