Quick Understanding of NoSQL

         Edward J. Yoon
           @ Oracle
Agenda
• Why did Big Data solutions appear?
  – What was the problem?
• What’s HBase?
• NoSQL Trends
  – NoSQL Comparison
Managing user profiles

 User-ID, …
Managing user profiles

 Let’s assume that most of the million-plus users log in between 9 and 10 AM,
 and that we want to update each user’s last login time.
Managing user profiles

 User-ID, …

 ! 1 to 3 million records.
 Max connections, hardware limits, query performance on …, etc.
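To give a rough sense of the load, the sketch below turns the 1 to 3 million records into an average write rate. It assumes the updates are spread evenly over the one-hour login window (9 to 10 AM); real traffic would likely peak higher, and the function name is illustrative only.

```python
# Back-of-the-envelope write rate for the morning login burst.
# Assumption: updates are spread evenly across the one-hour window.
WINDOW_SECONDS = 60 * 60  # 9 AM to 10 AM


def updates_per_second(records: int, window_seconds: int = WINDOW_SECONDS) -> float:
    """Average number of last-login updates per second."""
    return records / window_seconds


for records in (1_000_000, 3_000_000):
    rate = updates_per_second(records)
    print(f"{records:>9,} records -> about {rate:.0f} updates/s on average")
```

Each of these updates also holds a connection for a short time, which is where the max-connections limit starts to matter.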
Managing user profiles

 Solution!? Manual Range/Hash Partitioning
 [Diagram: a second User-ID, … table appears beside the first]
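A minimal sketch of what manual range and hash partitioning by User-ID could look like in application code. The shard names and split points are hypothetical, chosen only for illustration; the slides do not specify a scheme.

```python
import hashlib
from bisect import bisect_right

# Hypothetical shard names; a real deployment would map these to database hosts.
SHARDS = ["db0", "db1", "db2"]

# Hypothetical split points for range partitioning:
# keys < "h" go to db0, "h" <= key < "p" go to db1, everything else to db2.
RANGE_BOUNDS = ["h", "p"]


def hash_shard(user_id: str, shards=SHARDS) -> str:
    """Hash partitioning: a stable hash of the key, modulo the shard count."""
    digest = hashlib.md5(user_id.encode("utf-8")).hexdigest()
    return shards[int(digest, 16) % len(shards)]


def range_shard(user_id: str, bounds=RANGE_BOUNDS, shards=SHARDS) -> str:
    """Range partitioning: binary-search the sorted split points."""
    return shards[bisect_right(bounds, user_id.lower())]


for uid in ("alice", "mallory", "zoe"):
    print(uid, "-> range:", range_shard(uid), "| hash:", hash_shard(uid))
```

Range partitioning keeps neighbouring keys together, which helps scans but can create hot spots; hash partitioning spreads keys evenly but scatters ranges. Either way, the routing logic lives in the application.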
Managing user profiles

 Solution!? Manual Range/Hash Partitioning

 What happens if the schema has to change, or if we need to add nodes?
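One way to see the cost of adding nodes: with a simple "hash modulo shard count" scheme (an assumption here, not something the slides prescribe), changing the shard count reassigns most keys, so most rows would have to be migrated. The sketch below measures that on synthetic user IDs.

```python
import hashlib


def shard_index(user_id: str, shard_count: int) -> int:
    """Stable hash of the key, modulo the number of shards."""
    digest = hashlib.md5(user_id.encode("utf-8")).hexdigest()
    return int(digest, 16) % shard_count


def moved_fraction(user_ids, old_count: int, new_count: int) -> float:
    """Fraction of keys whose shard changes when the shard count changes."""
    moved = sum(
        1 for uid in user_ids
        if shard_index(uid, old_count) != shard_index(uid, new_count)
    )
    return moved / len(user_ids)


# Synthetic IDs, made up for the illustration.
user_ids = [f"user-{i}" for i in range(100_000)]
print(f"3 -> 4 shards: {moved_fraction(user_ids, 3, 4):.0%} of keys change shard")
```

With a uniform hash, a key stays put only when its hash gives the same remainder for 3 and for 4, which is about a quarter of the time. So roughly three quarters of the rows move for a single added node.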
NoSQL
• Schema-free
• Horizontal Scalability
  – Distributed Storage
• Distributed computing, e.g., MapReduce (M/R)
• Connection Pool
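As a rough illustration of two of the points above, the sketch below keeps schema-free records (each profile may carry different fields) and counts logins per hour with a tiny in-process map/reduce. The sample records and function names are made up for the example; a real system would run the map and reduce phases across many nodes.

```python
from collections import defaultdict

# Schema-free: records in one collection need not share the same fields.
# Sample data is invented for illustration.
profiles = [
    {"user_id": "u1", "last_login": "2013-01-07T09:05", "nickname": "ed"},
    {"user_id": "u2", "last_login": "2013-01-07T09:40"},
    {"user_id": "u3", "last_login": "2013-01-07T10:15", "locale": "ko"},
]


def map_phase(record):
    """Emit (login hour, 1) for one profile."""
    hour = record["last_login"][11:13]
    yield hour, 1


def reduce_phase(key, values):
    """Sum the counts emitted for one key."""
    return key, sum(values)


def map_reduce(records):
    """Run map, group the emitted pairs by key (shuffle), then reduce."""
    groups = defaultdict(list)
    for record in records:
        for key, value in map_phase(record):
            groups[key].append(value)
    return dict(reduce_phase(key, values) for key, values in groups.items())


print(map_reduce(profiles))  # {'09': 2, '10': 1}
```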
NoSQL trends
Comparison

MongoDB (V2.2)
• Written in: C++
• License: AGPL
• Supported languages: All
• Architecture: Master/slave, JSON
• Queries: JavaScript expressions, MapReduce
• Known issues: None
• Suitable for: Online applications

Cassandra (V1.2)
• Written in: Java
• License: Apache
• Supported languages: All
• Architecture: Peer-to-peer (masterless), Columnar
• Queries: Hive (MapReduce)
• BigTable clone
• Suitable for: Ad-hoc analytics, column scanning

HBase (V0.92.0)
• Written in: Java
• License: Apache
• Supported languages: All
• Architecture: Master/slave, Columnar
• Queries: Hive (MapReduce)
• BigTable clone
• Suitable for: Ad-hoc analytics, column scanning

Oracle NoSQL
• Written in: Java
• Supported languages: Java
• Architecture: Master/slave, ACID
• Suitable for: Online applications
• Synchronous replication
