CC 2.0 by Bryan Pearson | http://flic.kr/p/RUfEt
11. April 2013

HBase Introduction

About HBase

HBase is an open source, distributed, column-oriented data store modeled after Google’s BigTable.

Data Model

•  Sorted map data store
•  A table consists of rows; each row has a row key (primary key)
•  Each row may have any number of columns (Map<byte[], byte[]>)
•  Rows are sorted lexicographically by row key
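
The sorted-map idea can be sketched in a few lines of Python. This is a toy model under stated assumptions, not the HBase API: the row keys, the `table` dict, and the `scan` helper are hypothetical names chosen for illustration. It shows that row keys are compared byte by byte, so "row-10" sorts before "row-2".

```python
# Toy model only (not the HBase API): row key (bytes) -> column map (bytes -> bytes).
# The row keys and values below are made up for illustration.
table = {
    b"row-10": {b"info:state": b"ZH"},
    b"row-2": {b"info:state": b"BE"},
    b"row-1": {b"info:state": b"TG"},
}


def scan(table, start=b"", stop=None):
    """Yield (row_key, columns) in lexicographic byte order.

    start is inclusive, stop is exclusive; stop=None means "until the end".
    """
    for key in sorted(table):
        if key < start:
            continue
        if stop is not None and key >= stop:
            break
        yield key, table[key]


# Byte-wise comparison, not numeric: b"row-10" comes before b"row-2".
row_order = [key for key, _ in scan(table)]
partial = [key for key, _ in scan(table, start=b"row-1", stop=b"row-2")]
```

Because the order is byte-wise, numeric parts of a row key are usually zero-padded (for example "row-02") when numeric order matters.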

Sorted Map (Logical View)

•  Different types of data are separated into different “column families”
•  Data is all byte[]
•  Different rows may have different sets of columns (table is sparse)
•  A single cell may have different values at different timestamps

Row key     Data
amuller     info:  { ‘height’: ‘2.0m’, ‘state’: ‘ZH’ }
            roles: { ‘IBM’: ‘Sales Manager’ }
cguegi      info:  { ‘height’: ‘1.85m’, ‘state’: ‘BE’ }
            roles: { ‘Sentric’: ‘Architect’@ts=2011,
                     ‘Sentric’: ‘Mentor’@ts=2012,
                     ‘SBDUG’: ‘Founder’ }
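
One way to picture the logical view is as nested maps: row key, then column family, then column, then timestamp. The sketch below is a plain-Python model, not the HBase client API. The `get` helper and `PLACEHOLDER_TS` are hypothetical, and the placeholder timestamp is an assumption used wherever the slide gives no timestamp.

```python
# Toy model only: row key -> column family -> column -> {timestamp: value}.
# Timestamps 2011 and 2012 come from the slide; PLACEHOLDER_TS is an assumption
# for cells where the slide shows no timestamp.
PLACEHOLDER_TS = 0

table = {
    b"amuller": {
        b"roles": {b"IBM": {PLACEHOLDER_TS: b"Sales Manager"}},
    },
    b"cguegi": {
        b"roles": {
            b"Sentric": {2011: b"Architect", 2012: b"Mentor"},
            b"SBDUG": {PLACEHOLDER_TS: b"Founder"},
        },
    },
}


def get(table, row, family, column, max_versions=1):
    """Return up to max_versions (timestamp, value) pairs, newest first."""
    versions = table.get(row, {}).get(family, {}).get(column, {})
    newest_first = sorted(versions.items(), reverse=True)
    return newest_first[:max_versions]


latest = get(table, b"cguegi", b"roles", b"Sentric")
history = get(table, b"cguegi", b"roles", b"Sentric", max_versions=2)
# Sparse table: amuller simply has no roles:Sentric cell.
missing = get(table, b"amuller", b"roles", b"Sentric")
```

A missing column costs nothing in this model: it is an absent key rather than a stored null, which is what "sparse" means here.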

Sorted Map (Physical View)

Sorted on disk by row key, column key, descending ts. Timestamps are Unix timestamps.

info Column Family
Row key   Column key      Timestamp      Value
amuller   info:height     1333883187     2.0m
amuller   info:state      1273871824     ZH
cguegi    info:height     1325755229     1.85m
cguegi    info:state      1325751049     TG

roles Column Family
Row key   Column key      Timestamp      Value
amuller   roles:IBM       1320105636     Developer
cguegi    roles:SBDUG     1330561785     Founder
cguegi    roles:Sentric   1325376723     Mentor
cguegi    roles:Sentric   1293840959     Architect
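
The on-disk ordering (row key, column key, descending timestamp) can be expressed as a sort key. This is a minimal sketch over the roles column family rows from the slide, deliberately shuffled first. The tuple layout and the `sort_key` name are assumptions for illustration, not HBase's actual storage format.

```python
# Cells from the roles column family, in arbitrary order.
# Tuple layout (row key, column key, timestamp, value) is an illustrative assumption.
cells = [
    (b"cguegi", b"roles:Sentric", 1293840959, b"Architect"),
    (b"amuller", b"roles:IBM", 1320105636, b"Developer"),
    (b"cguegi", b"roles:Sentric", 1325376723, b"Mentor"),
    (b"cguegi", b"roles:SBDUG", 1330561785, b"Founder"),
]


def sort_key(cell):
    """Order by row key, then column key, then newest timestamp first."""
    row, column, timestamp, _value = cell
    return (row, column, -timestamp)


sorted_cells = sorted(cells, key=sort_key)
values_in_order = [value for _row, _column, _ts, value in sorted_cells]
```

Negating the timestamp puts the newest version of a cell first, so a reader that wants only the current value can stop at the first match for a given row and column.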

HBase Architecture

[Figure: HBase architecture diagram showing the API, the Master, a RegionServer (HFile, Memstore, Write-Ahead Log), HDFS, and ZooKeeper. Source: HBase: The Definitive Guide]

HBase vs other “NoSQL”

•  Favors Consistency over Availability
•  Great Hadoop integration
•  Ordered range partitions
•  Automatically shards/scales
•  Sparse column storage
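
"Ordered range partitions" means each partition (a region, in HBase terms) holds a contiguous range of row keys. A rough sketch of how a row key could be mapped to its range follows. The split points, region names, and `locate_region` helper are all hypothetical, and a real cluster resolves this through its own metadata rather than a hard-coded list.

```python
import bisect

# Hypothetical split points: each region covers row keys from its start key
# (inclusive) up to the next region's start key (exclusive).
region_start_keys = [b"", b"g", b"p"]
region_names = ["region-1", "region-2", "region-3"]


def locate_region(row_key):
    """Return the name of the region whose key range contains row_key."""
    index = bisect.bisect_right(region_start_keys, row_key) - 1
    return region_names[index]


first = locate_region(b"amuller")
boundary = locate_region(b"g")
last = locate_region(b"zed")
```

Because the ranges are ordered, a scan over adjacent row keys touches one region or a few neighbouring ones, which is the practical difference from hash partitioning.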

CC 2.0 by Aurelien Guichard | http://flic.kr/p/cjg9yw

Resources

•  http://hbase.apache.org
•  http://www.sentric.ch
•  http://bigdata-usergroup.ch
•  http://about.me/cguegi

Database Landscape Map

[Figure: database landscape map]
Source: http://blogs.the451group.com/information_management/2013/02/04/updated-database-lanscape-map-february-2013/


Apache HBase: Introduction to a column-oriented data store
