About HBase

HBase is a NoSQL (Not Only SQL) database. It stores data as
rows in labeled tables; each row has a sortable key and an
arbitrary number of columns.
v HBase is linear and modular scaling
v It have automatic region server failover
v Tables are distributed on the cluster via
regions which supports automatic
sharding
v Hadoop/HDFS integration
v Supports MPP (massively parallelized
processing)
v Support Thrift and REST API
v Supports Block Cache and Bloom Filters
for high volume query optimization
v Provides build-in web pages for
operational insight
v Supports strongly consistent
reads/writes
v No joins supported as RDBMS
v Supports Tables with column family,
rows and columns
v All columns belong to column family
v Have table cells with intersection of row
and column coordinates and are
versioned {row, column, version}
v You can run get command to select row,
put to insert or update a row, scan to do
a loop for multiple rows and Delete to
delete record
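
The data model and the four commands above can be sketched with a toy in-memory table. This is only an illustration of versioned cells addressed by {row, column, version} and of rows kept in sorted key order; it is not the HBase client API. The class name `ToyTable`, its method signatures and the use of a simple counter in place of timestamps are all assumptions made for the example.

```python
import itertools


class ToyTable:
    """In-memory stand-in for an HBase-style table (hypothetical, not the HBase API)."""

    def __init__(self, families):
        self.families = set(families)
        # row key -> "family:qualifier" -> {version: value}
        self.rows = {}
        # A counter stands in for the timestamps a real store would assign.
        self._clock = itertools.count(1)

    def put(self, row, column, value):
        """Insert or update a cell; every write creates a new version."""
        family = column.split(":", 1)[0]
        if family not in self.families:
            raise KeyError(f"unknown column family: {family}")
        version = next(self._clock)
        self.rows.setdefault(row, {}).setdefault(column, {})[version] = value
        return version

    def get(self, row, column=None, version=None):
        """Return one cell (latest version by default) or the whole row."""
        cells = self.rows.get(row, {})
        if column is None:
            return {c: v[max(v)] for c, v in cells.items()}
        versions = cells.get(column, {})
        if not versions:
            return None
        key = max(versions) if version is None else version
        return versions.get(key)

    def scan(self, start=None, stop=None):
        """Yield rows in sorted key order within [start, stop)."""
        for row in sorted(self.rows):
            if start is not None and row < start:
                continue
            if stop is not None and row >= stop:
                break
            yield row, self.get(row)

    def delete(self, row):
        """Remove a row and all of its cells."""
        self.rows.pop(row, None)


table = ToyTable(families=["info"])
first = table.put("user#002", "info:name", "Ada")
table.put("user#001", "info:name", "Grace")
table.put("user#002", "info:name", "Ada L.")

latest = table.get("user#002", "info:name")
older = table.get("user#002", "info:name", version=first)
scanned_keys = [row for row, _ in table.scan()]
table.delete("user#001")
remaining_keys = [row for row, _ in table.scan()]
```

Here `latest` is the most recent write, `older` is the value stored under the first version, and `scan` returns rows in key order regardless of insertion order.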
By: Milind Zodge
v If you have millions or billions of rows,
then HBase is a good candidate
v You do not need advanced query
language SQL and can leave without
secondary indexes and typed columns
v You have enough hardware available as
even HDFS doesn’t do well with
anything less than 5 Data Node so you
will at least need 5 nodes cluster
v If your application has a variable
schema where each row is slightly
different
v If you data I stored in collections
v If you need key based access to data
when storing or retrieving
About HBase When to use HBase
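
To make the variable-schema and key-based access points concrete, here is a minimal sketch in plain Python. It models rows as sparse dictionaries looked up by row key, plus a prefix scan over sorted keys. The row keys, the `d:` column names and the helper functions `get_row` and `prefix_scan` are hypothetical and only illustrate the access pattern; they are not HBase calls.

```python
import bisect

# Hypothetical rows: each one carries only the columns it needs,
# so no two rows have to share the same set of columns.
rows = {
    "order#1001": {"d:total": "25.00", "d:coupon": "SPRING"},
    "order#1002": {"d:total": "9.50"},
    "user#42": {"d:name": "Ada", "d:email": "ada@example.com"},
}

# Keys are kept sorted so that related rows sit next to each other.
sorted_keys = sorted(rows)


def get_row(key):
    """Point lookup by row key; returns None when the key is absent."""
    return rows.get(key)


def prefix_scan(prefix):
    """Return all row keys that start with the given prefix, in key order."""
    start = bisect.bisect_left(sorted_keys, prefix)
    matches = []
    for key in sorted_keys[start:]:
        if not key.startswith(prefix):
            break
        matches.append(key)
    return matches


order_keys = prefix_scan("order#")
single = get_row("order#1002")
missing = get_row("order#9999")
```

Every read goes through the row key, either as an exact lookup or as a range over a key prefix. An application that instead needs to filter on arbitrary column values would be relying on the secondary indexes the list above says you must be able to live without.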


H base one page
