Facebook - Jonthan Gray - Hadoop World 2010

HBase at Facebook
Jonathan Gray
Hadoop World
October 12, 2010

1 Data at Facebook
2 HBase Development
3 Future Work
Agenda

500 Million active monthly users
>500 Billion page views per month
25 Billion pieces of content per month

Cache
OS Web server Database Language
Data analysis

Daily Federated MySQLSilver Cluster
Platinum
Hadoop
Cluster
5-15 minutes5-15 minutes
Cluster 1 Cluster 2
Scribe/Hadoop
cluster
Scribe/Hadoop
cluster

HBase
Key Properties
▪ Linearly scalable
▪ Fast indexed writes
▪ Tight integration with Hadoop
Bridges gap between online and ofﬂine

Use Case #1
Incremental Updates into Data Warehouse
▪ Currently
▪ Nightly dumps of UDBs into Warehouse
▪ With HBase
▪ Tail UDB replication logs into HBase
UDB to Warehouse in minutes

Use Case #2
High Frequency Counters and Realtime Analytics
▪ Currently
▪ Scribe to HDFS, periodically aggregate to UDB
▪ With HBase
▪ Scribe to HBase, read in realtime with API or MR
Storage, serving, and analysis in one

Use Case #3
User-facing Database for Write Intensive Workloads
▪ Currently
▪ Constantly expanding UDB and Memcache tiers
▪ With HBase
▪ Fast writes, automatic partitioning, linear scaling
Fast and scalable writes, just add nodes

Hive Integration
HBase and Hive
▪ HBase Tables usable as Hive Tables
▪ ETL data target
▪ Query data source
▪ Support for different read/write patterns
▪ API random write or MR bulk load
▪ API random read or MR table scan

HBase Master
Re-architected for HA and Testability
▪ Increased usage of ZooKeeper for failover
▪ Region transitions in ZK
▪ Working master failover in all cases
▪ Refactor/Redesign of major components
▪ Load balancer, cluster startup, failover redesigned
▪ Emphasis on independent testability

Random Read Optimizations
Performance degrades with lots of files
▪ Bloom filters
▪ Dynamic Row or Row+Column as HFile metadata
▪ Skip files on disk that don’t match
▪ Timestamp ranges
▪ Stored as HFile metadata
▪ Skip files on disk that don’t cover time range

Random Read Optimizations
Performance degrades with wide rows
▪ Aggressively seek/reseek
▪ Use query and block index to skip blocks
▪ Stop processing as soon as we ﬁnish query
▪ Expose seeking to Filter API
▪ Allow specialized optimizations
▪ Millions of versions in a row, grab 10

Administration Tools
Detect and repair potential issues
▪ HBCK
▪ FSCK for HBase
▪ Detect and repair cluster issues
▪ Cluster Veriﬁcation
▪ Ensure cluster can be written to, read from
▪ Tables can be created/disabled/dropped

Hadoop Improvements
HDFS Appends
▪ Hadoop 0.20
▪ Widely deployed but no support for appends
▪ Hadoop 0.20 with append support
▪ Apache Hadoop 0.20-Append
▪ Cloudera’s CDH version 3
▪ Facebook’s version of Hadoop 0.20
▪ http://github.com/facebook/hadoop-20-append

Hadoop Improvements
HDFS rolling upgrades and NameNode HA
▪ HDFS in online application
▪ Need to support upgrades without downtime
▪ More sensitive to NameNode SPOF
▪ Hadoop AvatarNode
▪ Hot standby pair of NameNodes
▪ Failover to new version of NameNode
▪ Failover to hot standby in seconds under failure

Coming Soon
New Features
▪ East coast / west coast replication
▪ Asynchronous replication between data centers
▪ Faster recovery
▪ Distributed log splitting
▪ Master controlled rolling restart
▪ Fast and retaining assignment information

Coprocessors
Complex server-side operations
▪ Dynamically loaded server-side logic
▪ Hook into read/write and cluster operations
▪ Endless possibilities
▪ Server-side merges and joins
▪ Lightweight MapReduce for aggregation
▪ Efﬁcient secondary indexing

Intelligent Load Balancing
Complex notion of load
▪ Currently based only on region count
▪ Different regions have different access patterns
▪ And data locality equally important
▪ Next generation load balancing algorithms
▪ Consider complex notion of read load / write load
▪ And HDFS block locations for locality
▪ Retain assignment information between restarts

Other Future Work
Cluster Performance
▪ Quality of service
▪ One MapReduce job can take down cluster
▪ Dynamic conﬁguration changes
▪ Change important parameters on running cluster
▪ HDFS performance
▪ Critical target for long-term HBase performance

Facebook - Jonthan Gray - Hadoop World 2010

More Related Content

What's hot (20)

Viewers also liked (20)

Similar to Facebook - Jonthan Gray - Hadoop World 2010 (20)

More from Cloudera, Inc. (20)

Recently uploaded (20)

Facebook - Jonthan Gray - Hadoop World 2010