You’ve got HBase
How AOL Mail handles Big Data


Presented at HBaseCon
May 22, 2012
The AOL Mail System
Over 15 years old
Constantly evolving
10,000+ hosts
70+ Million mailboxes
50+ Billion emails
A technology stack that runs the gamut




                                           Presented at
                                         HBaseCon 2012
                                                Page 2
What that means…
Lots of data
Lots of moving parts
Tight SLAs
Mature system + Young software = Tough marriage
 We don’t buy “commodity” hardware
 Ingrained Dev/QA/Prod product lifecycle
 Somewhat “version locked” to tried-and-true platforms
 Expect service outages to be quickly mitigated by our NOC w/out waiting for an on-call




So where does HBase fit?
It’s a component, not the foundation
Currently used in two places
Being evaluated for more
 It will remain a tool in our diverse Big Data arsenal




An Activity Profiler
An “Activity Profiler”
Watches for particular behaviors
Designed and built in 6/2010
Originally “vanilla” Hadoop 0.20.2 + HBase 0.90.2
Currently CDH3
1.4+ Million Events/min
60x 24TB (raw) DataNodes w/ local RegionServers
15x application hosts
Is an internal-only tool
 Used by automated anti-abuse systems
 Leveraged by data analysts for ad-hoc queries/MapRed

An “Activity Profiler”




Why the “Event Catcher” layer?
Has to “speak the language” of our existing systems
 Easy to plug an HBase translator into existing data feeds
 Hard to modify the infrastructure to speak HBase

Flume was too young at the time
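A toy sketch of what such a translator layer might look like. The wire format and field names below are invented for illustration (the deck doesn't describe AOL's actual protocols): decode one line of a legacy feed into a normalized record and append it to a local log for later batch import.

```python
# Hypothetical "Event Catcher" translator: speak the legacy protocol on
# one side, write normalized records on the other. The pipe-delimited
# format and field names here are made up for the example.
import json

def decode_legacy_line(line):
    """Translate one line of a made-up legacy feed into a dict."""
    # e.g. "LOGIN|user123|1337680000|10.0.0.1"
    action, account, ts, source = line.strip().split("|")
    return {"action": action, "account": account,
            "timestamp": int(ts), "source": source}

def catch_events(lines, log):
    """Decode a stream of legacy lines and append them as JSON to `log`."""
    for line in lines:
        log.write(json.dumps(decode_legacy_line(line)) + "\n")
```

The point of the design is that only this thin layer needs to understand both worlds; the upstream systems never have to speak HBase.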




Why batch load via MapRed?
Real time is not currently a requirement
Allows filtering at different points
Allows us to “trigger” events
 Designed before coprocessors

Early data integrity issues necessitated “replaying”
 Missing append support early on
 Holes in the Meta table
 Long splits and GC pauses caused client timeouts

Can sample data into a “sandbox” for job development
Makes Pig, Hive, and other MapRed jobs easy and stable
 We keep the raw data around as well
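Two of the ideas above — filtering at load time and sampling into a development "sandbox" while the raw feed is kept for replay — can be sketched roughly like this (not AOL's actual job; all names and rates are hypothetical):

```python
# Illustrative batch-load front end: apply a load-time filter and sample
# a fraction of raw events into a sandbox set for job development. The
# raw feed itself is retained elsewhere so the import can be replayed.
import random

def batch_load(raw_events, keep, sandbox_rate=0.01, rng=random.random):
    """Split raw events into rows to load and a sampled sandbox set.

    `keep` is a predicate implementing the load-time filter.
    """
    to_load, sandbox = [], []
    for event in raw_events:
        if rng() < sandbox_rate:
            sandbox.append(event)   # copy for job development
        if keep(event):
            to_load.append(event)   # destined for HBase via MapRed
    return to_load, sandbox
```

Because the loader is a batch job over retained raw data, a bad run (or an early data-integrity bug of the kind listed above) can be fixed by simply replaying the import.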

HBase and MapRed can live in harmony
Bigger than “average” hardware
 36+GB RAM
 8+ cores

Proper system tuning is essential
 Good information on tuning Hadoop is prolific, but…
   XFS > EXT
   JBOD > RAID
 As far as HBase is concerned…
   Just go buy Lars’ book

Careful job development, optimization is key!



Contact History API
Contact History API
Services a member-facing API
Designed and built in 10/2010
Modeled after the previous application
 Built by a different Engineering team
 Used to solve a very different problem

250K+ Inserts/min
3+ Million Inserts/min during MapRed
20x 24TB (raw) DataNodes w/ local RegionServers
14x application hosts
Leverages Memcached to reduce query load on HBase
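The Memcached layer is a classic cache-aside read path: check the cache first and only fall through to HBase on a miss. A minimal sketch, with `cache` and `store` as stand-ins for real clients (e.g. a python-memcached client and an HBase Thrift/REST client):

```python
# Cache-aside read path: Memcached absorbs repeat queries so HBase only
# sees cold reads. `cache` and `store` are simple stand-in objects here.

def get_contact_history(key, cache, store, ttl=300):
    row = cache.get(key)
    if row is not None:
        return row                  # cache hit: HBase never queried
    row = store.get(key)            # cache miss: read from HBase
    if row is not None:
        cache.set(key, row, ttl)    # populate for subsequent readers
    return row
```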
Contact History API




Where we go from here
Amusing mistakes to learn from
Exploding regions
 Batch inserts via MapRed result in fast, symmetrical key space growth
 Attempting to split every region at the same time is a bad idea
 Turning off region splitting and using a custom “rolling region splitter” is a good idea
 Take time and load into consideration when selecting regions to split
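The selection logic of such a "rolling region splitter" might look roughly like the sketch below (the region records and thresholds are invented; the deck doesn't show the actual tool). The key property is that it splits at most one region per pass, weighing both size and current load, rather than letting every region split at once:

```python
# Hedged sketch of rolling-region-splitter candidate selection: with
# automatic splitting disabled, pick the single largest oversized region
# that is not currently too busy to split.

def pick_region_to_split(regions, max_size, busy_load):
    """Return the name of the best split candidate, or None.

    `regions` is an iterable of (name, size_bytes, requests_per_sec).
    Returning at most one candidate per pass avoids the "exploding
    regions" failure of splitting everything simultaneously.
    """
    candidates = [r for r in regions
                  if r[1] > max_size and r[2] < busy_load]
    if not candidates:
        return None
    return max(candidates, key=lambda r: r[1])[0]
```

In a real deployment the caller would issue the actual split (e.g. via the HBase admin API), wait for it to complete, and repeat.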

Backups, backups, backups!
 You can never have too many

Large, non-splittable regions tell you things
 Our key space maps to accounts
 Excessively large keys equal excessively “active” accounts




Next-generation model




Thanks!






Editor's Notes

  • #2: Introduce myself: I am Chris Niemira, a Systems Administrator with AOL. I run a number of Hadoop and HBase clusters, along with numerous other components of the AOL Mail system. I spend my days doing work that ranges from system patches, code installs and troubleshooting, to capacity planning, performance and bottleneck analysis, and kernel tuning. I do a little engineering, a little design work, an on-call rotation, and every once in a while I get to play with Hadoop/HBase.
  • #3: The AOL Mail System has been around for a long time, and went through a major re-architecture between 2010 and 2011. It's not a 15-year-old code base, and we evolve it constantly. We service over 70 million mailboxes in the AOL Mail environment today. That includes supporting our paying members in addition to free accounts. Of course, member experience is our #1 priority. We have all kinds of tools in our proverbial utility belt, as we believe in using the right tool for the right job.
  • #4: It means we’re reasonably large. But we’ve also been operating “at scale” for a long time now. While we have been doing “Big Data” for a lot of years now, we got to our current size by operating a certain way: Rigid quality and change controls, lots of documentation, emphasis on uptime. As we have shifted toward being more agile, we have had to be careful with unproven technologies. HBase, for all the buzz, is still pretty young and error-prone. Some of the realities for dealing with a production Hadoop/HBase system would seemingly require a departure from our traditional mentality. Like everyone, we require stability and robustness of our production applications, but our way of getting there has had to change. Above all, however, we must still take care of our customers, so it’s a balancing act for us.
  • #5: So HBase is one of the tools we've added to our kit in the last few years that's still proving itself. We've got two applications running, and we've identified a few other places where it's a good candidate. This isn't to say that we are not using it for important things, but it's not at the core of our system. We've managed to build a relatively stable platform over time. There's a lot of scripted recovery and a lot of proactive monitoring in our environment, and for the most part, when there are problems, they are mitigated or resolved without even involving an admin.
  • #7: AOL Mail first started looking into Hadoop and HBase back in mid-2010. Other business units in our company had been working with Hadoop for a while before then, and a little intra-company consulting convinced us to give HBase a try. This system is one component of our anti-abuse strategy. I can't reveal exactly what it does, but I can tell you a bit about how the HBase stuff happens. In addition to the 60-node cluster and the application servers, there's the ancillary junk, which includes NameNodes (2x), HMasters (2x), and Zookeepers (3x). The app hosts and Zookeepers, which are currently physicals, are being switched to virtual devices in our internal cloud.
  • #8: This is what the application looks like. The “Service Layer” comprises various components within the AOL Mail system. They speak their own protocols and send messages to an “Event Catcher” which decodes the stream and writes a log to local disk. That log is imported into Hadoop (and can optionally be sampled to a development sandbox at the same time) and then further cooked via MapRed, which ultimately outputs rows into HBase and can send triggers to external applications. One thing we can do at this point (not illustrated) is populate a memcache, which may be used by client apps to reduce load on some HBase queries.
  • #10: The real answer is that when we first started, we couldn't make streaming a million and a half rows a minute work with the HBase we had two years ago. At the time, it was easier for us to build the batch loader, which has proven to have a few interesting advantages. Our next-generation model will rely on HBase itself being more stable, and will heavily leverage coprocessors to do a lot of what we're doing now with MapReduce.
  • #11: A big obstacle for us is getting MapReduce and HBase to play nicely together. From what I've seen, bigger hardware is starting to become more popular for running HBase, and we believe it's essential. We've floated between an 8–16 GB heap for the RegionServer; for this application, I believe we're currently using 16. Getting GC tuning and the IPC timeouts in HBase/Zookeeper correct is critically important. System tuning is also very important. Depending on which flavor of Linux you're running, the stock configuration may be completely inappropriate for the needs of an HBase/Hadoop complex. In particular, look at the kernel's IO scheduler and VM settings.
  • #13: This application was built a short while after we started our trial-by-fire with HBase on the previous application. It was a different development team with input from the engineers working on the previously discussed application. This application has the same “event catcher” layer for the same reasons, but it has always written directly to HBase. We import data into a “raw” table and then process that table with MapReduce writing the output into a “cooked” table. There’s a much lower number of events here, but it spikes up significantly during the MapReduce phase. It’s exactly the same class of hardware with the same ancillary junk as the previous app. Most of the query load is actually farmed out of memcache.
  • #14: Yes, this is a relatively straight-forward design.
  • #16: Exploding tables might be a better name for this, since it's an across-the-board sort of thing. Backups, of course, are obvious. We've run into three catastrophic data loss events — once each with three different clusters. The first was during a burn-in phase for the Contact History application I described earlier. At that time, the data it had accumulated over the week or so that it had been running wasn't considered essential, so we were able to truncate and move along. Another time, on a separate plain Hadoop cluster, an unintentionally malicious user actually managed to delete my backups and corrupt the NameNode's event log. Luckily that data was restorable from another source. The last time was with the Activity Profiler application. Basically, having data backups saved the day.
  • #17: This is our working model for a next-generation HBase system. It is currently being prototyped with the cooperation of our Engineering and Operations teams. The key design concept is to allow for a great deal of flexibility and re-use, and it centers around the idea of installing a fairly dynamic rules engine at both the event collection and event storage layers. Hopefully we will be presenting it soon.