HBaseCon 2012 | Overcoming Data Deluge with HBase to Help Save the Environment - OPower

HBase to Save the Planet

Alex Newman
posix4e@apache.org
Architect, Drawn to Scale
Strategic Advisor, Opower

My life with HBase

Drawn to
Factset Cloudera Opower
Scale

About Opower

Opower is a customer engagement
platform for the utility industry

About Opower

Home energy reports
Customized utility bills
Energy efficiency programs for utilities

About Opower

Opower runs on analytics
Analytics run on Hadoop + HBase

Opower analysis relies on data
from a variety of sources

» Electric Utility Usage » Thermostat » Weather » Gas Utility Usage
Data data data Data

Data Storage & 4
Shared Energy
Processing Signature
Repository
3 1 2

Disaggregation OPOWER
Algorithms Platform

Opower’s first architecture could
not support their analytic vision
MySQL
Scalability?
Performance?
Data integration?

Opower’s first architecture could
not support their analytic vision
Analytic workflow instead of
analytic apps:
SQL -> CSV -> R -> too little, too slow

Problem #1
Data Lake Cost

Usage AMI Regional AMI Sensor Data Data Lake

Problem #2
Slower and slower queries
Smart-grid-scale data
Lots of supporting data: weather, demographics, etc.

Problem #3
It was taking lots of “magic”
Intense analytics
Strange schemas
Segmented queries

Hadoop + HBase at Opower

Opower determined that they needed
an entirely new data architecture

Hadoop + HBase at Opower
Early success:
HBase AMI

What rocked

Endless, cheap scalability

What rocked

The analytics team loved it!

What sucked

Hard on the ops team – still trying to
grok it

What sucked
NoSchema p1.
Creating Schema
Managing MetaData
Schema <=> Performance

What sucked

HA
Failover
Snapshots

What sucked
No secondary index
Aggregation is slow (Rollup/OLAP)
Poor Client Performance

It would be better if only …

Developers were not forced to know
how the data is stored, indexed, etc.


There were nicer APIs and better
query languages (SQL?)


Version migrations were easy
Hierarchical Tables


Real-time tuning


Did I mention HA?

In summary

HBase has helped Opower achieve their analytic
vision
But they’ve still got a long way to go
HBase still has a long way to go

Questions?

Alex Newman
posix4e@apache.org
Architect, Drawn to Scale
Strategic Advisor, Opower

HBaseCon 2012 | Overcoming Data Deluge with HBase to Help Save the Environment - OPower

More Related Content

What's hot (20)

Viewers also liked (20)

Similar to HBaseCon 2012 | Overcoming Data Deluge with HBase to Help Save the Environment - OPower (20)

More from Cloudera, Inc. (20)

Recently uploaded (20)

HBaseCon 2012 | Overcoming Data Deluge with HBase to Help Save the Environment - OPower

Editor's Notes