4 hadoop for-the-disillusioned

Hadoop for the disillusioned
Steve Watt, Red Hat

CC flickr rubenswieringa

@wattsteve

Wired Magazine - July 2008

@wattsteve

Hadoop in 2013
Platform Layers

Technologies

Computational
Runtimes

YARN, GiRAPH, MapReduce,
HBase, Phoenix, Spark/BDAS,
Drill, Impala, Stinger & more

FileSystems

Azure, CassandraFS, CephFS,
CleverSafe, GlusterFS, GridGain,
HDFS, Lustre
MapR FS, S3, SWIFT, Quantcast
FS, Symantec VCFS & more

Infrastructures

System on a Chip, x86,
Virtualization and Cloud

Distributions

Cloudera, Hortonworks, IBM,
Intel, MapR, WanDisco

CC flickr lowfatbrains

@wattsteve

Source: Gartner Hype Cycle

@wattsteve

Your data is growing beyond your ability to manage & query it

CC flickr kakadu

@wattsteve

Save money when asking the same questions of your data

CC flickr martijnsnels

@wattsteve

Hadoop Customer, “Great, but now what?”
Innovators

Early
Adopters

Early
Majority

Late
Majority

Laggards

CHASM

Geoffrey Moore’s Technology Adoption Lifecycle

@wattsteve

new
and build data products

CC flickr cbcastro

@wattsteve







Ask your domain experts and LOB folks what unanswered questions they have
Where can you get the data you need to answer that question? (domain experts should know
where to get it)
Some of this data may be outside your organization (Social Media, Sensor Data, Data
brokerages/Marketplaces, Web Pages) and some of it may be inside.
If the data for the query doesn’t exist, figure out how to instrument or gather it.
Pair your domain experts with your data engineers so they can work out how to obtain and
massage the data given the types of queries desired

CC flickr birdwatcher63

@wattsteve

• Building data products is a similar exercise except that it involves typical product planning,
such as identifying a market.
• This is also a great way for an organization to explore what assets they have within their data

CC flickr syume

@wattsteve

Mapping the night sky

CC flickr bobfamiliar

@wattsteve

Analyzing farm soil content
to predict human conflict

CC flickr oxfam

@wattsteve

Crisis Management for the
Chilean Earthquake

CC flickr flodigrip

@wattsteve

Thanks for listening

Steve Watt

swatt@redhat.com

@wattsteve

4 hadoop for-the-disillusioned

More Related Content

What's hot (19)

Viewers also liked (6)

Similar to 4 hadoop for-the-disillusioned (20)

More from BigDataCamp (10)

Recently uploaded (20)

4 hadoop for-the-disillusioned

Editor's Notes