SlideShare a Scribd company logo
Presenter: Mike Olson, CEO, Cloudera
Thursday, December 22, 2011
11:00 am PT, 2:00 pm ET
Hadoop: Twelve Predictions for 2012
Looking Back at Apache Hadoop in 2011
• 2011 was the year that Apache Hadoop became part of mainstream
dialogue, and industry interest in Hadoop exploded.
– VC firms and investors continued to make significant commitments to Hadoop,
like Accel Partners’ $100M Big Data Fund.
– Big corporate players such as EMC, IBM, Microsoft and Oracle announced their
entry into the Hadoop space. A handful of Hadoop-focused startups also
launched this year.
– Hadoop-related inquiries into major analyst firms moved from “What exactly is
Hadoop?” to “Which vendors offer robust Hadoop solutions?”
• Hadoop adoption increased within enterprises and new industry use
cases emerged.
• With the uptick of interest in Hadoop, there was a notable increase
in demand for skilled Hadoop workers and data scientists.
2
3
Key Analysts’ Predictions
for Big Data and Hadoop in 2012
Looking Ahead
Industry Analysts Predict…
“We predict that through 2015, organizations integrating high-value,
diverse, new information types and sources into a coherent
information management infrastructure will outperform their industry
peers financially by more than 20%.”
— Merv Adrian, Research VP, Information Management, Gartner
4
Industry Analysts Predict…
“The core Hadoop platform will mature rapidly.
“While I question some of the Hadoop industry's marketing, I have
an optimistic view toward the community's plans for maturing core
Hadoop. There's low-hanging technical fruit. There's understanding
of the need. There's money. There are great engineers. There are
partners with market and technical understanding. The Hadoop
engineering stars are favorably aligned.”
— Curt Monash, Analyst, Monash Research
5
Industry Analysts Predict…
“1. Big Data was the term du jour for 2011. In 2012, we’ll move away from this term, as
ANY DATA will be of paramount importance for enterprises. At the same time, IT
resources are scarce, and quite frankly, most IT organizations are trying to simplify their
IT environments and look at it more strategically and holistically. Intelligent management
will be key, as enterprises look to analyze any data, at any given time (and as near real-
time as possible), and management tools will need to determine, in the most efficient
manner, where the HW/SW resources must come from, whether it’s for structured, semi-
structured, or unstructured…or any combination thereof. This demand will push the
ecosystem to continue to grow and deliver the necessary solutions to the market.
“ 2. Data protection, security are key areas that have not been adequately
addressed…and will become important to do so as enterprises think about leveraging
cloud for their analytics.
“3. ‘Big Data as a service’ the new term du jour. Enterprises will look to leverage both
their own resources for sensitive datasets and cloud for others…service providers will
become players in delivering these services.”
— Vanessa Alvarez, Analyst, Infrastructure & Operations, Forrester
6
Industry Analysts Predict…
“1. We will see a major shift of current experimental Hadoop users (the
bulk of current users) to mainstream production applications that will
ultimately drive the need for more enterprise-class, advanced data
management features. While these applications will be limited in scope
and moderate in cluster size, the business requirements will drive the
need for more options related to HA, advanced data protection and
remote-site failover.
“2. We will see more IT organizations initially considering Hadoop as an
alternative lower cost online storage for Big Data – essentially a
cheaper data store (versus in relational databases with associated license
and SAN attached storage) in preparation for future data processing and
analytics tasks.”
— Julie Lockner, Senior Analyst and Vice President, Data
Management Solutions, ESG
7
Industry Analysts Predict…
“In 2011, the federal markets with the most dramatic growth in
Hadoop were associated with counter terrorism missions and law
enforcement. Fraud detection was also an important market, as was
the use of Hadoop-centric solutions for search needs.
“We expect 2012 will result in growth in federal heath missions and
in military logistics. Research and interest by agencies indicates
health related solutions will leverage Hadoop for predicting
treatment outcomes based on patient data and historical medical
use. We expect military logistics use cases to provide DoD with new
efficiencies in optimizing supply chains and predicting needs before
they arise.”
— Bob Gourley, Founder and CTO, Crucial Point LLC
8
Industry Analysts Predict…
“We believe that during 2012, enterprise distributions of Hadoop will
mature enough that enterprises will accelerate production
deployments and begin to yield tangible organizational value.”
— Ben Woo, VP, Storage and Big Data, IDC
9
10
Mike Olson’s Twelve Predictions
for Hadoop in 2012
#12 The Open Source Platform Expands
• New contributions will go mostly to existing and new packages that
complement Hadoop. In 2011, nearly 70% of all new work was
outside the core project.
• New and incubator projects like Flume, Bigtop and Crunch will
graduate to critical pieces of the Hadoop platform. This is the
continued “Linuxification” of Hadoop.
11
2006 2007 2008 2009 2010 2011
Core
Hadoop
as % of
New
Contribs
Nothing but
Hadoop
… plus 11… plus 8…plus 6Hadoop
plus 3
100% 100%
37%
58%
37% 31%
#11 Vendor Investment Accelerates
• Every significant server, storage and data infrastructure vendor will
have a clearly-articulated Hadoop strategy by end of year.
• Product development dollars will go to:
– Contributions to the open source platform
– Integration of the open source platform with existing IT infrastructure
• Best-of-breed, rather than single-vendor, deployments will dominate
in 2012 (and for at least three years following).
12
The Cloudera Connect partner program
has added at least one new member every
day since its launch in 3Q11.
#10 The Platform Grows Up
• Key issues that worry enterprise users will be addressed.
– High availability guaranteed by improvements to storage infrastructure
• No single point of failure!
– Dramatic performance improvements for storage and analytical workloads
• Integration with existing data center infrastructure will become complete.
• Computer science PhD will no longer be required to operate Hadoop in
production.
13
#9 Hadoop Stops Being Cool
• Fascination with the technology ebbs and platform novelty subsides.
• New focus: What problems can Hadoop solve for me, and how do
I put together a complete suite of products – stack, analytics,
visualization – that I can deploy and use easily?
14
(This is a lie. Hadoop will remain cool.)
#8 It’s All About Apps
• Existing BI and analytics products will connect to Hadoop, unlocking
it to a large audience of business and technical professionals.
• New offerings from established vendors and emerging companies
will be designed from the ground up to take advantage of rich data,
powerful algorithms and new techniques for data visualization. This
represents a major expansion of BI and analytics into new data and
problem domains.
15
#7 Chasm? What Chasm?
• Hadoop adoption by mainstream enterprises will accelerate.
• Smaller companies are seeing the value of Big Data, too, and will
need the tools to analyze it.
• Deployments expand beyond single use cases inside departments to
the broader enterprise.
• It's not how much data you have (PB, TB) but what kind of data you
have and what you do with it.
16
Financial ServicesTelecommunications Life SciencesRetailGovernment
Platform maturity and app proliferation will drive broad adoption.
#6 Here Comes Everybody
• Small and emerging companies,
not widely viewed as having Big
Data problems, will discover
ways to capture and analyze
information in new ways, just like
their larger counterparts.
• Cloud deployment of Hadoop will
accelerate as a result of this
trend – small companies and
startups have little sunk
infrastructure cost and are cash-
conscious, so will deploy on
hosted platforms faster than will
their peers at large enterprises.
17
• With the creation of the Accel Big Data Fund and other VC firms’
interest in Big Data and Hadoop, we will see the launch of numerous
Hadoop-focused startups in 2012.
#5 Opportunity Expands: Security, Storage, Efficiency
• As use of Hadoop in core workloads – storage, data processing and
analytics – matures inside enterprises, they will begin to look at it as
a more general-purpose platform able to attack other business
problems.
• Improvements in platform security, access control and audit logging
means that some compliance workloads will begin to migrate to
Hadoop in the second half of the year.
• Improvements in the open source storage layer mean that storage
gets cheaper still, making Hadoop attractive as an online archive for
data previously written to tape and stored in caves.
• Better measuring and management of resource consumption will
lead to improvements in computational efficiency.
• The variety of problems that Hadoop addresses will expand.
18
#4 Skilled People Get Easier to Find
• Professional training courses aimed at developers, operators and
users will be widely available and very popular in 2012 and beyond.
– Finding people who can deploy, configure and use Hadoop has been
difficult in 2011.
• Systems integrators and consulting firms will establish Hadoop-
focused practices around the world.
19
#3 Data Science Gets Hotter
• Widespread use of Hadoop in new industries, aimed at previously-
intractable problems, will create a new role in the organization:
“Data Scientist.”
– Mathematical and statistical expertise
– Programming skill
– A passion for data and an understanding of the business
• Industry and academia will collaborate to produce this new class
of expert.
20
#2 The Demise of the “Hadoop-Killer”
• Second and third quarters of 2011 saw several vendors announce
“Hadoop killers” – legacy products, deployed in niche markets, that
they wanted to use to capitalize on the opportunity that Big Data
presents.
• In the fourth quarter of 2011, Microsoft announced it was
discontinuing its investment in Dryad. Dryad was, simply, the most
interesting alternative to Hadoop in the market. Instead, the
company is investing in Hadoop.
• The vendors offering alternatives to Hadoop will follow Microsoft’s
lead. These products have no real traction in the Big Data market
and will quietly disappear by the end of 2012.
21
#1 Forks Begin to Fade
• The open source Hadoop platform will be robust, scalable and reliable
enough to take on any enterprise Big Data workload.
• As a result, proprietary forks – MapReduce grafted onto legacy-vendor SQL
databases, appliances that replace the storage layer with proprietary file
systems and others – will see their differentiation erode or disappear.
• Buyers will elect to deploy the open source platform once it demonstrates
feature parity or superiority with these forked products.
22
Commercial opportunities
will constrict for vendors of
forked products.
It will take several years
for these offerings to
disappear altogether.
#0 Bonus Prediction! M&A Consolidation Begins
• Large vendors lacking product and expertise in Hadoop will survey
the market for teams and technologies that can close those gaps.
• Specialist vendors, especially those that have struggled to grow or
with positioning and differentiation challenges, will hire investment
bankers to seek acquirers.
• These early, small acquisitions, combined with an expanding market
and accelerated demand for big data solutions, will drive valuations
for new and existing independent companies.
Buyer meets seller. Nature follows its course.
23
24
We appreciate your time
and interest.
For additional information:
cloudera.com
+1 (888) 789-1488
sales@cloudera.com
twitter.com/
cloudera
facebook.com/
cloudera

More Related Content

PDF
Leaders in the Cloud: Identifying Cloud Business Value for Customers
PDF
Cloud Computing Without The Hype An Executive Guide (1.00 Slideshare)
PDF
Cloud Computing for Enterprise Architects
PDF
Cloud Computing – Time for delivery. The question is not “if”, but “how, whe...
PDF
Challenges in cloud computing to enable future internet of things v0.3
PDF
Cloud Computing Tutorial - Jens Nimis
PDF
Cloud computing overview
PDF
Microsoft Cloud Computing E-Book
Leaders in the Cloud: Identifying Cloud Business Value for Customers
Cloud Computing Without The Hype An Executive Guide (1.00 Slideshare)
Cloud Computing for Enterprise Architects
Cloud Computing – Time for delivery. The question is not “if”, but “how, whe...
Challenges in cloud computing to enable future internet of things v0.3
Cloud Computing Tutorial - Jens Nimis
Cloud computing overview
Microsoft Cloud Computing E-Book

What's hot (17)

PPT
cloud computing
PPTX
Cloud Computing: What it Means for Libraries, Library Staff, Training and Skills
PDF
Defining Your Cloud Strategy
PDF
Cloud Computing for Lawyers: Practical and Ethical Uses of the Cloud
DOCX
Cloud migration plan1. executive summary ( 1 page)2. scope (
PPTX
Cloud 101: The Basics of Cloud Computing
PDF
Cloud what is the best model for vietnam
PPTX
Understanding Cloud Computing & Its Relevance to Financial Software Solutions
PDF
Cloud Computing 101 Workshop Sample
PDF
Cloud Computing Essentials
PDF
Presentation citrix desktop virtualization (2)
PPTX
EMC Big Data Solutions Overview
PDF
Brochure : The EMC Big Data Solution
 
PDF
Impact of busines model elements on cloud computing adoption
PPTX
Lean Cloud - Amazon Web Services
PPTX
Hadoop-as-a-Service for Lifecycle Management Simplicity
PPTX
Transitioning to Hybrid Cloud
cloud computing
Cloud Computing: What it Means for Libraries, Library Staff, Training and Skills
Defining Your Cloud Strategy
Cloud Computing for Lawyers: Practical and Ethical Uses of the Cloud
Cloud migration plan1. executive summary ( 1 page)2. scope (
Cloud 101: The Basics of Cloud Computing
Cloud what is the best model for vietnam
Understanding Cloud Computing & Its Relevance to Financial Software Solutions
Cloud Computing 101 Workshop Sample
Cloud Computing Essentials
Presentation citrix desktop virtualization (2)
EMC Big Data Solutions Overview
Brochure : The EMC Big Data Solution
 
Impact of busines model elements on cloud computing adoption
Lean Cloud - Amazon Web Services
Hadoop-as-a-Service for Lifecycle Management Simplicity
Transitioning to Hybrid Cloud
Ad

Viewers also liked (11)

PDF
Open APIs: What's Hot, What's Not?
PDF
Hybrid Customer Insight - Data Collection and Analysis from On-premise and in...
PDF
Scaling the Cloud - Cloud Security
PPT
Cloud Computing Integration Introduction
PDF
Cloud Computing Technology Overview 2012
PDF
2012 Future of Cloud Computing
PPTX
Future of cloud computing linthicum
PPT
How to Get Cloud Architecture and Design Right the First Time
PPTX
Zuora @ AlwaysOn 2012 - The Only 3 SaaS Metrics That Matter
PPTX
Big data and intelligent platforms
PDF
Architectures for open and scalable clouds
Open APIs: What's Hot, What's Not?
Hybrid Customer Insight - Data Collection and Analysis from On-premise and in...
Scaling the Cloud - Cloud Security
Cloud Computing Integration Introduction
Cloud Computing Technology Overview 2012
2012 Future of Cloud Computing
Future of cloud computing linthicum
How to Get Cloud Architecture and Design Right the First Time
Zuora @ AlwaysOn 2012 - The Only 3 SaaS Metrics That Matter
Big data and intelligent platforms
Architectures for open and scalable clouds
Ad

Similar to Hadoop Twelve Predictions for 2012 (20)

PDF
IIA: The Current State of Hadoop in the Enterprise
PDF
Combining hadoop with big data analytics
PPTX
Big data an elephant business opportunities
PDF
Intel Big Data Analysis Peer Research Slideshare 2013
PDF
Hadoop for Finance - sample chapter
PDF
Are You Prepared For The Future Of Data Technologies?
PDF
Big Data Tools: A Deep Dive into Essential Tools
PPTX
Top 10 renowned big data companies
PDF
2012 iia-predictions-brief-final
PPTX
10 top notch big data trends to watch out for in 2017
PDF
Hadoop Overview
PDF
The Forrester Wave - Big Data Hadoop
PPTX
Big data analytics
PPTX
Big Idea For Big Data
PDF
Top 5 Trends in Big Data & Analytics
PDF
Big dataimplementation hadoop_and_beyond
PPTX
Top 5 Trends in Big Data & Analytics
PPTX
Top 5 Trends in Big Data & Analytics.
PDF
11 Major Reasons To Learn Hadoop .
PDF
The Big Picture on Big Data and Cognos
IIA: The Current State of Hadoop in the Enterprise
Combining hadoop with big data analytics
Big data an elephant business opportunities
Intel Big Data Analysis Peer Research Slideshare 2013
Hadoop for Finance - sample chapter
Are You Prepared For The Future Of Data Technologies?
Big Data Tools: A Deep Dive into Essential Tools
Top 10 renowned big data companies
2012 iia-predictions-brief-final
10 top notch big data trends to watch out for in 2017
Hadoop Overview
The Forrester Wave - Big Data Hadoop
Big data analytics
Big Idea For Big Data
Top 5 Trends in Big Data & Analytics
Big dataimplementation hadoop_and_beyond
Top 5 Trends in Big Data & Analytics
Top 5 Trends in Big Data & Analytics.
11 Major Reasons To Learn Hadoop .
The Big Picture on Big Data and Cognos

More from Cloudera, Inc. (20)

PPTX
Partner Briefing_January 25 (FINAL).pptx
PPTX
Cloudera Data Impact Awards 2021 - Finalists
PPTX
2020 Cloudera Data Impact Awards Finalists
PPTX
Edc event vienna presentation 1 oct 2019
PPTX
Machine Learning with Limited Labeled Data 4/3/19
PPTX
Data Driven With the Cloudera Modern Data Warehouse 3.19.19
PPTX
Introducing Cloudera DataFlow (CDF) 2.13.19
PPTX
Introducing Cloudera Data Science Workbench for HDP 2.12.19
PPTX
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
PPTX
Leveraging the cloud for analytics and machine learning 1.29.19
PPTX
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
PPTX
Leveraging the Cloud for Big Data Analytics 12.11.18
PPTX
Modern Data Warehouse Fundamentals Part 3
PPTX
Modern Data Warehouse Fundamentals Part 2
PPTX
Modern Data Warehouse Fundamentals Part 1
PPTX
Extending Cloudera SDX beyond the Platform
PPTX
Federated Learning: ML with Privacy on the Edge 11.15.18
PPTX
Analyst Webinar: Doing a 180 on Customer 360
PPTX
Build a modern platform for anti-money laundering 9.19.18
PPTX
Introducing the data science sandbox as a service 8.30.18
Partner Briefing_January 25 (FINAL).pptx
Cloudera Data Impact Awards 2021 - Finalists
2020 Cloudera Data Impact Awards Finalists
Edc event vienna presentation 1 oct 2019
Machine Learning with Limited Labeled Data 4/3/19
Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Introducing Cloudera DataFlow (CDF) 2.13.19
Introducing Cloudera Data Science Workbench for HDP 2.12.19
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Leveraging the cloud for analytics and machine learning 1.29.19
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Leveraging the Cloud for Big Data Analytics 12.11.18
Modern Data Warehouse Fundamentals Part 3
Modern Data Warehouse Fundamentals Part 2
Modern Data Warehouse Fundamentals Part 1
Extending Cloudera SDX beyond the Platform
Federated Learning: ML with Privacy on the Edge 11.15.18
Analyst Webinar: Doing a 180 on Customer 360
Build a modern platform for anti-money laundering 9.19.18
Introducing the data science sandbox as a service 8.30.18

Recently uploaded (20)

PDF
[발표본] 너의 과제는 클라우드에 있어_KTDS_김동현_20250524.pdf
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PDF
GamePlan Trading System Review: Professional Trader's Honest Take
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PPT
Teaching material agriculture food technology
PDF
Spectral efficient network and resource selection model in 5G networks
PDF
Network Security Unit 5.pdf for BCA BBA.
PDF
Machine learning based COVID-19 study performance prediction
PDF
Review of recent advances in non-invasive hemoglobin estimation
PDF
Approach and Philosophy of On baking technology
PPTX
breach-and-attack-simulation-cybersecurity-india-chennai-defenderrabbit-2025....
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PDF
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
PDF
Unlocking AI with Model Context Protocol (MCP)
DOCX
The AUB Centre for AI in Media Proposal.docx
PDF
Modernizing your data center with Dell and AMD
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PPTX
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
PDF
NewMind AI Monthly Chronicles - July 2025
[발표본] 너의 과제는 클라우드에 있어_KTDS_김동현_20250524.pdf
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
GamePlan Trading System Review: Professional Trader's Honest Take
20250228 LYD VKU AI Blended-Learning.pptx
Teaching material agriculture food technology
Spectral efficient network and resource selection model in 5G networks
Network Security Unit 5.pdf for BCA BBA.
Machine learning based COVID-19 study performance prediction
Review of recent advances in non-invasive hemoglobin estimation
Approach and Philosophy of On baking technology
breach-and-attack-simulation-cybersecurity-india-chennai-defenderrabbit-2025....
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
The Rise and Fall of 3GPP – Time for a Sabbatical?
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
Unlocking AI with Model Context Protocol (MCP)
The AUB Centre for AI in Media Proposal.docx
Modernizing your data center with Dell and AMD
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
NewMind AI Monthly Chronicles - July 2025

Hadoop Twelve Predictions for 2012

  • 1. Presenter: Mike Olson, CEO, Cloudera Thursday, December 22, 2011 11:00 am PT, 2:00 pm ET Hadoop: Twelve Predictions for 2012
  • 2. Looking Back at Apache Hadoop in 2011 • 2011 was the year that Apache Hadoop became part of mainstream dialogue, and industry interest in Hadoop exploded. – VC firms and investors continued to make significant commitments to Hadoop, like Accel Partners’ $100M Big Data Fund. – Big corporate players such as EMC, IBM, Microsoft and Oracle announced their entry into the Hadoop space. A handful of Hadoop-focused startups also launched this year. – Hadoop-related inquiries into major analyst firms moved from “What exactly is Hadoop?” to “Which vendors offer robust Hadoop solutions?” • Hadoop adoption increased within enterprises and new industry use cases emerged. • With the uptick of interest in Hadoop, there was a notable increase in demand for skilled Hadoop workers and data scientists. 2
  • 3. 3 Key Analysts’ Predictions for Big Data and Hadoop in 2012 Looking Ahead
  • 4. Industry Analysts Predict… “We predict that through 2015, organizations integrating high-value, diverse, new information types and sources into a coherent information management infrastructure will outperform their industry peers financially by more than 20%.” — Merv Adrian, Research VP, Information Management, Gartner 4
  • 5. Industry Analysts Predict… “The core Hadoop platform will mature rapidly. “While I question some of the Hadoop industry's marketing, I have an optimistic view toward the community's plans for maturing core Hadoop. There's low-hanging technical fruit. There's understanding of the need. There's money. There are great engineers. There are partners with market and technical understanding. The Hadoop engineering stars are favorably aligned.” — Curt Monash, Analyst, Monash Research 5
  • 6. Industry Analysts Predict… “1. Big Data was the term du jour for 2011. In 2012, we’ll move away from this term, as ANY DATA will be of paramount importance for enterprises. At the same time, IT resources are scarce, and quite frankly, most IT organizations are trying to simplify their IT environments and look at it more strategically and holistically. Intelligent management will be key, as enterprises look to analyze any data, at any given time (and as near real- time as possible), and management tools will need to determine, in the most efficient manner, where the HW/SW resources must come from, whether it’s for structured, semi- structured, or unstructured…or any combination thereof. This demand will push the ecosystem to continue to grow and deliver the necessary solutions to the market. “ 2. Data protection, security are key areas that have not been adequately addressed…and will become important to do so as enterprises think about leveraging cloud for their analytics. “3. ‘Big Data as a service’ the new term du jour. Enterprises will look to leverage both their own resources for sensitive datasets and cloud for others…service providers will become players in delivering these services.” — Vanessa Alvarez, Analyst, Infrastructure & Operations, Forrester 6
  • 7. Industry Analysts Predict… “1. We will see a major shift of current experimental Hadoop users (the bulk of current users) to mainstream production applications that will ultimately drive the need for more enterprise-class, advanced data management features. While these applications will be limited in scope and moderate in cluster size, the business requirements will drive the need for more options related to HA, advanced data protection and remote-site failover. “2. We will see more IT organizations initially considering Hadoop as an alternative lower cost online storage for Big Data – essentially a cheaper data store (versus in relational databases with associated license and SAN attached storage) in preparation for future data processing and analytics tasks.” — Julie Lockner, Senior Analyst and Vice President, Data Management Solutions, ESG 7
  • 8. Industry Analysts Predict… “In 2011, the federal markets with the most dramatic growth in Hadoop were associated with counter terrorism missions and law enforcement. Fraud detection was also an important market, as was the use of Hadoop-centric solutions for search needs. “We expect 2012 will result in growth in federal heath missions and in military logistics. Research and interest by agencies indicates health related solutions will leverage Hadoop for predicting treatment outcomes based on patient data and historical medical use. We expect military logistics use cases to provide DoD with new efficiencies in optimizing supply chains and predicting needs before they arise.” — Bob Gourley, Founder and CTO, Crucial Point LLC 8
  • 9. Industry Analysts Predict… “We believe that during 2012, enterprise distributions of Hadoop will mature enough that enterprises will accelerate production deployments and begin to yield tangible organizational value.” — Ben Woo, VP, Storage and Big Data, IDC 9
  • 10. 10 Mike Olson’s Twelve Predictions for Hadoop in 2012
  • 11. #12 The Open Source Platform Expands • New contributions will go mostly to existing and new packages that complement Hadoop. In 2011, nearly 70% of all new work was outside the core project. • New and incubator projects like Flume, Bigtop and Crunch will graduate to critical pieces of the Hadoop platform. This is the continued “Linuxification” of Hadoop. 11 2006 2007 2008 2009 2010 2011 Core Hadoop as % of New Contribs Nothing but Hadoop … plus 11… plus 8…plus 6Hadoop plus 3 100% 100% 37% 58% 37% 31%
  • 12. #11 Vendor Investment Accelerates • Every significant server, storage and data infrastructure vendor will have a clearly-articulated Hadoop strategy by end of year. • Product development dollars will go to: – Contributions to the open source platform – Integration of the open source platform with existing IT infrastructure • Best-of-breed, rather than single-vendor, deployments will dominate in 2012 (and for at least three years following). 12 The Cloudera Connect partner program has added at least one new member every day since its launch in 3Q11.
  • 13. #10 The Platform Grows Up • Key issues that worry enterprise users will be addressed. – High availability guaranteed by improvements to storage infrastructure • No single point of failure! – Dramatic performance improvements for storage and analytical workloads • Integration with existing data center infrastructure will become complete. • Computer science PhD will no longer be required to operate Hadoop in production. 13
  • 14. #9 Hadoop Stops Being Cool • Fascination with the technology ebbs and platform novelty subsides. • New focus: What problems can Hadoop solve for me, and how do I put together a complete suite of products – stack, analytics, visualization – that I can deploy and use easily? 14 (This is a lie. Hadoop will remain cool.)
  • 15. #8 It’s All About Apps • Existing BI and analytics products will connect to Hadoop, unlocking it to a large audience of business and technical professionals. • New offerings from established vendors and emerging companies will be designed from the ground up to take advantage of rich data, powerful algorithms and new techniques for data visualization. This represents a major expansion of BI and analytics into new data and problem domains. 15
  • 16. #7 Chasm? What Chasm? • Hadoop adoption by mainstream enterprises will accelerate. • Smaller companies are seeing the value of Big Data, too, and will need the tools to analyze it. • Deployments expand beyond single use cases inside departments to the broader enterprise. • It's not how much data you have (PB, TB) but what kind of data you have and what you do with it. 16 Financial ServicesTelecommunications Life SciencesRetailGovernment Platform maturity and app proliferation will drive broad adoption.
  • 17. #6 Here Comes Everybody • Small and emerging companies, not widely viewed as having Big Data problems, will discover ways to capture and analyze information in new ways, just like their larger counterparts. • Cloud deployment of Hadoop will accelerate as a result of this trend – small companies and startups have little sunk infrastructure cost and are cash- conscious, so will deploy on hosted platforms faster than will their peers at large enterprises. 17 • With the creation of the Accel Big Data Fund and other VC firms’ interest in Big Data and Hadoop, we will see the launch of numerous Hadoop-focused startups in 2012.
  • 18. #5 Opportunity Expands: Security, Storage, Efficiency • As use of Hadoop in core workloads – storage, data processing and analytics – matures inside enterprises, they will begin to look at it as a more general-purpose platform able to attack other business problems. • Improvements in platform security, access control and audit logging means that some compliance workloads will begin to migrate to Hadoop in the second half of the year. • Improvements in the open source storage layer mean that storage gets cheaper still, making Hadoop attractive as an online archive for data previously written to tape and stored in caves. • Better measuring and management of resource consumption will lead to improvements in computational efficiency. • The variety of problems that Hadoop addresses will expand. 18
  • 19. #4 Skilled People Get Easier to Find • Professional training courses aimed at developers, operators and users will be widely available and very popular in 2012 and beyond. – Finding people who can deploy, configure and use Hadoop has been difficult in 2011. • Systems integrators and consulting firms will establish Hadoop- focused practices around the world. 19
  • 20. #3 Data Science Gets Hotter • Widespread use of Hadoop in new industries, aimed at previously- intractable problems, will create a new role in the organization: “Data Scientist.” – Mathematical and statistical expertise – Programming skill – A passion for data and an understanding of the business • Industry and academia will collaborate to produce this new class of expert. 20
  • 21. #2 The Demise of the “Hadoop-Killer” • Second and third quarters of 2011 saw several vendors announce “Hadoop killers” – legacy products, deployed in niche markets, that they wanted to use to capitalize on the opportunity that Big Data presents. • In the fourth quarter of 2011, Microsoft announced it was discontinuing its investment in Dryad. Dryad was, simply, the most interesting alternative to Hadoop in the market. Instead, the company is investing in Hadoop. • The vendors offering alternatives to Hadoop will follow Microsoft’s lead. These products have no real traction in the Big Data market and will quietly disappear by the end of 2012. 21
  • 22. #1 Forks Begin to Fade • The open source Hadoop platform will be robust, scalable and reliable enough to take on any enterprise Big Data workload. • As a result, proprietary forks – MapReduce grafted onto legacy-vendor SQL databases, appliances that replace the storage layer with proprietary file systems and others – will see their differentiation erode or disappear. • Buyers will elect to deploy the open source platform once it demonstrates feature parity or superiority with these forked products. 22 Commercial opportunities will constrict for vendors of forked products. It will take several years for these offerings to disappear altogether.
  • 23. #0 Bonus Prediction! M&A Consolidation Begins • Large vendors lacking product and expertise in Hadoop will survey the market for teams and technologies that can close those gaps. • Specialist vendors, especially those that have struggled to grow or with positioning and differentiation challenges, will hire investment bankers to seek acquirers. • These early, small acquisitions, combined with an expanding market and accelerated demand for big data solutions, will drive valuations for new and existing independent companies. Buyer meets seller. Nature follows its course. 23
  • 24. 24 We appreciate your time and interest. For additional information: cloudera.com +1 (888) 789-1488 sales@cloudera.com twitter.com/ cloudera facebook.com/ cloudera

Editor's Notes

  • #3: Hadoop-related inquiries into major analyst firms moved from “What exactly is Hadoop?” to “Which vendors offer robust Hadoop solutions?”: (This noted in a blog post by Forrester analyst James Kobielus)