SlideShare a Scribd company logo
2020 Big Data & Analytics
Maturity Survey Results
In late 2019 and early 2020, AtScale, Cloudera and ODPI.org collaborated on a
survey to learn about the maturity of the Big Data & Analytics Marketplace.
We collected responses from over 150 Big Data & Analytics leaders from across
industries around the globe.
While it’s clear that enterprises have entered the era of Big Data & Analytics
which confirmed our suspicions, we uncovered a few surprises too.
2
Introduction
▵ Multi-cloud strategies are the reality for most enterprises
Most are going with a hybrid/multi-cloud strategy with only 24% choosing to go with a single vendor.
▵ Investment in Hadoop is staying the same or growing
Hadoop investment is staying the same for 53% of respondents while 30% of respondents plan to invest more in
Hadoop.
▵ More companies are implementing data virtualization
55% of respondents plan to invest in data virtualization soon if they are not doing so already
▵ Data governance is a top challenge across the board
80% of respondents said that data governance is important to them
▵ Azure is gaining market share in the public cloud space
Azure offers the most popular cloud data warehouse used in our survey followed by AWS, Google and Snowflake
3
Key Findings
4
Audience Metrics
5
Which on-premise data platforms do you use?
6
If you are using Open Source, how do you plan to invest in the future?
7
How do you plan to invest in existing on-premise data platforms?
8
Do you currently operate data platforms in the public cloud?
9
Which cloud data platforms are you using?
10
Do you plan on deploying cloud data platforms in the future?
11
Rank your organization’s maturity running a cloud data platform
12
What is your timeline for deploying a cloud data platform?
13
What is your cloud deployment strategy?
14
What are you hoping to achieve with the public cloud?
15
What workloads are you deploying in the public cloud?
16
What is your strategy for storing data in the public cloud?
17
Which technologies are you using today and in the future?
18
What challenges are you experiencing with the public cloud?
19
What challenges do you have with your analytics infrastructure?
20
Which BI tools is your organization using today and in the future?
21
Which AI/ML tools is your organization using today and in the future?
22
What would you prefer your data platform to be?
23
How important is data governance for your data in the cloud?
24
How important is data governance across all analytics & ML?
Download the 2020 Big Data & Analytics Maturity
Survey Report at: www.atscale.com
For More Information
25

More Related Content

PPTX
The Challenge of Driving Business Value from the Analytics of Things (AOT)
PDF
Turning Your Data Lake into Measurable Business Value
PPTX
Use Cases from Batch to Streaming, MapReduce to Spark, Mainframe to Cloud: To...
PDF
Oracle Stream Analytics - Developer Introduction
PPTX
Hadoop for the Masses
PPTX
Analytics at the Speed of Thought: Actian Express Overview
PDF
Weathering the Data Storm – How SnapLogic and AWS Deliver Analytics in the Cl...
PPTX
SQL + Hadoop: The High Performance Advantage�
The Challenge of Driving Business Value from the Analytics of Things (AOT)
Turning Your Data Lake into Measurable Business Value
Use Cases from Batch to Streaming, MapReduce to Spark, Mainframe to Cloud: To...
Oracle Stream Analytics - Developer Introduction
Hadoop for the Masses
Analytics at the Speed of Thought: Actian Express Overview
Weathering the Data Storm – How SnapLogic and AWS Deliver Analytics in the Cl...
SQL + Hadoop: The High Performance Advantage�

What's hot (18)

PDF
Dickey's Barbecue Pit Heats Up Analytics with Amazon Web Services
PPTX
Swimming Across the Data Lake, Lessons learned and keys to success
PPTX
Extending Data Lake using the Lambda Architecture June 2015
PDF
Intelligent Integration OOW2017 - Jeff Pollock
PPTX
Security, ETL, BI & Analytics, and Software Integration
PPTX
Cloudera, Azure and Big Data at Cloudera Meetup '17
PPTX
An Introduction to Talend Integration Cloud
PPTX
Hadoop Hadoop & Spark meetup - Altiscale
PPTX
Big Data Management: What's New, What's Different, and What You Need To Know
PPTX
Hybrid Data Architecture: Integrating Hadoop with a Data Warehouse
PDF
Achieving Agility and Scale for Your Data Lake - Talend
PDF
4AA6-4492ENW
PPTX
Rob Bearden Keynote Hadoop Summit San Jose
PDF
SAP HANA Data Center Intelligence Overview
PPTX
Hadoop for Humans: Introducing SnapReduce 2.0
PPTX
Analysis of Major Trends in Big Data Analytics
PPTX
Spark meets Smart Meters
PPTX
Hortonworks Oracle Big Data Integration
Dickey's Barbecue Pit Heats Up Analytics with Amazon Web Services
Swimming Across the Data Lake, Lessons learned and keys to success
Extending Data Lake using the Lambda Architecture June 2015
Intelligent Integration OOW2017 - Jeff Pollock
Security, ETL, BI & Analytics, and Software Integration
Cloudera, Azure and Big Data at Cloudera Meetup '17
An Introduction to Talend Integration Cloud
Hadoop Hadoop & Spark meetup - Altiscale
Big Data Management: What's New, What's Different, and What You Need To Know
Hybrid Data Architecture: Integrating Hadoop with a Data Warehouse
Achieving Agility and Scale for Your Data Lake - Talend
4AA6-4492ENW
Rob Bearden Keynote Hadoop Summit San Jose
SAP HANA Data Center Intelligence Overview
Hadoop for Humans: Introducing SnapReduce 2.0
Analysis of Major Trends in Big Data Analytics
Spark meets Smart Meters
Hortonworks Oracle Big Data Integration
Ad

Similar to 2020 Big Data & Analytics Maturity Survey Results (20)

PDF
6 HOTTEST DATA ANALYTICS TRENDS TO PREPARE AHEAD OF 2025.pdf
PDF
Intel Big Data Analysis Peer Research Slideshare 2013
PDF
Technology Trends 2014 and Beyond
PDF
Big Data Trends and Challenges Report - Whitepaper
PDF
Top-Five-Data-Science-and-Generative-AI-Trends-for-2024.pdf
PDF
Big Data Management: A Unified Approach to Drive Business Results
PPTX
Creating an Enterprise AI Strategy
PPTX
Big Data Maturity Scorecard
PDF
Successful cloud partners idc (en)
PPTX
10 top notch big data trends to watch out for in 2017
PDF
Adoption is the only option hadoop is changing our world and changing yours f...
PPTX
Module 6 The Future of Big and Smart Data- Online
PDF
The Top 8 Trends for Big Data in 2016
PDF
Supply chain and Big data : top 5 Trends
PDF
Business with Cloud Computing
PPTX
Hadoop Twelve Predictions for 2012
PDF
The boom in Xaas and the knowledge graph
PDF
Forecast of Big Data Trends
PDF
DAS Slides: Cloud-Based Data Warehousing – What’s New and What Stays the Same
PDF
IIA: The Current State of Hadoop in the Enterprise
6 HOTTEST DATA ANALYTICS TRENDS TO PREPARE AHEAD OF 2025.pdf
Intel Big Data Analysis Peer Research Slideshare 2013
Technology Trends 2014 and Beyond
Big Data Trends and Challenges Report - Whitepaper
Top-Five-Data-Science-and-Generative-AI-Trends-for-2024.pdf
Big Data Management: A Unified Approach to Drive Business Results
Creating an Enterprise AI Strategy
Big Data Maturity Scorecard
Successful cloud partners idc (en)
10 top notch big data trends to watch out for in 2017
Adoption is the only option hadoop is changing our world and changing yours f...
Module 6 The Future of Big and Smart Data- Online
The Top 8 Trends for Big Data in 2016
Supply chain and Big data : top 5 Trends
Business with Cloud Computing
Hadoop Twelve Predictions for 2012
The boom in Xaas and the knowledge graph
Forecast of Big Data Trends
DAS Slides: Cloud-Based Data Warehousing – What’s New and What Stays the Same
IIA: The Current State of Hadoop in the Enterprise
Ad

Recently uploaded (20)

PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
DOCX
The AUB Centre for AI in Media Proposal.docx
PDF
MIND Revenue Release Quarter 2 2025 Press Release
PPTX
Programs and apps: productivity, graphics, security and other tools
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PDF
Encapsulation theory and applications.pdf
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PDF
Approach and Philosophy of On baking technology
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PPTX
sap open course for s4hana steps from ECC to s4
PPTX
Cloud computing and distributed systems.
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
Empathic Computing: Creating Shared Understanding
PDF
Spectral efficient network and resource selection model in 5G networks
PDF
Electronic commerce courselecture one. Pdf
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
The AUB Centre for AI in Media Proposal.docx
MIND Revenue Release Quarter 2 2025 Press Release
Programs and apps: productivity, graphics, security and other tools
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
Chapter 3 Spatial Domain Image Processing.pdf
Encapsulation theory and applications.pdf
The Rise and Fall of 3GPP – Time for a Sabbatical?
Approach and Philosophy of On baking technology
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Reach Out and Touch Someone: Haptics and Empathic Computing
Agricultural_Statistics_at_a_Glance_2022_0.pdf
sap open course for s4hana steps from ECC to s4
Cloud computing and distributed systems.
Digital-Transformation-Roadmap-for-Companies.pptx
Empathic Computing: Creating Shared Understanding
Spectral efficient network and resource selection model in 5G networks
Electronic commerce courselecture one. Pdf
20250228 LYD VKU AI Blended-Learning.pptx

2020 Big Data & Analytics Maturity Survey Results

  • 1. 2020 Big Data & Analytics Maturity Survey Results
  • 2. In late 2019 and early 2020, AtScale, Cloudera and ODPI.org collaborated on a survey to learn about the maturity of the Big Data & Analytics Marketplace. We collected responses from over 150 Big Data & Analytics leaders from across industries around the globe. While it’s clear that enterprises have entered the era of Big Data & Analytics which confirmed our suspicions, we uncovered a few surprises too. 2 Introduction
  • 3. ▵ Multi-cloud strategies are the reality for most enterprises Most are going with a hybrid/multi-cloud strategy with only 24% choosing to go with a single vendor. ▵ Investment in Hadoop is staying the same or growing Hadoop investment is staying the same for 53% of respondents while 30% of respondents plan to invest more in Hadoop. ▵ More companies are implementing data virtualization 55% of respondents plan to invest in data virtualization soon if they are not doing so already ▵ Data governance is a top challenge across the board 80% of respondents said that data governance is important to them ▵ Azure is gaining market share in the public cloud space Azure offers the most popular cloud data warehouse used in our survey followed by AWS, Google and Snowflake 3 Key Findings
  • 5. 5 Which on-premise data platforms do you use?
  • 6. 6 If you are using Open Source, how do you plan to invest in the future?
  • 7. 7 How do you plan to invest in existing on-premise data platforms?
  • 8. 8 Do you currently operate data platforms in the public cloud?
  • 9. 9 Which cloud data platforms are you using?
  • 10. 10 Do you plan on deploying cloud data platforms in the future?
  • 11. 11 Rank your organization’s maturity running a cloud data platform
  • 12. 12 What is your timeline for deploying a cloud data platform?
  • 13. 13 What is your cloud deployment strategy?
  • 14. 14 What are you hoping to achieve with the public cloud?
  • 15. 15 What workloads are you deploying in the public cloud?
  • 16. 16 What is your strategy for storing data in the public cloud?
  • 17. 17 Which technologies are you using today and in the future?
  • 18. 18 What challenges are you experiencing with the public cloud?
  • 19. 19 What challenges do you have with your analytics infrastructure?
  • 20. 20 Which BI tools is your organization using today and in the future?
  • 21. 21 Which AI/ML tools is your organization using today and in the future?
  • 22. 22 What would you prefer your data platform to be?
  • 23. 23 How important is data governance for your data in the cloud?
  • 24. 24 How important is data governance across all analytics & ML?
  • 25. Download the 2020 Big Data & Analytics Maturity Survey Report at: www.atscale.com For More Information 25

Editor's Notes

  • #5: We collected data from over 150 data practitioners for this survey. The respondents are working across industries and are located around the world. 49% identified as data & analytics professionals while the other 51% were in business intelligence (BI) or IT teams or were data consumers.
  • #6: Most survey respondents have data on multiple on-premise platforms. The most common sources are Open Source (Hadoop), Oracle and SQL Server. A recent CRN article asserts that “With the bulk of information technology spending still on-premises, enterprises’ modernization of their core applications will continue to transform business technology and the industry this year.” Our respondents have 33% of their data on Open Source (Hadoop), 28% on SQL Server, 18% on Oracle and 8% on Teradata.
  • #7: 53% of survey respondents indicated that they plan to keep their investment in Open Source (Hadoop) the same. While 30% plan to invest more.
  • #8: Respondents were more evenly split on their plans to invest in their existing on-premise data platforms other than Open Source (Hadoop) in 2020. 42% plan to say the same. 33% plan to invest more. And 25% plan to invest less.
  • #9: 61% of respondents are operating data platforms in the public cloud. This is in line with Forrester’s research that shows that currently, 65% of North American enterprises rely on public cloud platforms.
  • #10: When asked to check off which data platforms they were working with, respondents showed that they are using multiple data platforms in the public cloud from AWS to Open Source to Teradata Cloud.
  • #11: Of the 38% of respondents who indicated that they are not on the cloud at all, 48% plan to deploy a public cloud data warehouse/platform in the future, 16% said that thy plan to stay off the cloud, and 36% are still evaluating their options.
  • #12: When asked to rank how your organization’s maturity with running a data platform in the cloud, 23% are deployed in the cloud and say that it’s working well. 29% are running a data platform in the cloud and are still working out wrinkles while 3% say that it’s not working well. The remainder of respondents said they were either not on the cloud or want to get there but haven’t started planning yet.
  • #13: 41% of survey respondents are either deployed or in the process of deploying on the public cloud. 21% will be deployed on the public cloud in less than a year.
  • #14: We found it surprising that only 24% of those surveyed are all in with a single cloud vendor. The majority of respondents are working with multiple public cloud vendors and have a hybrid cloud strategy in place.
  • #15: When respondents were asked what they were trying to achieve in the public cloud, the top responses were: flexibility to scale up and down, get better data and analytics, and lower costs. Close behind that was the ability to access new technologies and deploy new applications faster.
  • #16: Our respondents told us that they are looking to deploy data science, data warehousing, and business intelligence in the cloud. And how are they going to do it? By streaming real-time data and using ETL.
  • #17: Most of our respondents are storing data on a data lake in the public cloud or in a cloud data warehouse.
  • #18: Our survey shows that 49 respondents are using data virtualization now and 57 respondents plan to use data virtualization in the future. Data virtualization technology has been around for quite some time. However, the explosion in data size and variety, coupled with the increased focus on analytical use cases, has created new challenges for legacy data virtualization technologies. The need for ad hoc access to both live and historical data has steadily increased among business users as they leverage artificial intelligence (AI) and machine learning (ML) platforms for analytics.
  • #19: When asked about the challenges that respondents are experiencing with the public cloud, 32 people indicated that security was a challenge. Given that 79% of respondents indicated that having consistent, integrated security and governance for your data in public cloud, private cloud and/or hybrid cloud is very important, this is not a surprise. It’s also not as surprising to see that 29 people said that the cost of the public cloud is higher than expected.
  • #20: When asked to select all areas that apply to challenges that you’re experiencing with your analytics infrastructure, governance was the number one challenge followed by skills, performance, costs and security.
  • #21: Microsoft applications - Excel and PowerBI - are some of the most popular business intelligence tools today along with Tableau. Looking towards the future, Excel will become less of a “go to” business intelligence tool as enterprises look to using more PowerBI and Tableau. When it comes to artificial intelligence (AI) and machine learning (ML), Spark (Open Source) is the most used technology with respondents.
  • #22: In the future, Spark remains the technology of choice, along with Databricks and Cloudera Data Science Workbench. When respondents chose “other” the most frequent responses were: Pecan, Azure, and “don’t know yet.”
  • #23: So what is the preferred data platform? 39% said that they’d like analytics and machine learning from multiple vendors that are tied together.
  • #24: Most respondents 79% indicated that having consistent, integrated security and governance for your data in public cloud, private cloud and/or hybrid cloud is very important.
  • #25: 73% indicated that having consistent, integrated security and governance across all of your analytics and machine learning functions is very important.
  • #26: Security and data governance are extremely important as enterprise customers store data on data lakes and in the public cloud and as they perform analytics and artificial intelligence on their data. And, more and more data users are looking to data virtualization in order to get live connections to data from multiple sources so that they can make better data-driven decisions for their business. Thank you again for joining our webinar today. For more information, download the 2020 Big Data & Analytics Maturity Survey Results at www.atscale.com