Reinventing the Information Pipeline
From Big Data Strategy to Big Value December 2016
Agenda
• Introduction
• Challenges in the Information Pipeline
• Paxata in the Converged Data Platform
Paxata’s mission (since 2012)
Deliver the only enterprise-grade data preparation platform
for everyone to transform raw, meaningless data into
valuable, contextual and complete information
4
Source: Gartner News Room: http://www.gartner.com/newsroom/id/2975018
83%Companies agree that data is
their most strategic asset
5
Source: Gartner News Room: http://www.gartner.com/newsroom/id/2975018
80%Time analysts will spend trying to
create data sets to draw insights
83%Companies agree that data is
their most strategic asset
6
Source: Gartner News Room: http://www.gartner.com/newsroom/id/2975018
12%Amount of data most companies
estimate they are analyzing
80%Time analysts will spend trying to
create data sets to draw insights
83%Companies agree that data is
their most strategic asset
7
The data chasm
Source: Gartner News Room: http://www.gartner.com/newsroom/id/2975018
12%Amount of data most companies
estimate they are analyzing
80%Time analysts will spend trying to
create data sets to draw insights
83%Companies agree that data is
their most strategic asset
Challenges in the Information Pipeline
Traditional data preparation
creates a bottleneck
Traditional data preparation creates a bottleneck
Business teams have complex data sources for analytics projects
Traditional data preparation creates a bottleneck
Business teams funnel their requirements to IT
IT-centric data preparation
Business
Information
Traditional data preparation creates a bottleneck
IT runs requirements through a linear ETL process
executed with manual scripting or coding
IT-Centric Data Preparation
Model Extract Transform Load Optimize
Business
Information
Traditional data preparation creates a bottleneck
IT reviews with business. Makes changes, fixes errors.
(Repeat)
IT-Centric Data Preparation
Model Extract Transform Load Optimize
Business
Information
Business teams make decisions before data is available
-or-
Ask for changes and restart the process.
IT-Centric Data Preparation
Model Extract Transform Load Optimize
Business
Information
Traditional data preparation creates a bottleneck
Designed for highly specialized technical people to prepare data for
business teams
IT-Centric Data Preparation
Model Extract Transform Load Optimize
Business
Information
Traditional data preparation creates a bottleneck
Designing for highly specialized technical
people to prepare data for business teams.
Expensive
Complicated
Error-prone
Time-consuming
Modern architecture balances
freedom with responsibility
Modern architecture: balancing freedom with responsibility
Built for business
•Freedom and
flexibility with
collaboration
Modern architecture: balancing freedom with responsibility
Collect and manage data
Time
Built for business
•Freedom and
flexibility with
collaboration
Enabled by IT
•Data governance,
scale, efficiency
Modern information pipeline is
Built for business
Freedom and flexibility with collaboration
Enabled by IT
Data governance, scale, efficiency
Data prep must address the
range of information workers
Data prep must address the range of information workers
Source: Forrester Research, Inc., “Info Workers Will Erase The Boundary Between
Enterprise and Consumer Technologies,” August 30, 2012
Deep Technical Skills Limited Technical Skills
Data Scientist
Data Developer
Data Analyst
Business Analyst
Information
Worker
Data prep must address the range of information workers
Source: Forrester Research, Inc., “Info Workers Will Erase The Boundary Between
Enterprise and Consumer Technologies,” August 30, 2012
Deep Technical Skills Limited Technical Skills
Data Scientist
(200K)
Data Developer
(600K)
Data Analyst
(100M)
Business Analyst
(275M)
Information
Worker
(460M)
Paxata accelerates the
data to information pipeline
Data Lake
Enterprise
Local
Paxata accelerates the data to information pipeline
Data Lake
Enterprise
Local
Paxata accelerates the data to information pipeline
Data Lake
Enterprise
Local
Paxata accelerates the data to information pipeline
BI/Visualization
Predictive
Data Lake
Enterprise
Local
Paxata accelerates the data to information pipeline
BI/Visualization
Predictive
Consumer experience for preparing data
Architecture of the Paxata Adaptive Information Platform
Architecture of the Paxata Adaptive Information Platform
Contact us
Paxata in the apps gallery
Register for Paxata Live:
www.paxata.com/events
info@paxata.com
www.youtube.com/PaxataTV
www.paxata.com
December 8, 2016© Paxata, Inc. 32
Thank You!

More Related Content

PPTX
Self-Service Analytics
PPTX
Big Data Day LA 2016/ Hadoop/ Spark/ Kafka track - Panel - Interactive Applic...
PPT
NLB Analytics Overview
PDF
General Data Protection Regulation - BDW Meetup, October 11th, 2017
PPTX
Data Intelligence: How the Amalgamation of Data, Science, and Technology is C...
PPTX
Architecting Data For The Modern Enterprise - Data Summit 2017, Closing Keynote
PDF
Intro to Data Science on Hadoop
PPTX
Using Machine Learning & Spark to Power Data-Driven Marketing
Self-Service Analytics
Big Data Day LA 2016/ Hadoop/ Spark/ Kafka track - Panel - Interactive Applic...
NLB Analytics Overview
General Data Protection Regulation - BDW Meetup, October 11th, 2017
Data Intelligence: How the Amalgamation of Data, Science, and Technology is C...
Architecting Data For The Modern Enterprise - Data Summit 2017, Closing Keynote
Intro to Data Science on Hadoop
Using Machine Learning & Spark to Power Data-Driven Marketing

What's hot (20)

PDF
The Rise of the CDO in Today's Enterprise
PPTX
A modern, flexible approach to Hadoop implementation incorporating innovation...
PDF
Data catalog
PDF
Building New Data Ecosystem for Customer Analytics, Strata + Hadoop World, 2016
PDF
Integrating the CDO Role Into Your Organization; Managing the Disruption (MIT...
PDF
Focus on Your Analysis, Not Your SQL Code
PPTX
Big and fast data strategy 2017 jr
PDF
How to Consume Your Data for AI
PDF
Building a New Platform for Customer Analytics
PDF
Data Catalog as the Platform for Data Intelligence
PDF
Using Machine Learning to Understand and Predict Marketing ROI
PDF
You're the New CDO, Now What?
PDF
Getting down to business on Big Data analytics
PDF
Big Data Analytics on the Cloud
PDF
Data lineage to drive compliance and as a business imperative
PPTX
DataOps: Nine steps to transform your data science impact Strata London May 18
PDF
Slides: Case Study — How J.B. Hunt is Driving Efficiency with AI and Real-Tim...
PDF
Chief Data & Analytics Officer Fall Boston - Presentation
PDF
NTXISSACSC3 - Why Enterprise Information Management is the Key to GRC by Mika...
PDF
DI&A Slides: Data Lake vs. Data Warehouse
The Rise of the CDO in Today's Enterprise
A modern, flexible approach to Hadoop implementation incorporating innovation...
Data catalog
Building New Data Ecosystem for Customer Analytics, Strata + Hadoop World, 2016
Integrating the CDO Role Into Your Organization; Managing the Disruption (MIT...
Focus on Your Analysis, Not Your SQL Code
Big and fast data strategy 2017 jr
How to Consume Your Data for AI
Building a New Platform for Customer Analytics
Data Catalog as the Platform for Data Intelligence
Using Machine Learning to Understand and Predict Marketing ROI
You're the New CDO, Now What?
Getting down to business on Big Data analytics
Big Data Analytics on the Cloud
Data lineage to drive compliance and as a business imperative
DataOps: Nine steps to transform your data science impact Strata London May 18
Slides: Case Study — How J.B. Hunt is Driving Efficiency with AI and Real-Tim...
Chief Data & Analytics Officer Fall Boston - Presentation
NTXISSACSC3 - Why Enterprise Information Management is the Key to GRC by Mika...
DI&A Slides: Data Lake vs. Data Warehouse
Ad

Viewers also liked (20)

PPTX
Managing uncertainty in data - Presentation at Data Science Northeast Netherl...
PDF
MonitoringFrameWork
PDF
Data Culture Series - Keynote - 27th Jan, London
PDF
Making Hadoop based analytics simple for everyone to use
PDF
Supply chain and Big data : top 5 Trends
PPTX
Continuous Performance Testing
PPSX
광역화 집단에너지사업제안서
PPT
Exploring Data Preparation and Visualization Tools for Urban Forestry
PDF
Data Preparation for Data Science
PPT
Trace 3 interview questions and answers
PPTX
Essential Data Engineering for Data Scientist
PPTX
Jagger release 2.0
PPTX
Driving Retail Success with Machine Data Intelligence
PDF
How Can You Calculate the Cost of Your Data?
PPTX
Database Camp 2016 @ United Nations, NYC - Javier de la Torre, CEO, CARTO
PPTX
Потоковая обработка больших данных
KEY
Geemus
PPT
Cohodatawebinar
PDF
Engine Yard Cloud Architecture Enhancements
PDF
6 tips for improving ruby performance
Managing uncertainty in data - Presentation at Data Science Northeast Netherl...
MonitoringFrameWork
Data Culture Series - Keynote - 27th Jan, London
Making Hadoop based analytics simple for everyone to use
Supply chain and Big data : top 5 Trends
Continuous Performance Testing
광역화 집단에너지사업제안서
Exploring Data Preparation and Visualization Tools for Urban Forestry
Data Preparation for Data Science
Trace 3 interview questions and answers
Essential Data Engineering for Data Scientist
Jagger release 2.0
Driving Retail Success with Machine Data Intelligence
How Can You Calculate the Cost of Your Data?
Database Camp 2016 @ United Nations, NYC - Javier de la Torre, CEO, CARTO
Потоковая обработка больших данных
Geemus
Cohodatawebinar
Engine Yard Cloud Architecture Enhancements
6 tips for improving ruby performance
Ad

Similar to Reinventing the Modern Information Pipeline: Paxata and MapR (20)

PPTX
Data Analytics in Digital Transformation
PDF
Decision Ready Data: Power Your Analytics with Great Data
PPTX
Modernizing Architecture for a Complete Data Strategy
PPTX
IBM Solutions Connect 2013 - Getting started with Big Data
PPTX
The Power of your Data Achieved - Next Gen Modernization
PPTX
Event-driven Business: How Leading Companies Are Adopting Streaming Strategies
PPTX
Making advanced analytics work for you
PPTX
Making advanced analytics work for you
PPTX
State and Trends of the Analytics Market by Jose Fernandez
PPTX
Designing Data Pipelines for Automous and Trusted Analytics
PDF
Data Trends for 2019: Extracting Value from Data
PPTX
Building the Analytics Capability
PPTX
Decentralizing Analytics - A Strategy for Organizing Effective Analytics Teams
PPTX
Big Data LDN 2016: Case Studies of Business Transformation through Big Data
PDF
Capgemini Leap Data Transformation Framework with Cloudera
PDF
Data & Analytic Innovations: 5 lessons from our customers
PDF
Data Virtualization - Enabling Next Generation Analytics
PDF
Automating intelligence
PDF
Event-driven Business: How Leading Companies are Adopting Streaming Strategies
PPTX
Become Data Driven With Hadoop as-a-Service
Data Analytics in Digital Transformation
Decision Ready Data: Power Your Analytics with Great Data
Modernizing Architecture for a Complete Data Strategy
IBM Solutions Connect 2013 - Getting started with Big Data
The Power of your Data Achieved - Next Gen Modernization
Event-driven Business: How Leading Companies Are Adopting Streaming Strategies
Making advanced analytics work for you
Making advanced analytics work for you
State and Trends of the Analytics Market by Jose Fernandez
Designing Data Pipelines for Automous and Trusted Analytics
Data Trends for 2019: Extracting Value from Data
Building the Analytics Capability
Decentralizing Analytics - A Strategy for Organizing Effective Analytics Teams
Big Data LDN 2016: Case Studies of Business Transformation through Big Data
Capgemini Leap Data Transformation Framework with Cloudera
Data & Analytic Innovations: 5 lessons from our customers
Data Virtualization - Enabling Next Generation Analytics
Automating intelligence
Event-driven Business: How Leading Companies are Adopting Streaming Strategies
Become Data Driven With Hadoop as-a-Service

Recently uploaded (20)

PPTX
Steganography Project Steganography Project .pptx
PPTX
Pilar Kemerdekaan dan Identi Bangsa.pptx
PPTX
CYBER SECURITY the Next Warefare Tactics
PDF
Transcultural that can help you someday.
PPTX
Phase1_final PPTuwhefoegfohwfoiehfoegg.pptx
PPTX
New ISO 27001_2022 standard and the changes
PDF
Data Engineering Interview Questions & Answers Cloud Data Stacks (AWS, Azure,...
PDF
Optimise Shopper Experiences with a Strong Data Estate.pdf
PPT
DU, AIS, Big Data and Data Analytics.ppt
PPTX
QUANTUM_COMPUTING_AND_ITS_POTENTIAL_APPLICATIONS[2].pptx
PDF
Global Data and Analytics Market Outlook Report
PPTX
modul_python (1).pptx for professional and student
PPTX
A Complete Guide to Streamlining Business Processes
PDF
Data Engineering Interview Questions & Answers Batch Processing (Spark, Hadoo...
PDF
OneRead_20250728_1808.pdfhdhddhshahwhwwjjaaja
PPT
Image processing and pattern recognition 2.ppt
PPTX
SET 1 Compulsory MNH machine learning intro
DOCX
Factor Analysis Word Document Presentation
PPTX
chrmotography.pptx food anaylysis techni
PDF
Systems Analysis and Design, 12th Edition by Scott Tilley Test Bank.pdf
Steganography Project Steganography Project .pptx
Pilar Kemerdekaan dan Identi Bangsa.pptx
CYBER SECURITY the Next Warefare Tactics
Transcultural that can help you someday.
Phase1_final PPTuwhefoegfohwfoiehfoegg.pptx
New ISO 27001_2022 standard and the changes
Data Engineering Interview Questions & Answers Cloud Data Stacks (AWS, Azure,...
Optimise Shopper Experiences with a Strong Data Estate.pdf
DU, AIS, Big Data and Data Analytics.ppt
QUANTUM_COMPUTING_AND_ITS_POTENTIAL_APPLICATIONS[2].pptx
Global Data and Analytics Market Outlook Report
modul_python (1).pptx for professional and student
A Complete Guide to Streamlining Business Processes
Data Engineering Interview Questions & Answers Batch Processing (Spark, Hadoo...
OneRead_20250728_1808.pdfhdhddhshahwhwwjjaaja
Image processing and pattern recognition 2.ppt
SET 1 Compulsory MNH machine learning intro
Factor Analysis Word Document Presentation
chrmotography.pptx food anaylysis techni
Systems Analysis and Design, 12th Edition by Scott Tilley Test Bank.pdf

Reinventing the Modern Information Pipeline: Paxata and MapR

  • 1. Reinventing the Information Pipeline From Big Data Strategy to Big Value December 2016
  • 2. Agenda • Introduction • Challenges in the Information Pipeline • Paxata in the Converged Data Platform
  • 3. Paxata’s mission (since 2012) Deliver the only enterprise-grade data preparation platform for everyone to transform raw, meaningless data into valuable, contextual and complete information
  • 4. 4 Source: Gartner News Room: http://www.gartner.com/newsroom/id/2975018 83%Companies agree that data is their most strategic asset
  • 5. 5 Source: Gartner News Room: http://www.gartner.com/newsroom/id/2975018 80%Time analysts will spend trying to create data sets to draw insights 83%Companies agree that data is their most strategic asset
  • 6. 6 Source: Gartner News Room: http://www.gartner.com/newsroom/id/2975018 12%Amount of data most companies estimate they are analyzing 80%Time analysts will spend trying to create data sets to draw insights 83%Companies agree that data is their most strategic asset
  • 7. 7 The data chasm Source: Gartner News Room: http://www.gartner.com/newsroom/id/2975018 12%Amount of data most companies estimate they are analyzing 80%Time analysts will spend trying to create data sets to draw insights 83%Companies agree that data is their most strategic asset
  • 8. Challenges in the Information Pipeline
  • 10. Traditional data preparation creates a bottleneck Business teams have complex data sources for analytics projects
  • 11. Traditional data preparation creates a bottleneck Business teams funnel their requirements to IT IT-centric data preparation Business Information
  • 12. Traditional data preparation creates a bottleneck IT runs requirements through a linear ETL process executed with manual scripting or coding IT-Centric Data Preparation Model Extract Transform Load Optimize Business Information
  • 13. Traditional data preparation creates a bottleneck IT reviews with business. Makes changes, fixes errors. (Repeat) IT-Centric Data Preparation Model Extract Transform Load Optimize Business Information
  • 14. Business teams make decisions before data is available -or- Ask for changes and restart the process. IT-Centric Data Preparation Model Extract Transform Load Optimize Business Information Traditional data preparation creates a bottleneck
  • 15. Designed for highly specialized technical people to prepare data for business teams IT-Centric Data Preparation Model Extract Transform Load Optimize Business Information Traditional data preparation creates a bottleneck
  • 16. Designing for highly specialized technical people to prepare data for business teams. Expensive Complicated Error-prone Time-consuming
  • 18. Modern architecture: balancing freedom with responsibility Built for business •Freedom and flexibility with collaboration
  • 19. Modern architecture: balancing freedom with responsibility Collect and manage data Time Built for business •Freedom and flexibility with collaboration Enabled by IT •Data governance, scale, efficiency
  • 20. Modern information pipeline is Built for business Freedom and flexibility with collaboration Enabled by IT Data governance, scale, efficiency
  • 21. Data prep must address the range of information workers
  • 22. Data prep must address the range of information workers Source: Forrester Research, Inc., “Info Workers Will Erase The Boundary Between Enterprise and Consumer Technologies,” August 30, 2012 Deep Technical Skills Limited Technical Skills Data Scientist Data Developer Data Analyst Business Analyst Information Worker
  • 23. Data prep must address the range of information workers Source: Forrester Research, Inc., “Info Workers Will Erase The Boundary Between Enterprise and Consumer Technologies,” August 30, 2012 Deep Technical Skills Limited Technical Skills Data Scientist (200K) Data Developer (600K) Data Analyst (100M) Business Analyst (275M) Information Worker (460M)
  • 24. Paxata accelerates the data to information pipeline
  • 25. Data Lake Enterprise Local Paxata accelerates the data to information pipeline
  • 26. Data Lake Enterprise Local Paxata accelerates the data to information pipeline
  • 27. Data Lake Enterprise Local Paxata accelerates the data to information pipeline BI/Visualization Predictive
  • 28. Data Lake Enterprise Local Paxata accelerates the data to information pipeline BI/Visualization Predictive Consumer experience for preparing data
  • 29. Architecture of the Paxata Adaptive Information Platform
  • 30. Architecture of the Paxata Adaptive Information Platform
  • 31. Contact us Paxata in the apps gallery Register for Paxata Live: www.paxata.com/events info@paxata.com www.youtube.com/PaxataTV www.paxata.com
  • 32. December 8, 2016© Paxata, Inc. 32 Thank You!

Editor's Notes

  • #4: Deliver the only enterprise-grade data preparation platform that lets everyone transform raw, meaningless data into valuable, contextual and complete information
  • #5: To seize the opportunity you must cross this data chasm. Why…Because its hard Traditional, legacy technologies and processes that companies currently leverage were NOT designed for the variety and volume of data that companies are working with today. Companies need to be more nimble We have many customers that have 10’s of Millions invested annual in traditional ETL processes, and they were still spending too much time preparing data and not on the value added tasks of analytics. They selected Paxata to help complement these technologies and fill the gaps with a more exploratory, interactive experience.
  • #6: To seize the opportunity you must cross this data chasm. Why…Because its hard Traditional, legacy technologies and processes that companies currently leverage were NOT designed for the variety and volume of data that companies are working with today. Companies need to be more nimble We have many customers that have 10’s of Millions invested annual in traditional ETL processes, and they were still spending too much time preparing data and not on the value added tasks of analytics. They selected Paxata to help complement these technologies and fill the gaps with a more exploratory, interactive experience.
  • #7: To seize the opportunity you must cross this data chasm. Why…Because its hard Traditional, legacy technologies and processes that companies currently leverage were NOT designed for the variety and volume of data that companies are working with today. Companies need to be more nimble We have many customers that have 10’s of Millions invested annual in traditional ETL processes, and they were still spending too much time preparing data and not on the value added tasks of analytics. They selected Paxata to help complement these technologies and fill the gaps with a more exploratory, interactive experience.
  • #8: To seize the opportunity you must cross this data chasm. Why…Because its hard Traditional, legacy technologies and processes that companies currently leverage were NOT designed for the variety and volume of data that companies are working with today. Companies need to be more nimble We have many customers that have 10’s of Millions invested annual in traditional ETL processes, and they were still spending too much time preparing data and not on the value added tasks of analytics. They selected Paxata to help complement these technologies and fill the gaps with a more exploratory, interactive experience.
  • #9: Deliver the only enterprise-grade data preparation platform that lets everyone transform raw, meaningless data into valuable, contextual and complete information
  • #11: Visual Data Discovery Tools – people had a hunger to get at and dig into their data – traditional small spreadsheets or databases 1. Business teams funnel their data requirements to IT 2. IT runs requirements through linear ETL process, executed with manual scripting or coding 3. IT reviews with business, makes changes, fixes errors. Repeats this cycle. 4. By then, business teams make decisions long before data is available or they ask for changes and re-start the process Traditional Technologies Do Not Meet Today’s Needs Batch, Complicated, No Visibility, IT Only, Time Consuming, Error Prone, Expensive Legacy infrastructure for data preparation was never designed to scale to the orders of magnitude more data and the orders of magnitude more consumers of today’s information-driven world. A model in which a small set of highly skilled IT data scientists and data developers take business requirements and then execute a highly prescribed, lengthy, waterfall process for preparing data only to more often than not realize that they missed the mark as they lack the business context, is not a viable model.
  • #12: Visual Data Discovery Tools – people had a hunger to get at and dig into their data – traditional small spreadsheets or databases 1. Business teams funnel their data requirements to IT 2. IT runs requirements through linear ETL process, executed with manual scripting or coding 3. IT reviews with business, makes changes, fixes errors. Repeats this cycle. 4. By then, business teams make decisions long before data is available or they ask for changes and re-start the process Traditional Technologies Do Not Meet Today’s Needs Batch, Complicated, No Visibility, IT Only, Time Consuming, Error Prone, Expensive Legacy infrastructure for data preparation was never designed to scale to the orders of magnitude more data and the orders of magnitude more consumers of today’s information-driven world. A model in which a small set of highly skilled IT data scientists and data developers take business requirements and then execute a highly prescribed, lengthy, waterfall process for preparing data only to more often than not realize that they missed the mark as they lack the business context, is not a viable model.
  • #13: Visual Data Discovery Tools – people had a hunger to get at and dig into their data – traditional small spreadsheets or databases 1. Business teams funnel their data requirements to IT 2. IT runs requirements through linear ETL process, executed with manual scripting or coding 3. IT reviews with business, makes changes, fixes errors. Repeats this cycle. 4. By then, business teams make decisions long before data is available or they ask for changes and re-start the process Traditional Technologies Do Not Meet Today’s Needs Batch, Complicated, No Visibility, IT Only, Time Consuming, Error Prone, Expensive Legacy infrastructure for data preparation was never designed to scale to the orders of magnitude more data and the orders of magnitude more consumers of today’s information-driven world. A model in which a small set of highly skilled IT data scientists and data developers take business requirements and then execute a highly prescribed, lengthy, waterfall process for preparing data only to more often than not realize that they missed the mark as they lack the business context, is not a viable model.
  • #14: Visual Data Discovery Tools – people had a hunger to get at and dig into their data – traditional small spreadsheets or databases 1. Business teams funnel their data requirements to IT 2. IT runs requirements through linear ETL process, executed with manual scripting or coding 3. IT reviews with business, makes changes, fixes errors. Repeats this cycle. 4. By then, business teams make decisions long before data is available or they ask for changes and re-start the process Traditional Technologies Do Not Meet Today’s Needs Batch, Complicated, No Visibility, IT Only, Time Consuming, Error Prone, Expensive Legacy infrastructure for data preparation was never designed to scale to the orders of magnitude more data and the orders of magnitude more consumers of today’s information-driven world. A model in which a small set of highly skilled IT data scientists and data developers take business requirements and then execute a highly prescribed, lengthy, waterfall process for preparing data only to more often than not realize that they missed the mark as they lack the business context, is not a viable model.
  • #15: Visual Data Discovery Tools – people had a hunger to get at and dig into their data – traditional small spreadsheets or databases 1. Business teams funnel their data requirements to IT 2. IT runs requirements through linear ETL process, executed with manual scripting or coding 3. IT reviews with business, makes changes, fixes errors. Repeats this cycle. 4. By then, business teams make decisions long before data is available or they ask for changes and re-start the process Traditional Technologies Do Not Meet Today’s Needs Batch, Complicated, No Visibility, IT Only, Time Consuming, Error Prone, Expensive Legacy infrastructure for data preparation was never designed to scale to the orders of magnitude more data and the orders of magnitude more consumers of today’s information-driven world. A model in which a small set of highly skilled IT data scientists and data developers take business requirements and then execute a highly prescribed, lengthy, waterfall process for preparing data only to more often than not realize that they missed the mark as they lack the business context, is not a viable model.
  • #16: Visual Data Discovery Tools – people had a hunger to get at and dig into their data – traditional small spreadsheets or databases 1. Business teams funnel their data requirements to IT 2. IT runs requirements through linear ETL process, executed with manual scripting or coding 3. IT reviews with business, makes changes, fixes errors. Repeats this cycle. 4. By then, business teams make decisions long before data is available or they ask for changes and re-start the process Traditional Technologies Do Not Meet Today’s Needs Batch, Complicated, No Visibility, IT Only, Time Consuming, Error Prone, Expensive Legacy infrastructure for data preparation was never designed to scale to the orders of magnitude more data and the orders of magnitude more consumers of today’s information-driven world. A model in which a small set of highly skilled IT data scientists and data developers take business requirements and then execute a highly prescribed, lengthy, waterfall process for preparing data only to more often than not realize that they missed the mark as they lack the business context, is not a viable model.
  • #19: Slide use: problem of data (option 4) This is a five-part slide. Use this along with the 4 slides before it. Talking Points: Big Data and self-service analytics necessitate a fundamental transformation from an IT-centric data preparation process to a self-service data preparation model. In the self-service model, the steps that make of data preparation – data integration, quality, cleansing, enrichment and shaping don’t go away, they need to be re-imagined in a way that enables the business or data analyst to accomplish these tasks on their own which in turn empowers them to work with vertical slices of relevant data and get the results they want, when they need them. However, it’s important that the self-service model also provide the governance and traceability that IT requires to maintain trust in data and analytic results. In this new model, IT’s role changes to collection and centralization of access to raw data and to providing the right infrastructure to the business that drive self-service data preparation and analytics, while maintaining full governance.
  • #20: Slide use: problem of data (option 4) This is a five-part slide. Use this along with the 4 slides before it. Talking Points: Big Data and self-service analytics necessitate a fundamental transformation from an IT-centric data preparation process to a self-service data preparation model. In the self-service model, the steps that make of data preparation – data integration, quality, cleansing, enrichment and shaping don’t go away, they need to be re-imagined in a way that enables the business or data analyst to accomplish these tasks on their own which in turn empowers them to work with vertical slices of relevant data and get the results they want, when they need them. However, it’s important that the self-service model also provide the governance and traceability that IT requires to maintain trust in data and analytic results. In this new model, IT’s role changes to collection and centralization of access to raw data and to providing the right infrastructure to the business that drive self-service data preparation and analytics, while maintaining full governance.
  • #23: Slide use: Who are the data analysts Talking points: This pyramid describes the typical information work roles in today’s enterprises and highlights the dramatic scale that self-service data preparation can bring. Legacy and many Big Data tools target the Data Scientist and the Data Developer, but as you can see there are hugely more data analysts our there, and self-service data prep empowers them to drive their own data destiny, breaking the logjam of traditional IT-constrained ETL and data preparation. By Data Analysts, we are referring to Power Excel users or Tableau users who understand data and analytics, but don’t write code or scripts. For self-service data prep to truly transform an organization, it must empower the data analyst; however, self-service data prep simplifies many traditionally complex and time-consuming preparation operations and the work of data scientists and data developers can be dramatically accelerated by self-service data prep. Source: Prakash VC deck
  • #24: Slide use: Who are the data analysts Talking points: This pyramid describes the typical information work roles in today’s enterprises and highlights the dramatic scale that self-service data preparation can bring. Legacy and many Big Data tools target the Data Scientist and the Data Developer, but as you can see there are hugely more data analysts our there, and self-service data prep empowers them to drive their own data destiny, breaking the logjam of traditional IT-constrained ETL and data preparation. By Data Analysts, we are referring to Power Excel users or Tableau users who understand data and analytics, but don’t write code or scripts. For self-service data prep to truly transform an organization, it must empower the data analyst; however, self-service data prep simplifies many traditionally complex and time-consuming preparation operations and the work of data scientists and data developers can be dramatically accelerated by self-service data prep.
  • #25: Deliver the only enterprise-grade data preparation platform that lets everyone transform raw, meaningless data into valuable, contextual and complete information