SlideShare a Scribd company logo
7-Step Big Data Journey
for Enterprises
1www.BigDataTrunk.com
www.BigDataTrunk.com
Raju Shreewastava
Engineer/Architect by Profession
Teacher at Heart
Artist in Soul
2
Working : Big Data Consulting and Training
www.BigDataTrunk.com 3
www.BigDataTrunk.com
1. Ask Why?
4
www.BigDataTrunk.com
1. Ask Why? – Typical Reasons
5
C-staff Competition Big data Problem
www.BigDataTrunk.com
Three Real Reasons
6
7
8
www.BigDataTrunk.com
Next Frontier
Artificial Intelligence Internet of Things ( IOT)
9
www.BigDataTrunk.com
STEP 1) Tips – Needs Commitment
a) Marathon
b) Business Driven
c) Not Free or Cheap
10
www.BigDataTrunk.com 11
www.BigDataTrunk.com 12
Blog Post : https://www.linkedin.com/pulse/open-source-different-beast-raju-shreewastava?trk=prof-post
www.BigDataTrunk.com 13
14
LearnabilityProcess
SME
15
Vacancy -Super Hero Engineer Needed
Hadoop, Hive ,Pig …
Spark
HBase, Cassandra,
MongoDB, or other NoSQL
Technology
Java, Python, or Scala
Kafka
Apache Phoenix
Cloud - AWS/Azure/Google
Solr or Elastic Search
16
Big Data Architect
New Hires
Engineers
Existing Staff
Engineers
Scrum Master
QA
SME..
www.BigDataTrunk.com 17
STEP 2) Tips – Change is the only Constant
a) Modular Design
b) Learnability
c) Hire Full team
www.BigDataTrunk.com 18
19
www.BigDataTrunk.com 20
www.BigDataTrunk.com
 Public Cloud
 Private Cloud
 Hybrid
21
www.BigDataTrunk.com
Agility
Cost
Maintenance
Scalability
22
www.BigDataTrunk.com
Security
Volume
Industry – Compliance
23
www.BigDataTrunk.com 24
STEP 3) Tips – Considerations to Cloud
a) Industry/Compliance
b) Company Strategy
c) Project Needs
www.BigDataTrunk.com 25
www.BigDataTrunk.com 26
www.BigDataTrunk.com
Common Services
27
Packaging Support Training
www.BigDataTrunk.com
4. Key Differentiators
Cloudera Hortonworks MapR Amazon EMR
Developing
Tools
• Cloudera
Manager
• Imapala
Staying Open
Source
Developing
Open Source
tools
• Ambari
Proprietary
route
E.g. MapRFS
High
Avalability
Cloud
Offering
Integration to
other AWS
Services
28
www.BigDataTrunk.com 29
STEP 4) Tips – Selection
a) Apache Hadoop is Mother
b) Align with Company Strategy
Apache Hadoop
www.BigDataTrunk.com 30
STEP 5. Take Test Drive (Pilot , not just POC)
www.BigDataTrunk.com
POC Vs Pilot
POC Pilot
• Will this technology meet our needs?
• Will this product perform as advertised?
• Will the prospective end user
communities be productive with the new
way of doing things?
• Will the ultimate solution be feasible?
• Detailed Architectural Design based on an
assessment of business and technical
requirements
• Environment build and configuration
• Infrastructure testing to validate failover,
high availability, and possibly scalability
• User testing and iterative feedback to
optimize the user experience
• Documentation or training for pilot users
and the help desk
31
www.BigDataTrunk.com 32
STEP 5) Tips – Take the plunge
a) First Use Case
b) Trial and Error
www.BigDataTrunk.com 33
www.BigDataTrunk.com
Big data is here to Stay
Focus on Business Value
Don’t underestimate Data Integration
Hiring is a Big Challenge
Be ready to Re-invent yourself
34
www.BigDataTrunk.com 35
STEP 6) Tips – Adaptation
a) R&D – New technology
b) 6 months review
www.BigDataTrunk.com 36
Taking to the Enterprise Level
www.BigDataTrunk.com
7. Taking to the Enterprise (Tips)
Governance Monitoring
Usage Analysis Automation
Enterprise Scale
37
www.BigDataTrunk.com
1. Ask Why?
2. Build Team
3. To cloud or
Not?
4. Pick your
distribution
5. Perform a
pilot
6. Fail , Learn
and Improve
7. Take it to
the Enterprise
38
www.BigDataTrunk.com 39
slideshare.net/RajuShreewastavaPMP
www.BigDataTrunk.com 40
www.BigDataTrunk.com
Thank you
41

More Related Content

PPTX
The Journey to Big Data Analytics
PPTX
Customer Journey Analytics and Big Data
PPTX
Customer Journey Analytics
PDF
Stewarding Data : Why Financial Services Firms Need a Chief Data Officier
PPTX
How analytics will transform banking in luxembourg
PDF
Cognizant Analytics for Banking & Financial Services Firms
PDF
Raising Your Digital Quotient - McKinsey
PPTX
Data Driven Disruption - Why Marketing and Advertising in WA lags - ADMA WA 2...
The Journey to Big Data Analytics
Customer Journey Analytics and Big Data
Customer Journey Analytics
Stewarding Data : Why Financial Services Firms Need a Chief Data Officier
How analytics will transform banking in luxembourg
Cognizant Analytics for Banking & Financial Services Firms
Raising Your Digital Quotient - McKinsey
Data Driven Disruption - Why Marketing and Advertising in WA lags - ADMA WA 2...

What's hot (20)

PDF
Big Data - an actuarial perspective
PDF
PwC: New IT Platform From Strategy Through Execution
PDF
Financial Technology Gartner Summit Briefing - Vin Malhotra, Partner Accenture
PDF
Technolony Vision 2016 - Primacy Of People First In A Digital World - Vin Mal...
PDF
Forecasting in a digital world
PDF
2015-16 Global Chief Procurement Officer Survey - CPO
PDF
Big Data Alchemy: How can Banks Maximize the Value of their Customer Data?
PDF
From Smart Meters to Smart Products: Reviewing Big Data driven Product Innova...
PDF
Big Data: Real-life examples of Business Value Generation with Cloudera
PDF
The Future of IT Infrastructure
PDF
Digital Leadership Series : Shawn O'Neal
PDF
Big Data: Real-life Examples of Business Value Generation
PPTX
Digital Migration - Telco
PDF
IBM Banking videocast - 3/20/2013
PDF
Pi cube banking on predictive analytics151
PDF
Virtual Reality in Financial Services (A Primer)
PDF
Retail Revolution: Thrive in Disruption
PDF
Digital disruption in CIB
PDF
Cloud Enabled Transformation In Insurance
PDF
From Business Idea to Successful Delivery by Serhiy Haziyev & Olha Hrytsay, S...
Big Data - an actuarial perspective
PwC: New IT Platform From Strategy Through Execution
Financial Technology Gartner Summit Briefing - Vin Malhotra, Partner Accenture
Technolony Vision 2016 - Primacy Of People First In A Digital World - Vin Mal...
Forecasting in a digital world
2015-16 Global Chief Procurement Officer Survey - CPO
Big Data Alchemy: How can Banks Maximize the Value of their Customer Data?
From Smart Meters to Smart Products: Reviewing Big Data driven Product Innova...
Big Data: Real-life examples of Business Value Generation with Cloudera
The Future of IT Infrastructure
Digital Leadership Series : Shawn O'Neal
Big Data: Real-life Examples of Business Value Generation
Digital Migration - Telco
IBM Banking videocast - 3/20/2013
Pi cube banking on predictive analytics151
Virtual Reality in Financial Services (A Primer)
Retail Revolution: Thrive in Disruption
Digital disruption in CIB
Cloud Enabled Transformation In Insurance
From Business Idea to Successful Delivery by Serhiy Haziyev & Olha Hrytsay, S...
Ad

Similar to 7 Steps Big Data Journey for Enterprises (20)

PDF
How to make your data scientists happy
PDF
Lunch and Learn: You have the data, now what?
PPTX
Essential Prerequisites for Maximizing Success from Big Data
PDF
Challenges of Executing AI
PDF
Machine Learning - why the hype and how it does its magic
PDF
Building Data Products with BigQuery for PPC and SEO (SMX 2022)
PDF
Building a Marketing Data Warehouse from Scratch - SMX Advanced 202
PDF
DevOps Days Rockies MLOps
PPTX
Maximising likelihood of success: Applying Product Management to AI/ML/DS pr...
PDF
Business Applications of Predictive Modeling at Scale - KDD 2016 Tutorial
PPTX
TechEvent DWH Modernization
PDF
Bimodal IT and EDW Modernization
PPTX
2022cindatatttpptlesson41647542012061.pptx
PDF
How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...
PDF
D365 power platform-user-group-deck-v02
PDF
Big Data Refinery: Distilling Value for User-Driven Analytics
PPTX
20180701 - 1st Meeting - Data Science Orientation
PPTX
AI Orange Belt - Session 3
PPTX
Productionalizing Machine Learning Models: The Good, the Bad, and the Ugly
PPTX
ITCamp 2019 - Andy Cross - Machine Learning with ML.NET and Azure Data Lake
How to make your data scientists happy
Lunch and Learn: You have the data, now what?
Essential Prerequisites for Maximizing Success from Big Data
Challenges of Executing AI
Machine Learning - why the hype and how it does its magic
Building Data Products with BigQuery for PPC and SEO (SMX 2022)
Building a Marketing Data Warehouse from Scratch - SMX Advanced 202
DevOps Days Rockies MLOps
Maximising likelihood of success: Applying Product Management to AI/ML/DS pr...
Business Applications of Predictive Modeling at Scale - KDD 2016 Tutorial
TechEvent DWH Modernization
Bimodal IT and EDW Modernization
2022cindatatttpptlesson41647542012061.pptx
How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...
D365 power platform-user-group-deck-v02
Big Data Refinery: Distilling Value for User-Driven Analytics
20180701 - 1st Meeting - Data Science Orientation
AI Orange Belt - Session 3
Productionalizing Machine Learning Models: The Good, the Bad, and the Ugly
ITCamp 2019 - Andy Cross - Machine Learning with ML.NET and Azure Data Lake
Ad

Recently uploaded (20)

PPT
Chapter 2 METAL FORMINGhhhhhhhjjjjmmmmmmmmm
PPTX
Introduction to Knowledge Engineering Part 1
PDF
.pdf is not working space design for the following data for the following dat...
PPTX
Business Ppt On Nestle.pptx huunnnhhgfvu
PDF
Foundation of Data Science unit number two notes
PDF
Mega Projects Data Mega Projects Data
PDF
Clinical guidelines as a resource for EBP(1).pdf
PPTX
Supervised vs unsupervised machine learning algorithms
PPT
Chapter 3 METAL JOINING.pptnnnnnnnnnnnnn
PDF
BF and FI - Blockchain, fintech and Financial Innovation Lesson 2.pdf
PPTX
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
PPTX
1_Introduction to advance data techniques.pptx
PPTX
Database Infoormation System (DBIS).pptx
PPTX
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...
PPTX
MODULE 8 - DISASTER risk PREPAREDNESS.pptx
PDF
Recruitment and Placement PPT.pdfbjfibjdfbjfobj
PPTX
mbdjdhjjodule 5-1 rhfhhfjtjjhafbrhfnfbbfnb
PPTX
CEE 2 REPORT G7.pptxbdbshjdgsgjgsjfiuhsd
PDF
TRAFFIC-MANAGEMENT-AND-ACCIDENT-INVESTIGATION-WITH-DRIVING-PDF-FILE.pdf
PDF
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
Chapter 2 METAL FORMINGhhhhhhhjjjjmmmmmmmmm
Introduction to Knowledge Engineering Part 1
.pdf is not working space design for the following data for the following dat...
Business Ppt On Nestle.pptx huunnnhhgfvu
Foundation of Data Science unit number two notes
Mega Projects Data Mega Projects Data
Clinical guidelines as a resource for EBP(1).pdf
Supervised vs unsupervised machine learning algorithms
Chapter 3 METAL JOINING.pptnnnnnnnnnnnnn
BF and FI - Blockchain, fintech and Financial Innovation Lesson 2.pdf
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
1_Introduction to advance data techniques.pptx
Database Infoormation System (DBIS).pptx
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...
MODULE 8 - DISASTER risk PREPAREDNESS.pptx
Recruitment and Placement PPT.pdfbjfibjdfbjfobj
mbdjdhjjodule 5-1 rhfhhfjtjjhafbrhfnfbbfnb
CEE 2 REPORT G7.pptxbdbshjdgsgjgsjfiuhsd
TRAFFIC-MANAGEMENT-AND-ACCIDENT-INVESTIGATION-WITH-DRIVING-PDF-FILE.pdf
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...

7 Steps Big Data Journey for Enterprises

Editor's Notes

  • #4: Slideshare, 7 steps , Break Myths , Tips Sharing