SlideShare a Scribd company logo
PROJECT MANAGEMENT &
BIG DATA ANALYTICS
Sandeep Kumar PMP®
INTRODUCTION
IT Strategy & Business Transformation
Industries:
Media & Advertising, Telecom, BFSI,
FMCG, Manufacturing, BPO/KPO
Services:
Shared Service Delivery, GIC & Back
Office, PMO, Lean Six Sigma, Continuous
Improvement, Enterprise IT, ERP, Cloud &
Infrastructure, Development, Outsourcing
and IT Security & Governance
TWO STREAMS
Project Management of
Analytics
• Big Data
• Data Warehousing
• Lean Six Sigma
• In-memory computing
• Internet of things
• Social Media
Analytics of Project
Management
• Time-Cost-Spec Analytics
• Feasibility Analytics
• Resource Analytics
• Management of Collaboration
• Agile & SCRUM
APOLOGIES, DISCLAIMERS, ET AL
• Big Data is over-hyped
• Big Data is still evolving
• Analytics is old, the tools are new!
• Project Management solves most of the problems
• Its importance is usually understated
• The success of Big Data initiative lies primarily on the management, then
on the PM & the DS
• Hold the PM responsible only if you know what you want!
• The roles I talk about here are essentially with respect Big Data Projects
If you have
Questions,
I will try &
Answer
them!
UNDER SCANNER !
Big Data / Analytics
Myths
• It is mature and cool
• Is an extension of EDW
• Data Quality can be slightly compromised
• A single pre-built technology (e.g. Hadoop) will suffice
• Data scientists are easy to get
• Virtualization / Clustering will take care of infra needs
• If you have huge data, every solution is a Big Data solution
Project Management
Myths
• Managing only activities
• Just time and cost management
• Mere resource allocation
• Reaching the finishing-line!
• General management suffices
• Have time to learn
UNDER SCANNER !
Big Data / Analytics
Facts
• Responsible for Business Case and ROI
definitions
• Executive Sponsorship & Funds
• ‘Real’ Resource Provisioning
• Based on Enterprise Architecture
• Highly complex and iterative process
• Loads of scientific knowledge required
• Source of data increases every day
• Should be able to adapt with time
Project Management
Facts
• Responsible for Scope & acceptance by all
parties
• Direction setting & KPI-SF definitions
• ‘Real’ Resource Management
• Right to procure & deploy the appropriate
resource
• Stakeholder & Communication management
• Accountability and Responsibility for the success
(and failure)
THE DATA SCIENTIST
Key skills of a Data Scientist – the hard skills guy
• Basic Tools: Knowledge of statistical programming languages, like R or Python, and SQL
• Basic Statistics: Familiar with statistical tests, distributions, maximum likelihood estimators, etc.
• ETL Tools: Best in class like Informatica, IBM Infosphere, SAP BO, Oracle or SAS Data Integrator, Penta-ho, AB-Initio
• Machine Learning / Artificial Intelligence / Pattern Recognition: Methods for Classification and Regression like k-nearest
neighbours, random forests, etc.
• Multivariable Calculus & Linear Algebra: Specially required where data is used for predictive performance or algorithm
optimization
• Data Munging / Scrubbing or Cleanliness: For example inconsistent string formatting as ND or Del for New Delhi; date
alignment as [mm-dd-yyyy] or [dd-mm-yyyy] or [yyyy-dd-mm]
• Data Visualization & Communication tools: Principles of and tools of Data Visualization like ggplot and d3.js.
• Software Engineering: Strong software engineering background, SDLC, Agile, Scrum, DB techniques, Data intensive product development
• Software Testing skills – To make sure the output delivers what the business needs
• Basic Project Management Skills: Thinking like a Project Manager
THE PROJECT MANAGER
Key skills of a Project Manager – the soft skills guy
• Project Charter: Project Stakeholders and Objectives documented and signed-off
• Business Case: Asks the ‘whats’ and ‘whys’ of the business requirement
• Scheduling Tools: Creates a Plan of Action to answer the ‘hows’ of the project
• Vendor Management: Links up all 1st and 3rd party resources
• Risk Management: The real management tool, with the mitigant
• Communication Management: The core of collaboration and Management
• Software Engineering: Software engineering background with fair knowledge of tools
• Software Testing skills: To make sure the output delivers what the business needs
• Basic Data Management Skills: Thinking like a Data Scientist
MERGING ROLES
The Data Scientist
 The Requirements guy
 The Data Tools guy
 The Resource guy
 The Specialist
 The Enterprise Architect
 The Software guy
Breaking the Technical Barriers
The Project Manager
 The Requirements & Scope guy
 The Project Tools guy
 The Resource guy
 The Generalist
 The Program Manager
 The hardware & Software guy
Breaking the Cultural Barriers
WHY DO ANALYTICS
PROJECTS FAIL ?
When do Projects fail, in general?
• Not completed within budgets
• Not completed on time
• Not completed as per specifications
Whys and Wherefores…
• Poor scoping – unclear objective
• Inadequate resources – lack of talent
• Inappropriate Solution – wrong tool selection
• Bad planning – Insufficient analysis
• Bad execution – poor Project Management
“3 out of 4 Big Data Projects fail”
• Inaccurate Project Scope
• Lack of Talent
• Challenging Tools
• Even more Challenging Concepts
• Poor Planning
• Ownership Issues – Business
Initiative or IT Project?
Apache Hadoop vs. Apache Cassandra
55% of Big Data projects don’t get completed; in case of IT projects in general, it is only 25%.
THE SUCCESS MANTRA!
• Get a Data Scientist at the PM or SME
• Or, at least get the Data Scientist as one of the leads
• Ask very tough questions to sponsors for a Business Case
• Get the CFO and the End-User on your side – get the expectations right
• Take your time – use appropriate Project Management methodologies
• Use the most appropriate Platform / Tool
• Accept requirements’ volatility
• Scrum: accepting that the problem cannot be fully understood or defined, focusing instead on maximizing the team's ability to deliver quickly
• POC  Deployment  Acceptance  Enhancement  Deployment … …
• Agile: adaptive planning, evolutionary development, early delivery, continuous improvement
• Document and Handover to End-User at every stage
PM answers “How” as long as the Business knows “Why & What”
QUESTIONS, ANY?

More Related Content

PDF
BIG DATA WORKBOOK OCT 2015
PPTX
Big Data Project Manager
PDF
Big Data Maturity Model and Governance
PDF
The Heart of Data Modeling: 7 Ways Your Agile Project is Managing Data Wrong
PDF
Agile & Data Modeling – How Can They Work Together?
PDF
Focus on Your Analysis, Not Your SQL Code
PDF
The Big Data Dream Team
PDF
DI&A Webinar: Building a Flexible and Scalable Analytics Architecture
BIG DATA WORKBOOK OCT 2015
Big Data Project Manager
Big Data Maturity Model and Governance
The Heart of Data Modeling: 7 Ways Your Agile Project is Managing Data Wrong
Agile & Data Modeling – How Can They Work Together?
Focus on Your Analysis, Not Your SQL Code
The Big Data Dream Team
DI&A Webinar: Building a Flexible and Scalable Analytics Architecture

What's hot (20)

PDF
Building a Collaborative Data Architecture
PDF
DAS Slides: Emerging Trends in Data Architecture – What’s the Next Big Thing?
PDF
Data-Ed: Data Architecture Requirements
PDF
DI&A Slides: Data-Centric Development
PPTX
Enterprise Data Architecture Deliverables
PDF
Back to Square One: Building a Data Science Team from Scratch
PDF
Data-Ed Online Webinar: Data Architecture Requirements
PDF
DI&A Slides: Data Lake vs. Data Warehouse
PDF
The Emerging Role of the Data Lake
PDF
How to Consume Your Data for AI
PDF
Slides: How AI Makes Analytics More Human
PDF
Building a data fluent organization
PDF
The Key to Big Data Modeling: Collaboration
PPTX
Keynote talk at Financial Times Forum - BigData and Advanced Analytics at SIB...
PDF
Data Strategy Best Practices
PDF
Building New Data Ecosystem for Customer Analytics, Strata + Hadoop World, 2016
PDF
Getting Started with Data Stewardship
PDF
Modeling Webinar: State of the Union for Data Innovation - 2016
PDF
How to Build Data Science Teams
Building a Collaborative Data Architecture
DAS Slides: Emerging Trends in Data Architecture – What’s the Next Big Thing?
Data-Ed: Data Architecture Requirements
DI&A Slides: Data-Centric Development
Enterprise Data Architecture Deliverables
Back to Square One: Building a Data Science Team from Scratch
Data-Ed Online Webinar: Data Architecture Requirements
DI&A Slides: Data Lake vs. Data Warehouse
The Emerging Role of the Data Lake
How to Consume Your Data for AI
Slides: How AI Makes Analytics More Human
Building a data fluent organization
The Key to Big Data Modeling: Collaboration
Keynote talk at Financial Times Forum - BigData and Advanced Analytics at SIB...
Data Strategy Best Practices
Building New Data Ecosystem for Customer Analytics, Strata + Hadoop World, 2016
Getting Started with Data Stewardship
Modeling Webinar: State of the Union for Data Innovation - 2016
How to Build Data Science Teams
Ad

Viewers also liked (18)

PPTX
BDM - project management in big data context.pptx
PPTX
Project management for Big Data projects
PDF
Big data Hadoop Analytic and Data warehouse comparison guide
PDF
Meetup BigData et Machine Learning
PDF
Présentation Meetup #2
PDF
Yarn optimization (Real life use case)
PPTX
Introduction project managemen
PPTX
Lan yogi-nipa’s organizational analysis
PDF
Big data initiative justification and prioritization framework
PDF
The Ecosystem is too damn big
PDF
Big data project management
PDF
Big Data Agile Analytics by Ken Collier - Director Agile Analytics, Thoughtwo...
PDF
How to use Innovative Architectures for Digital Enterprises
PDF
Digital Transformation, Enterprise Architecture, Big Data by Danairat
PDF
Mobile App Development- Project Management Process
PDF
Online shopping
KEY
Intro to Data Science for Enterprise Big Data
PPTX
Digital transformation: New purpose for enterprise architecture
BDM - project management in big data context.pptx
Project management for Big Data projects
Big data Hadoop Analytic and Data warehouse comparison guide
Meetup BigData et Machine Learning
Présentation Meetup #2
Yarn optimization (Real life use case)
Introduction project managemen
Lan yogi-nipa’s organizational analysis
Big data initiative justification and prioritization framework
The Ecosystem is too damn big
Big data project management
Big Data Agile Analytics by Ken Collier - Director Agile Analytics, Thoughtwo...
How to use Innovative Architectures for Digital Enterprises
Digital Transformation, Enterprise Architecture, Big Data by Danairat
Mobile App Development- Project Management Process
Online shopping
Intro to Data Science for Enterprise Big Data
Digital transformation: New purpose for enterprise architecture
Ad

Similar to Project management for Big Data projects (20)

PDF
Data-Ed Webinar: Data Architecture Requirements
PPTX
Lessons after working as a data scientist for 1 year
PDF
Advanced Project Data Analytics for Improved Project Delivery
PDF
Lean Analytics: How to get more out of your data science team
PPT
IWMW 2004: It Always Takes Longer Than You Think (Even If You Think It Will T...
DOC
Ajaykumar Reddy - Resume
PDF
Intranet Design - How To Undertake An Intranet Redesign
PPTX
Managing Data Science | Lessons from the Field
PDF
II-SDV 2015, 20 - 21 April, in Nice
PPTX
Adding Hadoop to Your Analytics Mix?
PDF
Getting Data Quality Right
PPTX
Technology Consulting by Prasanna
PPTX
Career Sessions OM SCM
PPTX
RACI_Product_Services_Training.ChainSys.
PPTX
Career Conversation Technology Consulting
PPT
PPTX
OM SCM
PPTX
Career Sessions OPS/SCM
PPTX
Operations and Supply Chain CC
PPTX
Leveraging Cloud Technologies to Boost Your Start Up
Data-Ed Webinar: Data Architecture Requirements
Lessons after working as a data scientist for 1 year
Advanced Project Data Analytics for Improved Project Delivery
Lean Analytics: How to get more out of your data science team
IWMW 2004: It Always Takes Longer Than You Think (Even If You Think It Will T...
Ajaykumar Reddy - Resume
Intranet Design - How To Undertake An Intranet Redesign
Managing Data Science | Lessons from the Field
II-SDV 2015, 20 - 21 April, in Nice
Adding Hadoop to Your Analytics Mix?
Getting Data Quality Right
Technology Consulting by Prasanna
Career Sessions OM SCM
RACI_Product_Services_Training.ChainSys.
Career Conversation Technology Consulting
OM SCM
Career Sessions OPS/SCM
Operations and Supply Chain CC
Leveraging Cloud Technologies to Boost Your Start Up

Recently uploaded (20)

PPTX
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
PDF
Fluorescence-microscope_Botany_detailed content
PDF
TRAFFIC-MANAGEMENT-AND-ACCIDENT-INVESTIGATION-WITH-DRIVING-PDF-FILE.pdf
PPTX
advance b rammar.pptxfdgdfgdfsgdfgsdgfdfgdfgsdfgdfgdfg
PDF
Clinical guidelines as a resource for EBP(1).pdf
PPTX
MODULE 8 - DISASTER risk PREPAREDNESS.pptx
PPTX
IB Computer Science - Internal Assessment.pptx
PPTX
IBA_Chapter_11_Slides_Final_Accessible.pptx
PPTX
Business Ppt On Nestle.pptx huunnnhhgfvu
PPTX
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
PPTX
1_Introduction to advance data techniques.pptx
PPTX
Business Acumen Training GuidePresentation.pptx
PPTX
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
PPTX
01_intro xxxxxxxxxxfffffffffffaaaaaaaaaaafg
PDF
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
PPTX
ALIMENTARY AND BILIARY CONDITIONS 3-1.pptx
PDF
BF and FI - Blockchain, fintech and Financial Innovation Lesson 2.pdf
PPT
Quality review (1)_presentation of this 21
PPTX
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
Fluorescence-microscope_Botany_detailed content
TRAFFIC-MANAGEMENT-AND-ACCIDENT-INVESTIGATION-WITH-DRIVING-PDF-FILE.pdf
advance b rammar.pptxfdgdfgdfsgdfgsdgfdfgdfgsdfgdfgdfg
Clinical guidelines as a resource for EBP(1).pdf
MODULE 8 - DISASTER risk PREPAREDNESS.pptx
IB Computer Science - Internal Assessment.pptx
IBA_Chapter_11_Slides_Final_Accessible.pptx
Business Ppt On Nestle.pptx huunnnhhgfvu
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
1_Introduction to advance data techniques.pptx
Business Acumen Training GuidePresentation.pptx
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
01_intro xxxxxxxxxxfffffffffffaaaaaaaaaaafg
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
ALIMENTARY AND BILIARY CONDITIONS 3-1.pptx
BF and FI - Blockchain, fintech and Financial Innovation Lesson 2.pdf
Quality review (1)_presentation of this 21
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...

Project management for Big Data projects

  • 1. PROJECT MANAGEMENT & BIG DATA ANALYTICS Sandeep Kumar PMP®
  • 2. INTRODUCTION IT Strategy & Business Transformation Industries: Media & Advertising, Telecom, BFSI, FMCG, Manufacturing, BPO/KPO Services: Shared Service Delivery, GIC & Back Office, PMO, Lean Six Sigma, Continuous Improvement, Enterprise IT, ERP, Cloud & Infrastructure, Development, Outsourcing and IT Security & Governance
  • 3. TWO STREAMS Project Management of Analytics • Big Data • Data Warehousing • Lean Six Sigma • In-memory computing • Internet of things • Social Media Analytics of Project Management • Time-Cost-Spec Analytics • Feasibility Analytics • Resource Analytics • Management of Collaboration • Agile & SCRUM
  • 4. APOLOGIES, DISCLAIMERS, ET AL • Big Data is over-hyped • Big Data is still evolving • Analytics is old, the tools are new! • Project Management solves most of the problems • Its importance is usually understated • The success of Big Data initiative lies primarily on the management, then on the PM & the DS • Hold the PM responsible only if you know what you want! • The roles I talk about here are essentially with respect Big Data Projects If you have Questions, I will try & Answer them!
  • 5. UNDER SCANNER ! Big Data / Analytics Myths • It is mature and cool • Is an extension of EDW • Data Quality can be slightly compromised • A single pre-built technology (e.g. Hadoop) will suffice • Data scientists are easy to get • Virtualization / Clustering will take care of infra needs • If you have huge data, every solution is a Big Data solution Project Management Myths • Managing only activities • Just time and cost management • Mere resource allocation • Reaching the finishing-line! • General management suffices • Have time to learn
  • 6. UNDER SCANNER ! Big Data / Analytics Facts • Responsible for Business Case and ROI definitions • Executive Sponsorship & Funds • ‘Real’ Resource Provisioning • Based on Enterprise Architecture • Highly complex and iterative process • Loads of scientific knowledge required • Source of data increases every day • Should be able to adapt with time Project Management Facts • Responsible for Scope & acceptance by all parties • Direction setting & KPI-SF definitions • ‘Real’ Resource Management • Right to procure & deploy the appropriate resource • Stakeholder & Communication management • Accountability and Responsibility for the success (and failure)
  • 7. THE DATA SCIENTIST Key skills of a Data Scientist – the hard skills guy • Basic Tools: Knowledge of statistical programming languages, like R or Python, and SQL • Basic Statistics: Familiar with statistical tests, distributions, maximum likelihood estimators, etc. • ETL Tools: Best in class like Informatica, IBM Infosphere, SAP BO, Oracle or SAS Data Integrator, Penta-ho, AB-Initio • Machine Learning / Artificial Intelligence / Pattern Recognition: Methods for Classification and Regression like k-nearest neighbours, random forests, etc. • Multivariable Calculus & Linear Algebra: Specially required where data is used for predictive performance or algorithm optimization • Data Munging / Scrubbing or Cleanliness: For example inconsistent string formatting as ND or Del for New Delhi; date alignment as [mm-dd-yyyy] or [dd-mm-yyyy] or [yyyy-dd-mm] • Data Visualization & Communication tools: Principles of and tools of Data Visualization like ggplot and d3.js. • Software Engineering: Strong software engineering background, SDLC, Agile, Scrum, DB techniques, Data intensive product development • Software Testing skills – To make sure the output delivers what the business needs • Basic Project Management Skills: Thinking like a Project Manager
  • 8. THE PROJECT MANAGER Key skills of a Project Manager – the soft skills guy • Project Charter: Project Stakeholders and Objectives documented and signed-off • Business Case: Asks the ‘whats’ and ‘whys’ of the business requirement • Scheduling Tools: Creates a Plan of Action to answer the ‘hows’ of the project • Vendor Management: Links up all 1st and 3rd party resources • Risk Management: The real management tool, with the mitigant • Communication Management: The core of collaboration and Management • Software Engineering: Software engineering background with fair knowledge of tools • Software Testing skills: To make sure the output delivers what the business needs • Basic Data Management Skills: Thinking like a Data Scientist
  • 9. MERGING ROLES The Data Scientist  The Requirements guy  The Data Tools guy  The Resource guy  The Specialist  The Enterprise Architect  The Software guy Breaking the Technical Barriers The Project Manager  The Requirements & Scope guy  The Project Tools guy  The Resource guy  The Generalist  The Program Manager  The hardware & Software guy Breaking the Cultural Barriers
  • 10. WHY DO ANALYTICS PROJECTS FAIL ? When do Projects fail, in general? • Not completed within budgets • Not completed on time • Not completed as per specifications Whys and Wherefores… • Poor scoping – unclear objective • Inadequate resources – lack of talent • Inappropriate Solution – wrong tool selection • Bad planning – Insufficient analysis • Bad execution – poor Project Management “3 out of 4 Big Data Projects fail” • Inaccurate Project Scope • Lack of Talent • Challenging Tools • Even more Challenging Concepts • Poor Planning • Ownership Issues – Business Initiative or IT Project? Apache Hadoop vs. Apache Cassandra 55% of Big Data projects don’t get completed; in case of IT projects in general, it is only 25%.
  • 11. THE SUCCESS MANTRA! • Get a Data Scientist at the PM or SME • Or, at least get the Data Scientist as one of the leads • Ask very tough questions to sponsors for a Business Case • Get the CFO and the End-User on your side – get the expectations right • Take your time – use appropriate Project Management methodologies • Use the most appropriate Platform / Tool • Accept requirements’ volatility • Scrum: accepting that the problem cannot be fully understood or defined, focusing instead on maximizing the team's ability to deliver quickly • POC  Deployment  Acceptance  Enhancement  Deployment … … • Agile: adaptive planning, evolutionary development, early delivery, continuous improvement • Document and Handover to End-User at every stage PM answers “How” as long as the Business knows “Why & What”