SlideShare a Scribd company logo
Planning & Project Management Fahri Firdausillah [M031010012]
Joke First, Serious Later Consultant:   So, your company is into data warehousing? How many data marts do you have? Project Manager:   Eleven. Consultant:   That’s great. But why so many? Project Manager:   Ten mistakes.
Defining the Business Requirements Chapter 5
Preamble OLTP and DW planning is different in term of requirements clarity Planning DW is about solving users’ problems and providing strategic information to the user. OLTP systems are primarily data capture systems. On the other hand, data warehouse systems are information delivery systems. Unlike an OLTP system, which is needed to run the day-to-day business, no immediate payout is seen in a decision support system.
Dimensional Analysis the users are generally unable to define their requirements clearly. For most of the users, this could be the very first data warehouse. How can you build something the users are unable to define clearly and precisely? Need different approach of requirements gathering.
Dimensional Analysis (cont'd) They can tell you what measurement units are important for them, how they combine the various pieces of information for strategic decision making. Although the actual proposed usage of a data warehouse could be unclear, the business dimensions used by the managers for decision making are not nebulous at all
Dimensional Analysis “in Action”
More Complex Dimensional Model
Information Packages The business dimensions and their hierarchical levels form the basis for all further development phases. The dimension hierarchies are the paths for drilling down or rolling up in our analysis
Requirements Gathering Methods Types of Questions Open Ended Question These open up options for interviewees to respond Closed Question These allow limited responses to interviewees Probes These are really follow-up questions. Probes may be used after open-ended or closed questions Questionnaires Group Session Interview
Sample Expectation from Interviews Senior Executives Dept. Managers IT Dept. Professionals Organization objectives Criteria for measuring success  Key business issues, current & future  Problem identification Vision and direction for the organization  Anticipated usage of the DW Departmental objectives  Success metrics  Factors limiting success  Key business issues  Products & Services  Useful business dimensions for analysis Anticipated usage of the DW  Key operational source systems  Current information delivery processes  Types of routine analysis Known quality issues Current IT support for information requests  Concerns about proposed DW
Adapting JAD Identify project objectives and limitations Identify critical success factors Define project deliverables  Define the schedule of workshop activities Select the participants Prepare the workshop material Organize workshop activities and exercises Prepare, inform, educate the workshop participants Coordinate workshop logistics 1. Project  Definition 2. Research 3. Preparation 4. JAD  Sessions 5. Final  Document
Requirement Definition: Scope & Content Requirements definition document is the basis for the next phases. Formal documentation will also validate your findings when reviewed with the users Data Sources Data Transformation Data Storage Information Delivery Information Package Diagrams
Requirements Definition Document Outline Introduction General Requirements Descriptions Specific Requirements Information Packages Other Requirements User Expectations User Participation and Sign-Off General Implementation Plan
Requirements as the Driving Force for Data Warehousing Chapter 6
Preamble If accurate requirements definition is important for any operational system, it is many times more important for a data warehouse extremely important that your datawarehouse contains the right elements of information in the most optimal formats Every task that is performed in every phase in the development of the data warehouse is determined by the requirements Every decision made during the design phase is totally influenced by the requirements.
Data Design
Data Design (cont'd) Structure for Business Dimensions Importance of having the appropriate dimensions and the right contents in the  information package diagrams . Structure for Key Measurements Users measure performance by using and comparing key measurements In order to review using proper key measurements, DW has to guarantee the information package diagrams contain all the relevant keys. Levels of Detail DW needs to provide drill-down and roll-up facilities for analysis How deep detail of data is needed in DW
Data Design “in Action” Structure for  Business Dimensions Structure for  Key Measurements Levels of Detail
The Architectural Plan
Source Data Production Data: Data get from operational system. Normally include financial system, customer relationship system, manufacturing system, etc. Internal Data: Private data keep by internal organization. Could be spreadsheets, documents, even departmental database Archived Data: Old data that is already not to be used in operational system. External Data: Data from outside systems, it can also from outside company. This type of data usually do not conform internal format
Data Staging Bad data lead to bad decision,  data quality in data warehouse is sacrosanct ETL process ensure data to be ready stored and processed in DW. In many cases data need to be extracted from sources in different scheme, different vendor, even in different format of flat files. If data extraction for a DW poses great challenges, data transformation presents even greater challenges. Data need to be cleaned from misspelling, resolution conflict, duplication, setting default missing values, etc. Initial load moves very large volumes of data. After that data staging will continuously extract the changes from sources. Extract Transform Load
Sample Architecture
Data Storage Specifications DBMS Selection User requirements affect the selection of the proper DBMS. Choice of the DBMS may be conditioned by its tool kit component. Features to be considered: Level of User Experience, Types of Queries, Need for Openness, Data Loads, Metadata Management, Data Repository Locations, Data Warehouse Growth. Storage Sizing Determined by how many data source and how much the data will grows continuously. If DW is expected to support Online Analytical Processing OLAP, then how much OLAP is necessary.
Information Delivery Strategy
Metadata Operational Metadata:   When deliver information to the end-users, you must be able to tie that back to the original source data sets. Operational metadata contain all of this information about theoperational data sources. Extraction and Transformation Metadata: Storing information of extraction frequencies, extraction methods, and business rules for the data extraction. End-User Metadata: Navigational map of the data warehouse, allows the end-users to use their own business terminology and look for information in those ways.
Management & Control Sits on top of all the other components. Controls the data transformation and the data transfer into the data warehouse storage. Interacts with the metadata component to perform the management and control functions. Metadata is the source of information for the management module.
End of Presentation & Thank You Very Much

More Related Content

PPTX
DW Migration Webinar-March 2022.pptx
PDF
Mdm: why, when, how
PPT
Gartner: Master Data Management Functionality
PDF
Data Architecture, Solution Architecture, Platform Architecture — What’s the ...
PDF
Time to Talk about Data Mesh
PPTX
Business requirements gathering for bi
PDF
Sample - Data Warehouse Requirements
PDF
Introduction to ETL and Data Integration
DW Migration Webinar-March 2022.pptx
Mdm: why, when, how
Gartner: Master Data Management Functionality
Data Architecture, Solution Architecture, Platform Architecture — What’s the ...
Time to Talk about Data Mesh
Business requirements gathering for bi
Sample - Data Warehouse Requirements
Introduction to ETL and Data Integration

What's hot (20)

PDF
Gathering Business Requirements for Data Warehouses
PPTX
Chapter 1: The Importance of Data Assets
PDF
Enabling a Data Mesh Architecture with Data Virtualization
PDF
Master Data Management - Aligning Data, Process, and Governance
PPTX
[DSC Europe 22] Lakehouse architecture with Delta Lake and Databricks - Draga...
PPTX
Building a Big Data Pipeline
PPTX
Master Data Management
PPTX
Free Training: How to Build a Lakehouse
PDF
White Paper - Data Warehouse Project Management
PDF
Moving to Databricks & Delta
PDF
Building a Data Strategy – Practical Steps for Aligning with Business Goals
PPTX
Azure data platform overview
PDF
DAS Slides: Data Architect vs. Data Engineer vs. Data Modeler
PPTX
Data Lakehouse, Data Mesh, and Data Fabric (r1)
PPTX
Snowflake Architecture.pptx
PDF
Intuit's Data Mesh - Data Mesh Leaning Community meetup 5.13.2021
PDF
Snowflake SnowPro Certification Exam Cheat Sheet
PDF
Agile Data Engineering: Introduction to Data Vault 2.0 (2018)
PPTX
Building Modern Data Platform with Microsoft Azure
PDF
Building Big Data Analytics Center Of Excellence
Gathering Business Requirements for Data Warehouses
Chapter 1: The Importance of Data Assets
Enabling a Data Mesh Architecture with Data Virtualization
Master Data Management - Aligning Data, Process, and Governance
[DSC Europe 22] Lakehouse architecture with Delta Lake and Databricks - Draga...
Building a Big Data Pipeline
Master Data Management
Free Training: How to Build a Lakehouse
White Paper - Data Warehouse Project Management
Moving to Databricks & Delta
Building a Data Strategy – Practical Steps for Aligning with Business Goals
Azure data platform overview
DAS Slides: Data Architect vs. Data Engineer vs. Data Modeler
Data Lakehouse, Data Mesh, and Data Fabric (r1)
Snowflake Architecture.pptx
Intuit's Data Mesh - Data Mesh Leaning Community meetup 5.13.2021
Snowflake SnowPro Certification Exam Cheat Sheet
Agile Data Engineering: Introduction to Data Vault 2.0 (2018)
Building Modern Data Platform with Microsoft Azure
Building Big Data Analytics Center Of Excellence
Ad

Viewers also liked (20)

PPSX
planning & project management for DWH
PPTX
DATA WAREHOUSING
PPTX
The Data Warehouse Lifecycle
ODP
04 Dimensional Analysis - v6
PDF
Data warehouse architecture
PPT
Warehouse components
PPT
Data Warehousing and Data Mining
DOC
Avesh sh
PDF
Data Warehouse Evolution Roadshow
PPT
CMMI Project Planning Presentation
PPT
Lecture 13
PPT
Lecture 1
PDF
Do's and dont's of warehouse moves
PPT
Introduction to DataMining
PDF
Data Ware House Testing
DOCX
Types of testing done in a Data Warehouse project
PDF
Tivoli data warehouse version 1.3 planning and implementation sg246343
PPT
Data warehouse 101-fundamentals-
PDF
Data warehousing testing strategies cognos
PPT
Dw Kickoff Meeting V4
planning & project management for DWH
DATA WAREHOUSING
The Data Warehouse Lifecycle
04 Dimensional Analysis - v6
Data warehouse architecture
Warehouse components
Data Warehousing and Data Mining
Avesh sh
Data Warehouse Evolution Roadshow
CMMI Project Planning Presentation
Lecture 13
Lecture 1
Do's and dont's of warehouse moves
Introduction to DataMining
Data Ware House Testing
Types of testing done in a Data Warehouse project
Tivoli data warehouse version 1.3 planning and implementation sg246343
Data warehouse 101-fundamentals-
Data warehousing testing strategies cognos
Dw Kickoff Meeting V4
Ad

Similar to Planning Data Warehouse (20)

DOCX
PPS
Data Warehouse 102
PPTX
Data warehouse
PPT
Data wirehouse
PPT
E06WarehouseDesignissuesindatawarehousedesign.ppt
PPT
E06WarehouseDesign.pptxkjhjkljhlkjhlkhlkj
PPT
Date warehousing concepts
PPT
IT Ready - DW: 1st Day
PPT
11667 Bitt I 2008 Lect4
PDF
BI Chapter 03.pdf business business business business business business
PPTX
DataWarehouse Architecture,daat mining,data mart,etl process.pptx
PPT
Introduction to Business Intelligence and Data warehousing - ppt
PPT
E05WAREH1.PPT
PPT
Data Warehousing Datamining Concepts
PPTX
DWDM Unit 1 (1).pptx
PPTX
Introduction to Data Warehousing
PPTX
Data warehouseold
PPTX
Business Intelligence Overview
PPS
Introduction to Data Warehousing
PPTX
data mining and data warehousing
Data Warehouse 102
Data warehouse
Data wirehouse
E06WarehouseDesignissuesindatawarehousedesign.ppt
E06WarehouseDesign.pptxkjhjkljhlkjhlkhlkj
Date warehousing concepts
IT Ready - DW: 1st Day
11667 Bitt I 2008 Lect4
BI Chapter 03.pdf business business business business business business
DataWarehouse Architecture,daat mining,data mart,etl process.pptx
Introduction to Business Intelligence and Data warehousing - ppt
E05WAREH1.PPT
Data Warehousing Datamining Concepts
DWDM Unit 1 (1).pptx
Introduction to Data Warehousing
Data warehouseold
Business Intelligence Overview
Introduction to Data Warehousing
data mining and data warehousing

Recently uploaded (20)

PDF
NewMind AI Weekly Chronicles - August'25 Week I
PPTX
A Presentation on Artificial Intelligence
PPTX
MYSQL Presentation for SQL database connectivity
DOCX
The AUB Centre for AI in Media Proposal.docx
PDF
Review of recent advances in non-invasive hemoglobin estimation
PDF
Spectral efficient network and resource selection model in 5G networks
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PDF
Unlocking AI with Model Context Protocol (MCP)
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
Modernizing your data center with Dell and AMD
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PPT
Teaching material agriculture food technology
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
NewMind AI Monthly Chronicles - July 2025
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
NewMind AI Weekly Chronicles - August'25 Week I
A Presentation on Artificial Intelligence
MYSQL Presentation for SQL database connectivity
The AUB Centre for AI in Media Proposal.docx
Review of recent advances in non-invasive hemoglobin estimation
Spectral efficient network and resource selection model in 5G networks
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
Building Integrated photovoltaic BIPV_UPV.pdf
Understanding_Digital_Forensics_Presentation.pptx
Agricultural_Statistics_at_a_Glance_2022_0.pdf
20250228 LYD VKU AI Blended-Learning.pptx
Diabetes mellitus diagnosis method based random forest with bat algorithm
Unlocking AI with Model Context Protocol (MCP)
Digital-Transformation-Roadmap-for-Companies.pptx
Modernizing your data center with Dell and AMD
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
Teaching material agriculture food technology
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
NewMind AI Monthly Chronicles - July 2025
Reach Out and Touch Someone: Haptics and Empathic Computing

Planning Data Warehouse

  • 1. Planning & Project Management Fahri Firdausillah [M031010012]
  • 2. Joke First, Serious Later Consultant: So, your company is into data warehousing? How many data marts do you have? Project Manager: Eleven. Consultant: That’s great. But why so many? Project Manager: Ten mistakes.
  • 3. Defining the Business Requirements Chapter 5
  • 4. Preamble OLTP and DW planning is different in term of requirements clarity Planning DW is about solving users’ problems and providing strategic information to the user. OLTP systems are primarily data capture systems. On the other hand, data warehouse systems are information delivery systems. Unlike an OLTP system, which is needed to run the day-to-day business, no immediate payout is seen in a decision support system.
  • 5. Dimensional Analysis the users are generally unable to define their requirements clearly. For most of the users, this could be the very first data warehouse. How can you build something the users are unable to define clearly and precisely? Need different approach of requirements gathering.
  • 6. Dimensional Analysis (cont'd) They can tell you what measurement units are important for them, how they combine the various pieces of information for strategic decision making. Although the actual proposed usage of a data warehouse could be unclear, the business dimensions used by the managers for decision making are not nebulous at all
  • 9. Information Packages The business dimensions and their hierarchical levels form the basis for all further development phases. The dimension hierarchies are the paths for drilling down or rolling up in our analysis
  • 10. Requirements Gathering Methods Types of Questions Open Ended Question These open up options for interviewees to respond Closed Question These allow limited responses to interviewees Probes These are really follow-up questions. Probes may be used after open-ended or closed questions Questionnaires Group Session Interview
  • 11. Sample Expectation from Interviews Senior Executives Dept. Managers IT Dept. Professionals Organization objectives Criteria for measuring success Key business issues, current & future Problem identification Vision and direction for the organization Anticipated usage of the DW Departmental objectives Success metrics Factors limiting success Key business issues Products & Services Useful business dimensions for analysis Anticipated usage of the DW Key operational source systems Current information delivery processes Types of routine analysis Known quality issues Current IT support for information requests Concerns about proposed DW
  • 12. Adapting JAD Identify project objectives and limitations Identify critical success factors Define project deliverables Define the schedule of workshop activities Select the participants Prepare the workshop material Organize workshop activities and exercises Prepare, inform, educate the workshop participants Coordinate workshop logistics 1. Project Definition 2. Research 3. Preparation 4. JAD Sessions 5. Final Document
  • 13. Requirement Definition: Scope & Content Requirements definition document is the basis for the next phases. Formal documentation will also validate your findings when reviewed with the users Data Sources Data Transformation Data Storage Information Delivery Information Package Diagrams
  • 14. Requirements Definition Document Outline Introduction General Requirements Descriptions Specific Requirements Information Packages Other Requirements User Expectations User Participation and Sign-Off General Implementation Plan
  • 15. Requirements as the Driving Force for Data Warehousing Chapter 6
  • 16. Preamble If accurate requirements definition is important for any operational system, it is many times more important for a data warehouse extremely important that your datawarehouse contains the right elements of information in the most optimal formats Every task that is performed in every phase in the development of the data warehouse is determined by the requirements Every decision made during the design phase is totally influenced by the requirements.
  • 18. Data Design (cont'd) Structure for Business Dimensions Importance of having the appropriate dimensions and the right contents in the information package diagrams . Structure for Key Measurements Users measure performance by using and comparing key measurements In order to review using proper key measurements, DW has to guarantee the information package diagrams contain all the relevant keys. Levels of Detail DW needs to provide drill-down and roll-up facilities for analysis How deep detail of data is needed in DW
  • 19. Data Design “in Action” Structure for Business Dimensions Structure for Key Measurements Levels of Detail
  • 21. Source Data Production Data: Data get from operational system. Normally include financial system, customer relationship system, manufacturing system, etc. Internal Data: Private data keep by internal organization. Could be spreadsheets, documents, even departmental database Archived Data: Old data that is already not to be used in operational system. External Data: Data from outside systems, it can also from outside company. This type of data usually do not conform internal format
  • 22. Data Staging Bad data lead to bad decision, data quality in data warehouse is sacrosanct ETL process ensure data to be ready stored and processed in DW. In many cases data need to be extracted from sources in different scheme, different vendor, even in different format of flat files. If data extraction for a DW poses great challenges, data transformation presents even greater challenges. Data need to be cleaned from misspelling, resolution conflict, duplication, setting default missing values, etc. Initial load moves very large volumes of data. After that data staging will continuously extract the changes from sources. Extract Transform Load
  • 24. Data Storage Specifications DBMS Selection User requirements affect the selection of the proper DBMS. Choice of the DBMS may be conditioned by its tool kit component. Features to be considered: Level of User Experience, Types of Queries, Need for Openness, Data Loads, Metadata Management, Data Repository Locations, Data Warehouse Growth. Storage Sizing Determined by how many data source and how much the data will grows continuously. If DW is expected to support Online Analytical Processing OLAP, then how much OLAP is necessary.
  • 26. Metadata Operational Metadata: When deliver information to the end-users, you must be able to tie that back to the original source data sets. Operational metadata contain all of this information about theoperational data sources. Extraction and Transformation Metadata: Storing information of extraction frequencies, extraction methods, and business rules for the data extraction. End-User Metadata: Navigational map of the data warehouse, allows the end-users to use their own business terminology and look for information in those ways.
  • 27. Management & Control Sits on top of all the other components. Controls the data transformation and the data transfer into the data warehouse storage. Interacts with the metadata component to perform the management and control functions. Metadata is the source of information for the management module.
  • 28. End of Presentation & Thank You Very Much