SlideShare a Scribd company logo
Best Practices Building a Data Warehouse  Quickly  October 16, 2009 | Florida Chapter  Presented by Raphael Klebanov, WhereScape USA Copyright © 2009 by WhereScape Software |  Slide #  Copyright © 2009 by WhereScape Software
Key factors  that influence a successful data warehouse task Implementing the  True  Development  Approach  Choosing a  Rapid Development  Product  Ensuring  Data Availability  Involving  Key Users  throughout the whole project Relying on a  Pragmatic Governance Framework  Utilizing experienced  Team Members Selecting the right  Hardware ,  Infrastructure Technology  Abstract Copyright © 2009 by WhereScape Software |  Slide #
Copyright © 2009 by WhereScape Software |  Slide #  Basic Architecture of a Data Warehouse
… for a intelligent   decision-making process? … for data warehouse? Are you ready … Copyright © 2009 by WhereScape Software |  Slide #
Why do Data Warehouse projects fail? Copyright © 2009 by WhereScape Software |  Slide #
Unreliable  or unattainable  user requirements Why do Data Warehouse projects fail? Copyright © 2009 by WhereScape Software |  Slide #
Unreliable  or unattainable  user requirements Quality of the data  that feeds the source system Why do Data Warehouse projects fail? Copyright © 2009 by WhereScape Software |  Slide #
Unreliable  or unattainable  user requirements Quality of the data  that feeds the source system Changing  source or target  requirements Why do Data Warehouse projects fail? Copyright © 2009 by WhereScape Software |  Slide #
Unreliable  or unattainable  user requirements Quality of the data  that feeds the source system Changing  source or target  requirements Poor development productivity Why do Data Warehouse projects fail? Copyright © 2009 by WhereScape Software |  Slide #
Unreliable  or unattainable  user requirements Quality of the data  that feeds the source system Changing  source or target  requirements Poor development productivity High TCO  (Total Cost of Ownership Why do Data Warehouse projects fail? Copyright © 2009 by WhereScape Software |  Slide #
Unreliable  or unattainable  user requirements Quality of the data  that feeds the source system Changing  source or target  requirements Poor development productivity High TCO  (Total Cost of Ownership) Poor documentation Why do Data Warehouse projects fail? Copyright © 2009 by WhereScape Software |  Slide #
Unreliable  or unattainable  user requirements Quality of the data  that feeds the source system Changing  source or target  requirements Poor development productivity High TCO  (Total Cost of Ownership) Poor documentation “… over 50% of data warehouse projects fail or go wildly over budget – they blame data quality…” The real problem is  project  approach. Source: Gartner. Magic Quadrant for Data Integration Tools, 2007 Why do Data Warehouse projects fail? Copyright © 2009 by WhereScape Software |  Slide #
DW Project Components Copyright © 2009 by WhereScape Software |  Slide #
Strong sponsorship  of the DW from the business DW Project Components Copyright © 2009 by WhereScape Software |  Slide #
Strong sponsorship  of the DW from the business Divide and Conquer  approach DW Project Components Copyright © 2009 by WhereScape Software |  Slide #
Strong sponsorship  of the DW from the business Divide and Conquer  approach Iterative Development  approach DW Project Components Copyright © 2009 by WhereScape Software |  Slide #
Strong sponsorship  of the DW from the business Divide and Conquer  approach Iterative Development  approach Productive development tools DW Project Components Copyright © 2009 by WhereScape Software |  Slide #
Strong sponsorship  of the DW from the business Divide and Conquer  approach Iterative Development  approach Productive development tools Real data  to populate the prototype DW Project Components Copyright © 2009 by WhereScape Software |  Slide #
Strong sponsorship  of the DW from the business Divide and Conquer  approach Iterative Development  approach Productive development tools Real data  to populate the prototype Access to SME  during development DW Project Components Copyright © 2009 by WhereScape Software |  Slide #
Strong sponsorship  of the DW from the business Divide and Conquer  approach Iterative Development  approach Productive development tools Real data  to populate the prototype Access to SME  during development Compact teams DW Project Components Copyright © 2009 by WhereScape Software |  Slide #
Strong sponsorship  of the DW from the business Divide and Conquer  approach Iterative Development  approach Productive development tools Real data  to populate the prototype Access to SME  during development Compact teams Sturdy  development  hardware DW Project Components Copyright © 2009 by WhereScape Software |  Slide #
Business Ownership Copyright © 2009 by WhereScape Software |  Slide #
The data warehouse should be owned by the  business –  not IT Business Ownership Copyright © 2009 by WhereScape Software |  Slide #
The data warehouse should be owned by the  business –  not IT A successful project depends upon creating a  partnership with the business  Business Ownership Copyright © 2009 by WhereScape Software |  Slide #
The data warehouse should be owned by the  business –  not IT A successful project depends upon creating a  partnership with the business  Prioritization of project phases or agreement on a data dictionary should be  agreed by the business Business Ownership Copyright © 2009 by WhereScape Software |  Slide #
The data warehouse should be owned by the  business –  not IT A successful project depends upon creating a  partnership with the business  Prioritization of project phases or agreement on a data dictionary should be  agreed by the business Without a strong,  high level business sponsor(s)  the project is likely to hit problems  Business Ownership Copyright © 2009 by WhereScape Software |  Slide #
The data warehouse should be owned by the  business –  not IT A successful project depends upon creating a  partnership with the business  prioritization of project phases or agreement on a data dictionary to should be  agreed by the business Without a strong,  high level business sponsor(s)  the project is likely to hit problems  If sponsorship is present then the data warehouse project can be broken down into a  set of smaller projects Business Ownership Copyright © 2009 by WhereScape Software |  Slide #
The Data Warehouse lifecycle  …as we know it
Divide and Conquer Copyright © 2009 by WhereScape Software |  Slide #
A ‘ big bang ’ approach to data warehousing has almost always ended in disaster  Divide and Conquer Copyright © 2009 by WhereScape Software |  Slide #
A ‘ big bang ’ approach to data warehousing has almost always ended in disaster  The project phases and the order in which they are developed should be decided by the  data warehouse sponsors Divide and Conquer Copyright © 2009 by WhereScape Software |  Slide #
A ‘ big bang ’ approach to data warehousing has almost always ended in disaster  The project phases and the order in which they are developed should be decided by the  data warehouse sponsors Momentum  is paramount for keeping the required focus  Divide and Conquer Copyright © 2009 by WhereScape Software |  Slide #
A ‘ big bang ’ approach to data warehousing has almost always ended in disaster  The project phases and the order in which they are developed should be decided by the  data warehouse sponsors Momentum  is paramount for keeping the required focus  Rapid prototyping and tight development cycles  are vital for successful warehouse Divide and Conquer Copyright © 2009 by WhereScape Software |  Slide #
A ‘ big bang ’ approach to data warehousing has almost always ended in disaster  The project phases and the order in which they are developed should be decided by the  data warehouse sponsors Momentum  is paramount for keeping the required focus  Rapid prototyping and tight development cycles  are vital for successful warehouse Keep in view the  bigger picture Divide and Conquer Copyright © 2009 by WhereScape Software |  Slide #
A ‘ big bang ’ approach to data warehousing has almost always ended in disaster  The project phases and the order in which they are developed should be decided by the  data warehouse sponsors Momentum  is paramount for keeping the required focus  Rapid prototyping and tight development cycles  are vital for successful warehouse Keep in view the  bigger picture Use  smaller phases  to  fund  the project adequately Divide and Conquer Copyright © 2009 by WhereScape Software |  Slide #
The True Project Approach Copyright © 2009 by WhereScape Software |  Slide #
Getting the business reps to use  working prototypes  to share an understanding of the scope The True Project Approach Copyright © 2009 by WhereScape Software |  Slide #
Getting the business reps to use  working prototypes  to share an understanding of the scope Collect detailed user requirements is exactly the  wrong start  to a data warehouse project The True Project Approach Copyright © 2009 by WhereScape Software |  Slide #
Getting the business reps to use  working prototypes  to share an understanding of the scope Collect detailed user requirements is exactly the  wrong start  to a data warehouse project Showing business users  the data and relationships  that are available to them in a working, populated prototype The True Project Approach Copyright © 2009 by WhereScape Software |  Slide #
Getting the business reps to use  working prototypes  to share an understanding of the scope Collect detailed user requirements is exactly the  wrong start  to a data warehouse project Showing business users  the data and relationships  that are available to them in a working, populated prototype A better place to start is to  collect KPIs and source system technical  documentation The True Project Approach Copyright © 2009 by WhereScape Software |  Slide #
Getting the business reps to use  working prototypes  to share an understanding of the scope Collect detailed user requirements is exactly the  wrong start  to a data warehouse project Showing business users the data and relationships that are available to them in a  working, populated prototype A better place to start is to  collect KPIs and source system technical  documentation OLAP technology  and  user workshops  are key tools in allowing the business to get their hands on the data The True Project Approach Copyright © 2009 by WhereScape Software |  Slide #
Getting the business reps to use  working prototypes  to share an understanding of the scope Collect detailed user requirements is exactly the  wrong start  to a data warehouse project Showing business users  the data and relationships  that are available to them in a working, populated prototype A better place to start is to  collect KPIs and source system technical  documentation OLAP technology  and  user workshops  are key tools in allowing the business to get their hands on the data Data quality should not be addressed in the DW;  problem should be fixed on the source system The True Project Approach Copyright © 2009 by WhereScape Software |  Slide #
Rapid Development Product Copyright © 2009 by WhereScape Software |  Slide #
Scrutinize Extract/Transform and Load  (ETL) tools  when considering building a DW. ETL tools  do not  provide the ability to  build a working prototype  and work in  short development cycles Rapid Development Product and ETL Copyright © 2009 by WhereScape Software |  Slide #
Combining processing and design Ability to enable, manage fast  iterations of the prototype Environment migration   Version control   Automatic documentation Rapid Development Product Enables: Copyright © 2009 by WhereScape Software |  Slide #
The Features of a DWLC Tool Copyright © 2009 by WhereScape Software |  Slide #
Single Development Interface The Features of a DWLC Tool Copyright © 2009 by WhereScape Software |  Slide #
Single Development Interface Documentation The Features of a DWLC Tool Copyright © 2009 by WhereScape Software |  Slide #
Single Development Interface Documentation Automated Table Generation  The Features of a DWLC Tool Copyright © 2009 by WhereScape Software |  Slide #
Single Development Interface Documentation Automated Table Generation  Automated Code Generation The Features of a DWLC Tool Copyright © 2009 by WhereScape Software |  Slide #
Single Development Interface Documentation Automated Table Generation  Automated Code Generation Metadata Migration The Features of a DWLC Tool Copyright © 2009 by WhereScape Software |  Slide #
Single Development Interface Documentation Automated Table Generation  Automated Code Generation Metadata Migration Version Control  The Features of a DWLC Tool Copyright © 2009 by WhereScape Software |  Slide #
Single Development Interface Documentation Automated Table Generation  Automated Code Generation Metadata Migration Version Control  Object Checkout The Features of a DWLC Tool Copyright © 2009 by WhereScape Software |  Slide #
Single Development Interface Documentation Automated Table Generation  Automated Code Generation Metadata Migration Version Control  Object Checkout Leverage Existing Core Skills. The Features of a DWLC Tool Copyright © 2009 by WhereScape Software |  Slide #
Single Development Interface Documentation Automated Table Generation  Automated Code Generation Metadata Migration Version Control  Object Checkout Leverage Existing Core Skills Consistent Framework The Features of a DWLC Tool Copyright © 2009 by WhereScape Software |  Slide #
Single Development Interface Documentation Automated Table Generation  Automated Code Generation Metadata Migration Version Control  Object Checkout Leverage Existing Core Skills Consistent Framework Extensibility The Features of a DWLC Tool Copyright © 2009 by WhereScape Software |  Slide #
Copyright © 2009 by WhereScape Software |  Slide #  Try to get it right first time: The SDLC approach  But: Tools and operators in silos – inflexible Hard to engage business users, no shared understanding Locked in requirements that can’t be met Redevelopment 120 day cycle Risky, expensive, never OTTB & never finished No documentation, so hard to support The Traditional Approach
Copyright © 2009 by WhereScape Software |  Slide #  Prototype and iterate to prove a design with users   Supported by: Integrated toolset and metadata repository – maximum flexibility Continuous business engagement, shared understanding No ambiguity or disagreement about scope Successful phase completion Complete, OTTB, user expectations exceeded Documented solution that is easy to support The Rapid Development Approach 5 day cycle
A DWLC Tool would save a  huge amount of development time and effort , and would enable the approach required to deliver a successful outcome The DWLC methodology is a child concept for  Agile Development Methodology  also known as Systems Development Life Cycle (SDLC)  The Features of a DWLC Tool Slide #
Ensuring Data Availability  Copyright © 2009 by WhereScape Software |  Slide #
The lack of good quality live data  will have a  major impact  on the success of Iterative project approach  The DW’s  capacity to answer BI requirements  is unworkable, without  sufficient data  to populate the DW If a  new source system  is integrated into the data warehouse, the  “real”  data is quite  essential  If no “real” data for new source is available, then the  significant rework  will be required once the source is up and running Ensuring Data Availability  Copyright © 2009 by WhereScape Software |  Slide #
Involving the Business Copyright © 2009 by WhereScape Software |  Slide #
Representatives from the Business provide the  partnership  with the DW development team These reps need to be able to  articulate the needs  of the business to the dev. team  These reps have to  trust the business department  behind them when it comes to making any decisions The partnership during the iterative project approach provides a  reliable, successful outcome The main forum for developers to show a working prototype and get user feedback is  user workshop   The business Involvement for the duration of the DW development will reduce the  QA overheads   Involving the Business Copyright © 2009 by WhereScape Software |  Slide #
Copyright © 2009 by WhereScape Software |  Slide #  Project governance
Governance of the data warehouse project should operate at  two levels : an  enterprise  level and  a  project  level  Pragmatic Governance Framework  Copyright © 2009 by WhereScape Software |  Slide #
Governance of the data warehouse project should operate at  two levels : an  enterprise  level and  a  project  level  Pragmatic Governance Framework  Copyright © 2009 by WhereScape Software |  Slide #  Business Requirements Technical Constraints
Governance of the data warehouse project should operate at  two levels : an  enterprise  level and  a  project  level  Pragmatic Governance Framework  Copyright © 2009 by WhereScape Software |  Slide #  Business Requirements Technical Constraints Shared understanding, Prototype and Iterate, Best possible outcome
Sponsorship is sourced from a  highly-placed executive The steering committee  provides: + Vision  + Visibility + Priorities  + Scope  + Focus  + Terminology Copyright © 2009 by WhereScape Software |  Slide #  The DW needs to be owned by the business
At a minimum project governance should include: A  project plan , detailing (high level) scope and timelines Regular  status meetings  to share information Change request  process documentation Standards and procedures  for building a consistent  DW Version control  and  backup  procedures Ownership of specific environments and  project roles Copyright © 2009 by WhereScape Software |  Slide #  Project governance
Utilizing Experienced Team Members  Copyright © 2009 by WhereScape Software |  Slide #
Productivity within a data warehouse implementation is dependent on having  experienced team members  – both on business side and also on the technical side Experienced  Subject Matter Experts  (SME) provide a thorough understanding of the business and its needs Experienced  data warehouse  developers can take those requirements and turn them into a functioning data warehouse in a rapid timeframe Utilizing Experienced Team Members  Copyright © 2009 by WhereScape Software |  Slide #
Selecting the Right Infrastructure Copyright © 2009 by WhereScape Software |  Slide #
Sufficient hardware  and technology  infrastructure  during development Lower productivity can translate into slower development cycles and iterations, which stands the  risk of losing  project  momentum Trade-off  between having adequately sized hardware and the cost associated with purchasing that hardware One way to mitigate undersized hardware is to  use smaller subsets of data  during the prototyping phase  Selecting the Right Infrastructure Copyright © 2009 by WhereScape Software |  Slide #
Treat the Warehousing as a  process,  not a project Conclusion Copyright © 2009 by WhereScape Software |  Slide #
This means:  focusing on  iterative releases  and rollouts that follow in quick succession keeping the warehouse  in line with the ever changing needs of the business , instead of treating it as a one-time project In order to achieve this, a  change in the development approach  and  tools utilized for building the data warehouse  must be adopted Conclusion Copyright © 2009 by WhereScape Software |  Slide #
The  key factors  to creating a successful data warehouse are: -------------------------------------------------------------------------------- Implementing the  True  Development  Approach  Choosing a  Rapid Development  Product  Ensuring  Data Availability  Involving  Key Users  throughout the whole project Relying on a  Pragmatic Governance Framework  Utilizing experienced  Team Members Selecting the right hardware and other related  Infrastructure Technology  Conclusion Copyright © 2009 by WhereScape Software |  Slide #
Raphael Klebanov,   Analyst at WhereScape Office Phone:   303.968.0703 Email address:   rklebanov@wherescape.com  Public Profile:  http://www.linkedin.com/in/raphaelklebanov My  personal information: Copyright © 2009 by WhereScape Software |  Slide #

More Related Content

PDF
WhereScape, the pioneer in data warehouse automation software
PDF
The Right Data Warehouse: Automation Now, Business Value Thereafter
PPT
Automation First as Strategy for Data Warehouse Modernization
PPTX
Agile data warehouse
PPTX
Data Warehousing in the Cloud: Practical Migration Strategies
PPTX
Future of Making Things
PPT
Kb 40 kevin_klineukug_reading20070717[1]
PDF
Pivotal the new_pivotal_big_data_suite_-_revolutionary_foundation_to_leverage...
 
WhereScape, the pioneer in data warehouse automation software
The Right Data Warehouse: Automation Now, Business Value Thereafter
Automation First as Strategy for Data Warehouse Modernization
Agile data warehouse
Data Warehousing in the Cloud: Practical Migration Strategies
Future of Making Things
Kb 40 kevin_klineukug_reading20070717[1]
Pivotal the new_pivotal_big_data_suite_-_revolutionary_foundation_to_leverage...
 

What's hot (20)

PPTX
a2c Boston Big Data Meet-up: Agile Data Warehouse Design
 
PDF
Slides: Accelerating Queries on Cloud Data Lakes
PPT
Bi presentation to bkk
PDF
Benefits of the Azure Cloud
PDF
You are not Facebook or Google? Why you should still care about Big Data and ...
PPTX
Becoming Data-Driven Through Cultural Change
PDF
The Heart of Data Modeling: The Best Data Modeler is a Lazy Data Modeler
PDF
Agile NoSQL With XRX
PPTX
Data Vault 2.0 DeMystified with Dan Linstedt and WhereScape
PPTX
A Masterclass on Data Warehouse Automation
PDF
Analytics in a Day Virtual Workshop
 
PDF
MT101 Dell OCIO: Delivering data and analytics in real time
PDF
MT11 - Turn Science Fiction into Reality by Using SAP HANA to Make Sense of IoT
PDF
Modernize your Infrastructure and Mobilize Your Data
PDF
Intuit's Data Mesh - Data Mesh Leaning Community meetup 5.13.2021
PPTX
Analyzing Billions of Data Rows with Alteryx, Amazon Redshift, and Tableau
PPTX
Best Practices: Datawarehouse Automation Conference September 20, 2012 - Amst...
PPTX
Snowflake: The Good, the Bad and the Ugly
PDF
How Businesses use Big Data to Impact the Bottom Line
PPTX
Business Intelligence In The Cloud
a2c Boston Big Data Meet-up: Agile Data Warehouse Design
 
Slides: Accelerating Queries on Cloud Data Lakes
Bi presentation to bkk
Benefits of the Azure Cloud
You are not Facebook or Google? Why you should still care about Big Data and ...
Becoming Data-Driven Through Cultural Change
The Heart of Data Modeling: The Best Data Modeler is a Lazy Data Modeler
Agile NoSQL With XRX
Data Vault 2.0 DeMystified with Dan Linstedt and WhereScape
A Masterclass on Data Warehouse Automation
Analytics in a Day Virtual Workshop
 
MT101 Dell OCIO: Delivering data and analytics in real time
MT11 - Turn Science Fiction into Reality by Using SAP HANA to Make Sense of IoT
Modernize your Infrastructure and Mobilize Your Data
Intuit's Data Mesh - Data Mesh Leaning Community meetup 5.13.2021
Analyzing Billions of Data Rows with Alteryx, Amazon Redshift, and Tableau
Best Practices: Datawarehouse Automation Conference September 20, 2012 - Amst...
Snowflake: The Good, the Bad and the Ugly
How Businesses use Big Data to Impact the Bottom Line
Business Intelligence In The Cloud
Ad

Viewers also liked (20)

PDF
WhereScape - Business Intelligence for Growth
PDF
Smarter Analytics: Supporting the Enterprise with Automation
PDF
Rev_3 Components of a Data Warehouse
PDF
Dw design 2_conceptual_model
PPT
Warehousing dimension star-snowflake_schemas
PDF
Test Automation for Data Warehouses
PDF
When Facts and Dimensions Alone Aren't the Answer: Logically Reversing the St...
PPTX
Data modeling star schema
PDF
Google Glass UX Best Practices Presentation by Litrik De Roy (@litrik) at the...
PDF
Pedro De Bruyckere Meetup Presentation
PDF
Trends for 2014
PDF
Information Lifecycle Governance Leader Reference Guide
PDF
Creating Better Customer Experiences Online (with Top Tasks) presented by Ger...
PDF
How business analysts are catalysts for business change
PDF
Data Vault Introduction
PDF
3D printing en korte keten recyclage (Evi Swinnen, timelab)
PPT
Smarter Eduction - Higher Education Summit 2011 - D Watt
PDF
Estrategia Information lifecycle Management
PDF
Creating a Smarter Shopping Experience with IBM Solutions at Carter's
PPTX
Information Lifecycle Management
WhereScape - Business Intelligence for Growth
Smarter Analytics: Supporting the Enterprise with Automation
Rev_3 Components of a Data Warehouse
Dw design 2_conceptual_model
Warehousing dimension star-snowflake_schemas
Test Automation for Data Warehouses
When Facts and Dimensions Alone Aren't the Answer: Logically Reversing the St...
Data modeling star schema
Google Glass UX Best Practices Presentation by Litrik De Roy (@litrik) at the...
Pedro De Bruyckere Meetup Presentation
Trends for 2014
Information Lifecycle Governance Leader Reference Guide
Creating Better Customer Experiences Online (with Top Tasks) presented by Ger...
How business analysts are catalysts for business change
Data Vault Introduction
3D printing en korte keten recyclage (Evi Swinnen, timelab)
Smarter Eduction - Higher Education Summit 2011 - D Watt
Estrategia Information lifecycle Management
Creating a Smarter Shopping Experience with IBM Solutions at Carter's
Information Lifecycle Management
Ad

Similar to Best Practices for Building a Warehouse Quickly (20)

PDF
Migrating to Cloud Native Solutions
PDF
Enterprise Monitoring 2018: Converged Application & Infrastructure Monitoring...
PPTX
Developing Your Cloud Strategy
PPTX
Developing Your Cloud Strategy
PDF
Moving to the Cloud-How to Develop Cloud Strategy for Your Organization
PPTX
"Why Micro Services adoption is crucial for SaaS companies?"
PPTX
TADSummit, Temasys Keynote: Marketing to Developers is Bloody Hard! Chip Wilc...
PPTX
A Deep Dive Into Comprehensive Citrix & VDI Monitoring with eG Enterprise
PPTX
Planning Your Hadoop NoSQL Projects For 2011
PDF
Implementing Cloud-Based DevOps for Distributed Agile Projects
PDF
Veeam up for VeeamON 2015
PPTX
Oracle Cloud upcoming trends
PDF
Pivotal data science_data_engineering_secret_weapons_of_the_strategic_enterprise
 
PPTX
So You Think You Know Everything About The Cloud
PDF
Rediscover Software Development Edward Hieatt Web Summit 2014
PDF
Imaginea Overview
PDF
Wise Men TIBCO ADF Webinar- 16 October 2014
PPTX
UKOUG Journey To The Cloud - March 2017
PPTX
Accelerate Design and Development of Data Projects Using AWS
PPTX
How to Assure Performance in Hybrid Cloud Ecosystems
Migrating to Cloud Native Solutions
Enterprise Monitoring 2018: Converged Application & Infrastructure Monitoring...
Developing Your Cloud Strategy
Developing Your Cloud Strategy
Moving to the Cloud-How to Develop Cloud Strategy for Your Organization
"Why Micro Services adoption is crucial for SaaS companies?"
TADSummit, Temasys Keynote: Marketing to Developers is Bloody Hard! Chip Wilc...
A Deep Dive Into Comprehensive Citrix & VDI Monitoring with eG Enterprise
Planning Your Hadoop NoSQL Projects For 2011
Implementing Cloud-Based DevOps for Distributed Agile Projects
Veeam up for VeeamON 2015
Oracle Cloud upcoming trends
Pivotal data science_data_engineering_secret_weapons_of_the_strategic_enterprise
 
So You Think You Know Everything About The Cloud
Rediscover Software Development Edward Hieatt Web Summit 2014
Imaginea Overview
Wise Men TIBCO ADF Webinar- 16 October 2014
UKOUG Journey To The Cloud - March 2017
Accelerate Design and Development of Data Projects Using AWS
How to Assure Performance in Hybrid Cloud Ecosystems

Recently uploaded (20)

PDF
Spectral efficient network and resource selection model in 5G networks
PDF
Video forgery: An extensive analysis of inter-and intra-frame manipulation al...
PDF
Accuracy of neural networks in brain wave diagnosis of schizophrenia
PPTX
Group 1 Presentation -Planning and Decision Making .pptx
PPTX
Machine Learning_overview_presentation.pptx
PDF
Heart disease approach using modified random forest and particle swarm optimi...
PDF
Machine learning based COVID-19 study performance prediction
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PDF
MIND Revenue Release Quarter 2 2025 Press Release
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PPTX
TechTalks-8-2019-Service-Management-ITIL-Refresh-ITIL-4-Framework-Supports-Ou...
PDF
Network Security Unit 5.pdf for BCA BBA.
PPTX
Tartificialntelligence_presentation.pptx
PDF
Empathic Computing: Creating Shared Understanding
PPTX
OMC Textile Division Presentation 2021.pptx
PDF
Encapsulation_ Review paper, used for researhc scholars
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PPTX
1. Introduction to Computer Programming.pptx
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
Spectral efficient network and resource selection model in 5G networks
Video forgery: An extensive analysis of inter-and intra-frame manipulation al...
Accuracy of neural networks in brain wave diagnosis of schizophrenia
Group 1 Presentation -Planning and Decision Making .pptx
Machine Learning_overview_presentation.pptx
Heart disease approach using modified random forest and particle swarm optimi...
Machine learning based COVID-19 study performance prediction
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
Reach Out and Touch Someone: Haptics and Empathic Computing
MIND Revenue Release Quarter 2 2025 Press Release
Mobile App Security Testing_ A Comprehensive Guide.pdf
TechTalks-8-2019-Service-Management-ITIL-Refresh-ITIL-4-Framework-Supports-Ou...
Network Security Unit 5.pdf for BCA BBA.
Tartificialntelligence_presentation.pptx
Empathic Computing: Creating Shared Understanding
OMC Textile Division Presentation 2021.pptx
Encapsulation_ Review paper, used for researhc scholars
Per capita expenditure prediction using model stacking based on satellite ima...
1. Introduction to Computer Programming.pptx
Diabetes mellitus diagnosis method based random forest with bat algorithm

Best Practices for Building a Warehouse Quickly

  • 1. Best Practices Building a Data Warehouse Quickly October 16, 2009 | Florida Chapter Presented by Raphael Klebanov, WhereScape USA Copyright © 2009 by WhereScape Software | Slide # Copyright © 2009 by WhereScape Software
  • 2. Key factors that influence a successful data warehouse task Implementing the True Development Approach Choosing a Rapid Development Product Ensuring Data Availability Involving Key Users throughout the whole project Relying on a Pragmatic Governance Framework Utilizing experienced Team Members Selecting the right Hardware , Infrastructure Technology Abstract Copyright © 2009 by WhereScape Software | Slide #
  • 3. Copyright © 2009 by WhereScape Software | Slide # Basic Architecture of a Data Warehouse
  • 4. … for a intelligent decision-making process? … for data warehouse? Are you ready … Copyright © 2009 by WhereScape Software | Slide #
  • 5. Why do Data Warehouse projects fail? Copyright © 2009 by WhereScape Software | Slide #
  • 6. Unreliable or unattainable user requirements Why do Data Warehouse projects fail? Copyright © 2009 by WhereScape Software | Slide #
  • 7. Unreliable or unattainable user requirements Quality of the data that feeds the source system Why do Data Warehouse projects fail? Copyright © 2009 by WhereScape Software | Slide #
  • 8. Unreliable or unattainable user requirements Quality of the data that feeds the source system Changing source or target requirements Why do Data Warehouse projects fail? Copyright © 2009 by WhereScape Software | Slide #
  • 9. Unreliable or unattainable user requirements Quality of the data that feeds the source system Changing source or target requirements Poor development productivity Why do Data Warehouse projects fail? Copyright © 2009 by WhereScape Software | Slide #
  • 10. Unreliable or unattainable user requirements Quality of the data that feeds the source system Changing source or target requirements Poor development productivity High TCO (Total Cost of Ownership Why do Data Warehouse projects fail? Copyright © 2009 by WhereScape Software | Slide #
  • 11. Unreliable or unattainable user requirements Quality of the data that feeds the source system Changing source or target requirements Poor development productivity High TCO (Total Cost of Ownership) Poor documentation Why do Data Warehouse projects fail? Copyright © 2009 by WhereScape Software | Slide #
  • 12. Unreliable or unattainable user requirements Quality of the data that feeds the source system Changing source or target requirements Poor development productivity High TCO (Total Cost of Ownership) Poor documentation “… over 50% of data warehouse projects fail or go wildly over budget – they blame data quality…” The real problem is project approach. Source: Gartner. Magic Quadrant for Data Integration Tools, 2007 Why do Data Warehouse projects fail? Copyright © 2009 by WhereScape Software | Slide #
  • 13. DW Project Components Copyright © 2009 by WhereScape Software | Slide #
  • 14. Strong sponsorship of the DW from the business DW Project Components Copyright © 2009 by WhereScape Software | Slide #
  • 15. Strong sponsorship of the DW from the business Divide and Conquer approach DW Project Components Copyright © 2009 by WhereScape Software | Slide #
  • 16. Strong sponsorship of the DW from the business Divide and Conquer approach Iterative Development approach DW Project Components Copyright © 2009 by WhereScape Software | Slide #
  • 17. Strong sponsorship of the DW from the business Divide and Conquer approach Iterative Development approach Productive development tools DW Project Components Copyright © 2009 by WhereScape Software | Slide #
  • 18. Strong sponsorship of the DW from the business Divide and Conquer approach Iterative Development approach Productive development tools Real data to populate the prototype DW Project Components Copyright © 2009 by WhereScape Software | Slide #
  • 19. Strong sponsorship of the DW from the business Divide and Conquer approach Iterative Development approach Productive development tools Real data to populate the prototype Access to SME during development DW Project Components Copyright © 2009 by WhereScape Software | Slide #
  • 20. Strong sponsorship of the DW from the business Divide and Conquer approach Iterative Development approach Productive development tools Real data to populate the prototype Access to SME during development Compact teams DW Project Components Copyright © 2009 by WhereScape Software | Slide #
  • 21. Strong sponsorship of the DW from the business Divide and Conquer approach Iterative Development approach Productive development tools Real data to populate the prototype Access to SME during development Compact teams Sturdy development hardware DW Project Components Copyright © 2009 by WhereScape Software | Slide #
  • 22. Business Ownership Copyright © 2009 by WhereScape Software | Slide #
  • 23. The data warehouse should be owned by the business – not IT Business Ownership Copyright © 2009 by WhereScape Software | Slide #
  • 24. The data warehouse should be owned by the business – not IT A successful project depends upon creating a partnership with the business Business Ownership Copyright © 2009 by WhereScape Software | Slide #
  • 25. The data warehouse should be owned by the business – not IT A successful project depends upon creating a partnership with the business Prioritization of project phases or agreement on a data dictionary should be agreed by the business Business Ownership Copyright © 2009 by WhereScape Software | Slide #
  • 26. The data warehouse should be owned by the business – not IT A successful project depends upon creating a partnership with the business Prioritization of project phases or agreement on a data dictionary should be agreed by the business Without a strong, high level business sponsor(s) the project is likely to hit problems Business Ownership Copyright © 2009 by WhereScape Software | Slide #
  • 27. The data warehouse should be owned by the business – not IT A successful project depends upon creating a partnership with the business prioritization of project phases or agreement on a data dictionary to should be agreed by the business Without a strong, high level business sponsor(s) the project is likely to hit problems If sponsorship is present then the data warehouse project can be broken down into a set of smaller projects Business Ownership Copyright © 2009 by WhereScape Software | Slide #
  • 28. The Data Warehouse lifecycle …as we know it
  • 29. Divide and Conquer Copyright © 2009 by WhereScape Software | Slide #
  • 30. A ‘ big bang ’ approach to data warehousing has almost always ended in disaster Divide and Conquer Copyright © 2009 by WhereScape Software | Slide #
  • 31. A ‘ big bang ’ approach to data warehousing has almost always ended in disaster The project phases and the order in which they are developed should be decided by the data warehouse sponsors Divide and Conquer Copyright © 2009 by WhereScape Software | Slide #
  • 32. A ‘ big bang ’ approach to data warehousing has almost always ended in disaster The project phases and the order in which they are developed should be decided by the data warehouse sponsors Momentum is paramount for keeping the required focus Divide and Conquer Copyright © 2009 by WhereScape Software | Slide #
  • 33. A ‘ big bang ’ approach to data warehousing has almost always ended in disaster The project phases and the order in which they are developed should be decided by the data warehouse sponsors Momentum is paramount for keeping the required focus Rapid prototyping and tight development cycles are vital for successful warehouse Divide and Conquer Copyright © 2009 by WhereScape Software | Slide #
  • 34. A ‘ big bang ’ approach to data warehousing has almost always ended in disaster The project phases and the order in which they are developed should be decided by the data warehouse sponsors Momentum is paramount for keeping the required focus Rapid prototyping and tight development cycles are vital for successful warehouse Keep in view the bigger picture Divide and Conquer Copyright © 2009 by WhereScape Software | Slide #
  • 35. A ‘ big bang ’ approach to data warehousing has almost always ended in disaster The project phases and the order in which they are developed should be decided by the data warehouse sponsors Momentum is paramount for keeping the required focus Rapid prototyping and tight development cycles are vital for successful warehouse Keep in view the bigger picture Use smaller phases to fund the project adequately Divide and Conquer Copyright © 2009 by WhereScape Software | Slide #
  • 36. The True Project Approach Copyright © 2009 by WhereScape Software | Slide #
  • 37. Getting the business reps to use working prototypes to share an understanding of the scope The True Project Approach Copyright © 2009 by WhereScape Software | Slide #
  • 38. Getting the business reps to use working prototypes to share an understanding of the scope Collect detailed user requirements is exactly the wrong start to a data warehouse project The True Project Approach Copyright © 2009 by WhereScape Software | Slide #
  • 39. Getting the business reps to use working prototypes to share an understanding of the scope Collect detailed user requirements is exactly the wrong start to a data warehouse project Showing business users the data and relationships that are available to them in a working, populated prototype The True Project Approach Copyright © 2009 by WhereScape Software | Slide #
  • 40. Getting the business reps to use working prototypes to share an understanding of the scope Collect detailed user requirements is exactly the wrong start to a data warehouse project Showing business users the data and relationships that are available to them in a working, populated prototype A better place to start is to collect KPIs and source system technical documentation The True Project Approach Copyright © 2009 by WhereScape Software | Slide #
  • 41. Getting the business reps to use working prototypes to share an understanding of the scope Collect detailed user requirements is exactly the wrong start to a data warehouse project Showing business users the data and relationships that are available to them in a working, populated prototype A better place to start is to collect KPIs and source system technical documentation OLAP technology and user workshops are key tools in allowing the business to get their hands on the data The True Project Approach Copyright © 2009 by WhereScape Software | Slide #
  • 42. Getting the business reps to use working prototypes to share an understanding of the scope Collect detailed user requirements is exactly the wrong start to a data warehouse project Showing business users the data and relationships that are available to them in a working, populated prototype A better place to start is to collect KPIs and source system technical documentation OLAP technology and user workshops are key tools in allowing the business to get their hands on the data Data quality should not be addressed in the DW; problem should be fixed on the source system The True Project Approach Copyright © 2009 by WhereScape Software | Slide #
  • 43. Rapid Development Product Copyright © 2009 by WhereScape Software | Slide #
  • 44. Scrutinize Extract/Transform and Load (ETL) tools when considering building a DW. ETL tools do not provide the ability to build a working prototype and work in short development cycles Rapid Development Product and ETL Copyright © 2009 by WhereScape Software | Slide #
  • 45. Combining processing and design Ability to enable, manage fast iterations of the prototype Environment migration Version control Automatic documentation Rapid Development Product Enables: Copyright © 2009 by WhereScape Software | Slide #
  • 46. The Features of a DWLC Tool Copyright © 2009 by WhereScape Software | Slide #
  • 47. Single Development Interface The Features of a DWLC Tool Copyright © 2009 by WhereScape Software | Slide #
  • 48. Single Development Interface Documentation The Features of a DWLC Tool Copyright © 2009 by WhereScape Software | Slide #
  • 49. Single Development Interface Documentation Automated Table Generation The Features of a DWLC Tool Copyright © 2009 by WhereScape Software | Slide #
  • 50. Single Development Interface Documentation Automated Table Generation Automated Code Generation The Features of a DWLC Tool Copyright © 2009 by WhereScape Software | Slide #
  • 51. Single Development Interface Documentation Automated Table Generation Automated Code Generation Metadata Migration The Features of a DWLC Tool Copyright © 2009 by WhereScape Software | Slide #
  • 52. Single Development Interface Documentation Automated Table Generation Automated Code Generation Metadata Migration Version Control The Features of a DWLC Tool Copyright © 2009 by WhereScape Software | Slide #
  • 53. Single Development Interface Documentation Automated Table Generation Automated Code Generation Metadata Migration Version Control Object Checkout The Features of a DWLC Tool Copyright © 2009 by WhereScape Software | Slide #
  • 54. Single Development Interface Documentation Automated Table Generation Automated Code Generation Metadata Migration Version Control Object Checkout Leverage Existing Core Skills. The Features of a DWLC Tool Copyright © 2009 by WhereScape Software | Slide #
  • 55. Single Development Interface Documentation Automated Table Generation Automated Code Generation Metadata Migration Version Control Object Checkout Leverage Existing Core Skills Consistent Framework The Features of a DWLC Tool Copyright © 2009 by WhereScape Software | Slide #
  • 56. Single Development Interface Documentation Automated Table Generation Automated Code Generation Metadata Migration Version Control Object Checkout Leverage Existing Core Skills Consistent Framework Extensibility The Features of a DWLC Tool Copyright © 2009 by WhereScape Software | Slide #
  • 57. Copyright © 2009 by WhereScape Software | Slide # Try to get it right first time: The SDLC approach But: Tools and operators in silos – inflexible Hard to engage business users, no shared understanding Locked in requirements that can’t be met Redevelopment 120 day cycle Risky, expensive, never OTTB & never finished No documentation, so hard to support The Traditional Approach
  • 58. Copyright © 2009 by WhereScape Software | Slide # Prototype and iterate to prove a design with users Supported by: Integrated toolset and metadata repository – maximum flexibility Continuous business engagement, shared understanding No ambiguity or disagreement about scope Successful phase completion Complete, OTTB, user expectations exceeded Documented solution that is easy to support The Rapid Development Approach 5 day cycle
  • 59. A DWLC Tool would save a huge amount of development time and effort , and would enable the approach required to deliver a successful outcome The DWLC methodology is a child concept for Agile Development Methodology also known as Systems Development Life Cycle (SDLC) The Features of a DWLC Tool Slide #
  • 60. Ensuring Data Availability Copyright © 2009 by WhereScape Software | Slide #
  • 61. The lack of good quality live data will have a major impact on the success of Iterative project approach The DW’s capacity to answer BI requirements is unworkable, without sufficient data to populate the DW If a new source system is integrated into the data warehouse, the “real” data is quite essential If no “real” data for new source is available, then the significant rework will be required once the source is up and running Ensuring Data Availability Copyright © 2009 by WhereScape Software | Slide #
  • 62. Involving the Business Copyright © 2009 by WhereScape Software | Slide #
  • 63. Representatives from the Business provide the partnership with the DW development team These reps need to be able to articulate the needs of the business to the dev. team These reps have to trust the business department behind them when it comes to making any decisions The partnership during the iterative project approach provides a reliable, successful outcome The main forum for developers to show a working prototype and get user feedback is user workshop The business Involvement for the duration of the DW development will reduce the QA overheads Involving the Business Copyright © 2009 by WhereScape Software | Slide #
  • 64. Copyright © 2009 by WhereScape Software | Slide # Project governance
  • 65. Governance of the data warehouse project should operate at two levels : an enterprise level and a project level Pragmatic Governance Framework Copyright © 2009 by WhereScape Software | Slide #
  • 66. Governance of the data warehouse project should operate at two levels : an enterprise level and a project level Pragmatic Governance Framework Copyright © 2009 by WhereScape Software | Slide # Business Requirements Technical Constraints
  • 67. Governance of the data warehouse project should operate at two levels : an enterprise level and a project level Pragmatic Governance Framework Copyright © 2009 by WhereScape Software | Slide # Business Requirements Technical Constraints Shared understanding, Prototype and Iterate, Best possible outcome
  • 68. Sponsorship is sourced from a highly-placed executive The steering committee provides: + Vision + Visibility + Priorities + Scope + Focus + Terminology Copyright © 2009 by WhereScape Software | Slide # The DW needs to be owned by the business
  • 69. At a minimum project governance should include: A project plan , detailing (high level) scope and timelines Regular status meetings to share information Change request process documentation Standards and procedures for building a consistent DW Version control and backup procedures Ownership of specific environments and project roles Copyright © 2009 by WhereScape Software | Slide # Project governance
  • 70. Utilizing Experienced Team Members Copyright © 2009 by WhereScape Software | Slide #
  • 71. Productivity within a data warehouse implementation is dependent on having experienced team members – both on business side and also on the technical side Experienced Subject Matter Experts (SME) provide a thorough understanding of the business and its needs Experienced data warehouse developers can take those requirements and turn them into a functioning data warehouse in a rapid timeframe Utilizing Experienced Team Members Copyright © 2009 by WhereScape Software | Slide #
  • 72. Selecting the Right Infrastructure Copyright © 2009 by WhereScape Software | Slide #
  • 73. Sufficient hardware and technology infrastructure during development Lower productivity can translate into slower development cycles and iterations, which stands the risk of losing project momentum Trade-off between having adequately sized hardware and the cost associated with purchasing that hardware One way to mitigate undersized hardware is to use smaller subsets of data during the prototyping phase Selecting the Right Infrastructure Copyright © 2009 by WhereScape Software | Slide #
  • 74. Treat the Warehousing as a process, not a project Conclusion Copyright © 2009 by WhereScape Software | Slide #
  • 75. This means: focusing on iterative releases and rollouts that follow in quick succession keeping the warehouse in line with the ever changing needs of the business , instead of treating it as a one-time project In order to achieve this, a change in the development approach and tools utilized for building the data warehouse must be adopted Conclusion Copyright © 2009 by WhereScape Software | Slide #
  • 76. The key factors to creating a successful data warehouse are: -------------------------------------------------------------------------------- Implementing the True Development Approach Choosing a Rapid Development Product Ensuring Data Availability Involving Key Users throughout the whole project Relying on a Pragmatic Governance Framework Utilizing experienced Team Members Selecting the right hardware and other related Infrastructure Technology Conclusion Copyright © 2009 by WhereScape Software | Slide #
  • 77. Raphael Klebanov, Analyst at WhereScape Office Phone: 303.968.0703 Email address: rklebanov@wherescape.com Public Profile: http://www.linkedin.com/in/raphaelklebanov My personal information: Copyright © 2009 by WhereScape Software | Slide #