SlideShare a Scribd company logo
Enterprise Data Warehouse Fundamentals 101 KIDS Phase II Project   Mojo Nwokoma Director ,  Enterprise Data Systems Architecture Office of Assessment & Information Services Oregon Department of Education 503-378-3600 x2242 [email_address]
What is Enterprise DW/BI Solutions Data Warehouse (DW) is a collection of integrated, subject oriented, time-variant, and non-volatile data from various sources into a single and consistent warehouse that supports reporting, analysis, and decision making within the enterprise.  Integrates operational data through consistent  naming conventions, measurements, physical attributes, and semantics. Business Intelligence (BI) solutions use a blending of technologies, including relational and multi-dimansional databases, client/server architecture, and graphical user interface, to integrate disparate data sources into a single coherent framework for real-time reporting, drill-through analysis and decision support.
KIDS Phase I Project Report: The Business Case for Change NCLB & Federal Accountability: The reporting and performance requirements of NCLB demands a fundamental change in the statewide data collection and reporting. The need to report individual student achievement and aggregation of data by subgroups. Statewide Accountability & Efficiency: Major gaps and inefficiencies exist in information collection, reporting and analysis at both district and state levels that need to be addressed. Funding formula compliance and equity, and ability to evaluate relative effectiveness/ineffectiveness of education programs. Student & Community Service: Growing stakeholder demand for a significant improvement in student records availability, accessibility, portability, and accuracy. Cost Efficiency Gains: Evidence of economies of scale resulting in cost reductions in the management of administrative systems at the larger school districts and ESD structure. Source: KIDS Phase 1 Final report: ODE & IBM Confidential  (10/20/05)
KIDS Phase II Project  Planning: 5 key questions for a successful project planning & implementation   What? When? Who? Why? How? SUCCESS Deliverables/Scope Stakeholders Resources -  Business Case   Time Frame
The Essential Building Blocks for a Successful Enterprise Information Management Project
Enterprise Data Warehouse Architecture District Data Warehouse STAGING - Integration - Metadata - Cleansed  Profiled Biz Rules Data Mart SIS Data Mart FINANCE Data Mart Transportation Data Mart Instruction HR SIS Curriculum  &  Instruction Finance Nutrition Extraction Phase Transformation Phase Load Phase Data Management H/W Server Platform Applications
KIDS Phase II Project District/ESD Server Deployment & Data Warehouse Integration Architecture Transaction System Transaction System Transaction System Transaction System Transaction System ODE (State) DW Physical  & Virtual ODS = Operational Data Store DW = Data Warehouse ODS ODS ODS ODS ODS Hillsboro District DW Beaverton District DW Portland District DW Eugene District DW ESDs DW LEGENDS:   KIDS Work = ODE Districts Record Exchange
Project Planning Methodology – “The How?” For the Project Team Step 1: Define the Work Breakdown Structure    The first  is to create a comprehensive Work Breakdown Structure (WBS). The WBS lists all the phases, activities and tasks required to undertake the project. Identify and describe each phase, activity and task required to complete the project successfully. Depict the order in which the tasks must be undertaken and identify any key internal and external project dependencies. Also list the critical project milestones, such as the completion of key project deliverables   Step 2: Identify the Required Resources     Having listed all of the tasks required to undertake the project, you now need to identify the generic resources required to complete each task. Examples of types of resource include: full-time and part-time staff, contractors, equipment and materials. For each resource type, identify the quantity required, the delivery dates and the project tasks in the WBS that the resource will be used to help complete.   Step 3: Construct a Project Schedule   To construct your schedule, you need to:  List the phases, activities and tasks Sequence the phases, activities and tasks Add key internal and external  dependencies  Allocate relevant completion  timeframes  Add additional  contingency  to mitigate risk Assign  resources  required to complete tasks List critical delivery  milestones  Specify any  assumptions  and  constraints
Current  Data Environment Lack of granular, integrated, accurate, standardized, and timely data regarding student performance and achievement both for individual students and specific student subgroups. Lack of data and information integration between instructional management sources and assessment sources aligned to the state’s specific educational standards. Other factors affected include attendance record, discipline, teacher qualification, classroom size, and instructional hours. Lack of close collaboration between the districts and ODE in tracking students as they move through the educational system, both vertically and horizontally, in order to improve performance by identifying actionable indicators.  Lack of specific multi-year student performance information to support longitudinal analysis, accessible at the State, District, and school levels with appropriate controls  to assure confidentiality. Need for data and tool standardization between all reporting districts to ensure accurate, consistent, and useful analytical input for decision making purposes. No single version of the truth exists for business rules and data definitions among various data sources. Lack of easily validated financial information that accurately reports budgeted vs actual expenditures by program that allows correlation of these expenditures to student performances. Lack of appropriately controlled online access to information for all stakeholders regarding student progress and school quality.
Problems with Current Decision Support Data redundancy and process redundancy Data is not integrated and cannot be shared Data is not understood or misunderstood Inconsistent data definitions & business rules Data retrieval is difficult and time consuming Operational files may not contain history Reports are inconsistent in content & format Data is too dirty for business analysis Multiple versions of the truth Reports and associated BI tools are not standardized.
Recommended Model for Enterprise Data Warehouse System KIDS Phase 11 Project E-Portal Data Warehouse Operational Data Store ODS Transactional Database Educational Stakeholder Communication Benchmarking/Decision Support District & State Reporting Day to Day Operations PK-12 Data Model for Information Management (ODE & IBM Confidential)  10/20/05
Standards  Governance (setting priorities) Data naming, aliases, abbreviations list Meta data capture and maintenance Data quality and data management Testing standards Security standards Measuring results (benefits, costs, usage) Service level agreements Rules and protocols to be followed by all  users and developers for all applications
DW roles & responsibilities Business User (Client) Business User Support Data Administrator Data Analysts Meta Data Administrator Database Administrator Developers ETL BI – Reports, Queries
DW roles & responsibilities (continued) Security Officer Auditor  Data Warehouse Project Manager Technical Services DW Architect Technical Advisory Board Steering Committee
Information quality Data is accurate Data is consistent Data is timely Data is integrated Data is complete Data values follow the business rules Data corresponds to valid values Data is well understood
Data Warehouse/ODS & BI Layers Data Integration Layer & Operational Data Store (ODS):  ODS  is a non queryable centralized staging areas for storing extracted, cleansing,   and transformed data,  and for gathering centralized metadata for implementing an Enterprise Data Mart Architecture (EDMA), eliminating the need for another non queryable staging area called data warehouse. Needed is a dimensionally modeled  Data Warehouse  for enterprise DSS, prepared to provide the best in  query response performance and to support the most advanced OLAP functionalities.
Meta data components Data name (entity, attribute, table, column, field, etc.) Business description of data (Project Start Date) Source of data (file, field) Business Owner of data Business rules  Transformation rules Domains (allowable values) Data relationships
Meta data components (continued) Data quality (measure of reliability) Timeliness (ex: current as of certain date)  Historical information Aggregation rules Security (who has access)
Meta data management Meta data administration Business meta data  Technical meta data  ETL reconciliation Data quality metrics Standardization Data ownership Enterprise integration Meta Data =  [Descriptive]   Data About Data   [of the Business] Information = Data within context Context = Meta data Information = Data + Meta data
KIDS Phase II Project   05-07 Workflow Review and validate business case for Phase II project Identify key stakeholders. Validate requirements, Deliverables, and Expectations Identify two or three key districts with viable data warehouse infrastructure as test sites. Review enterprise architecture, and identify Infrastructure components Transaction Level Applications Operating & Database systems Hardware Platforms 3.  Design Data Warehouse & Operational Data Store (ODS) Data Model Design Dimensional Modeling Schemas Configure Extraction, Transformation, Load (ETL) Metadata Capture (Repository or Data labels) Data Quality Profiling Vendor selection will be based on a competitive “Bake-off” results from three top vendors. 4.  Data Reporting & Information Delivery Mechanism Leverage existing reporting On-line Analytical Processing (OLAP) infrastructure Design and implement subject-area data marts for effective horizontal reporting integration Develop OLAP cubes for report aggregation and slice/dice querying investigations Deploy enterprise portal with built-in security and user access authentication. 5.  End-user Training Design and schedule end-user training at all levels of data and reporting needs. Identify “Train-the-Trainer” candidates from each school district for more detailed training.
KIDS Phase II Project Key System Deliverables   Transactional Systems/District Data Warehouse Integrating systems that support instructional management that empowers teachers to combine student performance and instructional data to make informed classroom decisions. Districts will be better able to meet NCLB and state standards for having “highly qualified” teachers in the classroom. 2.  Integrated & Interoperable Operational Data Store (ODS)  Provide ability to evaluate student performance within selected programs across various schools and districts in the State, and highlight ways to achieve AYP consistently over time. Allows for quick turnaround in transferring students records when they move between districts. Ease reporting burden on districts, and eliminate redundant and possibly inaccurate reporting of data, and a better foundation for integrated data analysis. 3.  Data Warehouse & Decision Support System /Tools A repository for State and district reporting and analysis even at student-level data. Allows for more meaningful system-level questions and answers by legislators or policy makers. Greater system accessibility to all users with relevant security access privileges will mean greater acceptance and use, and ultimately better decision. 4.  System Wide Communication Portal Provide a focused location for access to information, analytical/reporting tools, and the necessary training and support. Also, maximization of effective system-wide use of state’s data warehouse. Rapid and wide dissemination of integrated and proven instructional and administrative practices.
KIDS Phase II Project   High-level Project Work Plan, Time-line, & Resource requirements  Project Phase:   Time-line Resource Requirements Requirements Validation   January 10, 2006   Mojo & Gary Scheduled trips to all   Districts & ESDs 3.  “ Test Site ” DW/ODS Modeling,  June 30, 2006  Database Administrator integration, ETL, Data Quality,  Data Modeler/Analysts Meta Data Repository, and  Data Quality Analysts Vendor “Bake-off” contracting  Business User (Client) Meta Data Administrator ETL & BI developers Data Warehouse Project Manager Inauguration of Governance &  January 30, 2006  Doug Kosty & Mojo Nwokoma Project Team committee members 4.  OLAP & Portal Development,  October 30, 2006  End-user Business Analyst including Training, and  Web Developer vendor “Bake-off” Contracting  Portal Dashboard developer Business User (Client) BI OLAP Report Developer “ Train-the-trainer” Data Security Officer
BI - OLAP Data Warehouse Architecture
User expectations Expectations must be managed in terms of: Schedule Budget Scope Performance Availability Simplicity (ease of use) Tool functionality Data cleanliness Users’ roles and responsibilities
User responsibilities Be a full-time member of the Core Team  Participate in all data modeling sessions Co-manage the DW project Make decisions and escalate disputes to the Steering Committee  Provide meta data for business objects Identify data security requirements Participate in BI tool selection Participate in all review sessions Participate in all testing activities
IT Staffing DW roles & responsibilities Dedicated IT team New skill set – beyond tools, discipline Contractors & consultants Knowledge transfer Training Just in time Just enough Knowledge transfer through collaboration
Risks to be mitigated  Low management commitment  Low user commitment  Unrealistic schedule  Unrealistic user expectations  Budget too small  Untrained or unavailable staff
Risks to be mitigated  Unclear or changing requirements  Poor project management Creeping scope Initial project too large Wrong project  Changing priorities  Data cleansing not addressed early  Vendors out of control
Risks to be mitigated  Not architected properly (wrong design)  Inappropriate organization structure  Lost or changed sponsor  New technology not understood  No procedure to resolve disputes  Exceeding platform capabilities

More Related Content

PPT
Data Warehouse Methodology
PPT
Bi presentation to bkk
PPTX
Data warehouse design
PDF
Data Warehouse Design and Best Practices
PPT
Dw & etl concepts
PPTX
Data warehouse
PPT
Warehouse components
PDF
Rev_3 Components of a Data Warehouse
Data Warehouse Methodology
Bi presentation to bkk
Data warehouse design
Data Warehouse Design and Best Practices
Dw & etl concepts
Data warehouse
Warehouse components
Rev_3 Components of a Data Warehouse

What's hot (20)

PDF
Introduction to Data Warehousing
PDF
Data warehouse architecture
PPTX
Traditional Data-warehousing / BI overview
ODP
Introduction To Data Warehousing
PPTX
DATA WAREHOUSING
PPSX
Data warehouse
PPTX
Best Practices: Datawarehouse Automation Conference September 20, 2012 - Amst...
PDF
The technology of the business data lake
PPTX
Data Warehousing - in the real world
PDF
Agile Data Warehouse Design for Big Data Presentation
PDF
Data Warehouse Tutorial For Beginners | Data Warehouse Concepts | Data Wareho...
PDF
How Real TIme Data Changes the Data Warehouse
PPT
Datawarehouse & bi introduction
PPTX
From Traditional Data Warehouse To Real Time Data Warehouse
PPTX
Building an Effective Data Warehouse Architecture
PDF
Introduction to Business Intelligence
PPTX
DATA WAREHOUSING
PPT
Become BI Architect with 1KEY Agile BI Suite - OLAP
PPT
Datawarehousing and Business Intelligence
PPT
Introduction to data warehousing
Introduction to Data Warehousing
Data warehouse architecture
Traditional Data-warehousing / BI overview
Introduction To Data Warehousing
DATA WAREHOUSING
Data warehouse
Best Practices: Datawarehouse Automation Conference September 20, 2012 - Amst...
The technology of the business data lake
Data Warehousing - in the real world
Agile Data Warehouse Design for Big Data Presentation
Data Warehouse Tutorial For Beginners | Data Warehouse Concepts | Data Wareho...
How Real TIme Data Changes the Data Warehouse
Datawarehouse & bi introduction
From Traditional Data Warehouse To Real Time Data Warehouse
Building an Effective Data Warehouse Architecture
Introduction to Business Intelligence
DATA WAREHOUSING
Become BI Architect with 1KEY Agile BI Suite - OLAP
Datawarehousing and Business Intelligence
Introduction to data warehousing
Ad

Viewers also liked (20)

DOCX
Types of testing done in a Data Warehouse project
PDF
Data Ware House Testing
PDF
Tivoli data warehouse version 1.3 planning and implementation sg246343
PDF
Data warehousing testing strategies cognos
PPT
Dw Kickoff Meeting V4
DOC
Testing data warehouse applications by Kirti Bhushan
PPTX
ETL Validator: Creating Data Model
PPT
Planning Data Warehouse
PDF
2013 OHSUG - Clinical Data Warehouse Implementation
ODP
Data warehouse inmon versus kimball 2
PPTX
The Data Warehouse Lifecycle
PDF
Microsoft Data Warehouse Business Intelligence Lifecycle - The Kimball Approach
DOC
Data warehouse concepts
PPTX
Oracle: Fundamental Of DW
PPT
DATA WAREHOUSING AND DATA MINING
PPS
Introduction to Data Warehousing
PDF
White Paper - Data Warehouse Project Management
PPT
Data Warehousing and Data Mining
PPT
Data Warehouse Modeling
PPTX
What is a Data Warehouse and How Do I Test It?
Types of testing done in a Data Warehouse project
Data Ware House Testing
Tivoli data warehouse version 1.3 planning and implementation sg246343
Data warehousing testing strategies cognos
Dw Kickoff Meeting V4
Testing data warehouse applications by Kirti Bhushan
ETL Validator: Creating Data Model
Planning Data Warehouse
2013 OHSUG - Clinical Data Warehouse Implementation
Data warehouse inmon versus kimball 2
The Data Warehouse Lifecycle
Microsoft Data Warehouse Business Intelligence Lifecycle - The Kimball Approach
Data warehouse concepts
Oracle: Fundamental Of DW
DATA WAREHOUSING AND DATA MINING
Introduction to Data Warehousing
White Paper - Data Warehouse Project Management
Data Warehousing and Data Mining
Data Warehouse Modeling
What is a Data Warehouse and How Do I Test It?
Ad

Similar to Data warehouse 101-fundamentals- (20)

DOCX
Business Intelligence
PPT
sum10_T2.ppt
PPS
Data Warehouse 102
PDF
Implementing BI & DW Governance
PDF
Summit 2011 ods edw technical
PDF
Data Quality in Data Warehouse and Business Intelligence Environments - Disc...
PPTX
Education Data Warehouse System
PPTX
Business Intelligence Overview
PDF
The final frontier
PPTX
Data Warehouse for data analytics presentation
DOCX
2DATA WAREHOUSING SYSTEMAchyut Sai Chillara50215.docx
PPT
Chapter 2-data-warehousingppt2517 vero
PDF
Data warehousing change in a challenging environment
PPTX
Re tooling for data management-support
PDF
PDF
Application of Data Warehousing & Data Mining to Exploitation for Supporting ...
DOCX
sharon - cv
PDF
IOUG93 - Technical Architecture for the Data Warehouse - Paper
PDF
TOP_PASSOS GOV DADOS.pdf
PPTX
Lesson 3 - The Kimbal Lifecycle.pptx
Business Intelligence
sum10_T2.ppt
Data Warehouse 102
Implementing BI & DW Governance
Summit 2011 ods edw technical
Data Quality in Data Warehouse and Business Intelligence Environments - Disc...
Education Data Warehouse System
Business Intelligence Overview
The final frontier
Data Warehouse for data analytics presentation
2DATA WAREHOUSING SYSTEMAchyut Sai Chillara50215.docx
Chapter 2-data-warehousingppt2517 vero
Data warehousing change in a challenging environment
Re tooling for data management-support
Application of Data Warehousing & Data Mining to Exploitation for Supporting ...
sharon - cv
IOUG93 - Technical Architecture for the Data Warehouse - Paper
TOP_PASSOS GOV DADOS.pdf
Lesson 3 - The Kimbal Lifecycle.pptx

Recently uploaded (20)

PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PDF
Univ-Connecticut-ChatGPT-Presentaion.pdf
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PPTX
cloud_computing_Infrastucture_as_cloud_p
PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
Spectral efficient network and resource selection model in 5G networks
PPTX
Tartificialntelligence_presentation.pptx
PDF
A comparative analysis of optical character recognition models for extracting...
PDF
Mushroom cultivation and it's methods.pdf
PPTX
Spectroscopy.pptx food analysis technology
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PDF
Approach and Philosophy of On baking technology
PDF
Encapsulation_ Review paper, used for researhc scholars
PDF
gpt5_lecture_notes_comprehensive_20250812015547.pdf
PDF
A comparative study of natural language inference in Swahili using monolingua...
PPTX
1. Introduction to Computer Programming.pptx
PPTX
TLE Review Electricity (Electricity).pptx
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PDF
Heart disease approach using modified random forest and particle swarm optimi...
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
Univ-Connecticut-ChatGPT-Presentaion.pdf
Reach Out and Touch Someone: Haptics and Empathic Computing
cloud_computing_Infrastucture_as_cloud_p
Unlocking AI with Model Context Protocol (MCP)
Spectral efficient network and resource selection model in 5G networks
Tartificialntelligence_presentation.pptx
A comparative analysis of optical character recognition models for extracting...
Mushroom cultivation and it's methods.pdf
Spectroscopy.pptx food analysis technology
Building Integrated photovoltaic BIPV_UPV.pdf
Approach and Philosophy of On baking technology
Encapsulation_ Review paper, used for researhc scholars
gpt5_lecture_notes_comprehensive_20250812015547.pdf
A comparative study of natural language inference in Swahili using monolingua...
1. Introduction to Computer Programming.pptx
TLE Review Electricity (Electricity).pptx
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
Heart disease approach using modified random forest and particle swarm optimi...

Data warehouse 101-fundamentals-

  • 1. Enterprise Data Warehouse Fundamentals 101 KIDS Phase II Project Mojo Nwokoma Director , Enterprise Data Systems Architecture Office of Assessment & Information Services Oregon Department of Education 503-378-3600 x2242 [email_address]
  • 2. What is Enterprise DW/BI Solutions Data Warehouse (DW) is a collection of integrated, subject oriented, time-variant, and non-volatile data from various sources into a single and consistent warehouse that supports reporting, analysis, and decision making within the enterprise. Integrates operational data through consistent naming conventions, measurements, physical attributes, and semantics. Business Intelligence (BI) solutions use a blending of technologies, including relational and multi-dimansional databases, client/server architecture, and graphical user interface, to integrate disparate data sources into a single coherent framework for real-time reporting, drill-through analysis and decision support.
  • 3. KIDS Phase I Project Report: The Business Case for Change NCLB & Federal Accountability: The reporting and performance requirements of NCLB demands a fundamental change in the statewide data collection and reporting. The need to report individual student achievement and aggregation of data by subgroups. Statewide Accountability & Efficiency: Major gaps and inefficiencies exist in information collection, reporting and analysis at both district and state levels that need to be addressed. Funding formula compliance and equity, and ability to evaluate relative effectiveness/ineffectiveness of education programs. Student & Community Service: Growing stakeholder demand for a significant improvement in student records availability, accessibility, portability, and accuracy. Cost Efficiency Gains: Evidence of economies of scale resulting in cost reductions in the management of administrative systems at the larger school districts and ESD structure. Source: KIDS Phase 1 Final report: ODE & IBM Confidential (10/20/05)
  • 4. KIDS Phase II Project Planning: 5 key questions for a successful project planning & implementation What? When? Who? Why? How? SUCCESS Deliverables/Scope Stakeholders Resources - Business Case Time Frame
  • 5. The Essential Building Blocks for a Successful Enterprise Information Management Project
  • 6. Enterprise Data Warehouse Architecture District Data Warehouse STAGING - Integration - Metadata - Cleansed Profiled Biz Rules Data Mart SIS Data Mart FINANCE Data Mart Transportation Data Mart Instruction HR SIS Curriculum & Instruction Finance Nutrition Extraction Phase Transformation Phase Load Phase Data Management H/W Server Platform Applications
  • 7. KIDS Phase II Project District/ESD Server Deployment & Data Warehouse Integration Architecture Transaction System Transaction System Transaction System Transaction System Transaction System ODE (State) DW Physical & Virtual ODS = Operational Data Store DW = Data Warehouse ODS ODS ODS ODS ODS Hillsboro District DW Beaverton District DW Portland District DW Eugene District DW ESDs DW LEGENDS: KIDS Work = ODE Districts Record Exchange
  • 8. Project Planning Methodology – “The How?” For the Project Team Step 1: Define the Work Breakdown Structure  The first is to create a comprehensive Work Breakdown Structure (WBS). The WBS lists all the phases, activities and tasks required to undertake the project. Identify and describe each phase, activity and task required to complete the project successfully. Depict the order in which the tasks must be undertaken and identify any key internal and external project dependencies. Also list the critical project milestones, such as the completion of key project deliverables Step 2: Identify the Required Resources   Having listed all of the tasks required to undertake the project, you now need to identify the generic resources required to complete each task. Examples of types of resource include: full-time and part-time staff, contractors, equipment and materials. For each resource type, identify the quantity required, the delivery dates and the project tasks in the WBS that the resource will be used to help complete. Step 3: Construct a Project Schedule To construct your schedule, you need to: List the phases, activities and tasks Sequence the phases, activities and tasks Add key internal and external dependencies Allocate relevant completion timeframes Add additional contingency to mitigate risk Assign resources required to complete tasks List critical delivery milestones Specify any assumptions and constraints
  • 9. Current Data Environment Lack of granular, integrated, accurate, standardized, and timely data regarding student performance and achievement both for individual students and specific student subgroups. Lack of data and information integration between instructional management sources and assessment sources aligned to the state’s specific educational standards. Other factors affected include attendance record, discipline, teacher qualification, classroom size, and instructional hours. Lack of close collaboration between the districts and ODE in tracking students as they move through the educational system, both vertically and horizontally, in order to improve performance by identifying actionable indicators. Lack of specific multi-year student performance information to support longitudinal analysis, accessible at the State, District, and school levels with appropriate controls to assure confidentiality. Need for data and tool standardization between all reporting districts to ensure accurate, consistent, and useful analytical input for decision making purposes. No single version of the truth exists for business rules and data definitions among various data sources. Lack of easily validated financial information that accurately reports budgeted vs actual expenditures by program that allows correlation of these expenditures to student performances. Lack of appropriately controlled online access to information for all stakeholders regarding student progress and school quality.
  • 10. Problems with Current Decision Support Data redundancy and process redundancy Data is not integrated and cannot be shared Data is not understood or misunderstood Inconsistent data definitions & business rules Data retrieval is difficult and time consuming Operational files may not contain history Reports are inconsistent in content & format Data is too dirty for business analysis Multiple versions of the truth Reports and associated BI tools are not standardized.
  • 11. Recommended Model for Enterprise Data Warehouse System KIDS Phase 11 Project E-Portal Data Warehouse Operational Data Store ODS Transactional Database Educational Stakeholder Communication Benchmarking/Decision Support District & State Reporting Day to Day Operations PK-12 Data Model for Information Management (ODE & IBM Confidential) 10/20/05
  • 12. Standards Governance (setting priorities) Data naming, aliases, abbreviations list Meta data capture and maintenance Data quality and data management Testing standards Security standards Measuring results (benefits, costs, usage) Service level agreements Rules and protocols to be followed by all users and developers for all applications
  • 13. DW roles & responsibilities Business User (Client) Business User Support Data Administrator Data Analysts Meta Data Administrator Database Administrator Developers ETL BI – Reports, Queries
  • 14. DW roles & responsibilities (continued) Security Officer Auditor Data Warehouse Project Manager Technical Services DW Architect Technical Advisory Board Steering Committee
  • 15. Information quality Data is accurate Data is consistent Data is timely Data is integrated Data is complete Data values follow the business rules Data corresponds to valid values Data is well understood
  • 16. Data Warehouse/ODS & BI Layers Data Integration Layer & Operational Data Store (ODS): ODS is a non queryable centralized staging areas for storing extracted, cleansing, and transformed data, and for gathering centralized metadata for implementing an Enterprise Data Mart Architecture (EDMA), eliminating the need for another non queryable staging area called data warehouse. Needed is a dimensionally modeled Data Warehouse for enterprise DSS, prepared to provide the best in query response performance and to support the most advanced OLAP functionalities.
  • 17. Meta data components Data name (entity, attribute, table, column, field, etc.) Business description of data (Project Start Date) Source of data (file, field) Business Owner of data Business rules Transformation rules Domains (allowable values) Data relationships
  • 18. Meta data components (continued) Data quality (measure of reliability) Timeliness (ex: current as of certain date) Historical information Aggregation rules Security (who has access)
  • 19. Meta data management Meta data administration Business meta data Technical meta data ETL reconciliation Data quality metrics Standardization Data ownership Enterprise integration Meta Data = [Descriptive] Data About Data [of the Business] Information = Data within context Context = Meta data Information = Data + Meta data
  • 20. KIDS Phase II Project 05-07 Workflow Review and validate business case for Phase II project Identify key stakeholders. Validate requirements, Deliverables, and Expectations Identify two or three key districts with viable data warehouse infrastructure as test sites. Review enterprise architecture, and identify Infrastructure components Transaction Level Applications Operating & Database systems Hardware Platforms 3. Design Data Warehouse & Operational Data Store (ODS) Data Model Design Dimensional Modeling Schemas Configure Extraction, Transformation, Load (ETL) Metadata Capture (Repository or Data labels) Data Quality Profiling Vendor selection will be based on a competitive “Bake-off” results from three top vendors. 4. Data Reporting & Information Delivery Mechanism Leverage existing reporting On-line Analytical Processing (OLAP) infrastructure Design and implement subject-area data marts for effective horizontal reporting integration Develop OLAP cubes for report aggregation and slice/dice querying investigations Deploy enterprise portal with built-in security and user access authentication. 5. End-user Training Design and schedule end-user training at all levels of data and reporting needs. Identify “Train-the-Trainer” candidates from each school district for more detailed training.
  • 21. KIDS Phase II Project Key System Deliverables Transactional Systems/District Data Warehouse Integrating systems that support instructional management that empowers teachers to combine student performance and instructional data to make informed classroom decisions. Districts will be better able to meet NCLB and state standards for having “highly qualified” teachers in the classroom. 2. Integrated & Interoperable Operational Data Store (ODS) Provide ability to evaluate student performance within selected programs across various schools and districts in the State, and highlight ways to achieve AYP consistently over time. Allows for quick turnaround in transferring students records when they move between districts. Ease reporting burden on districts, and eliminate redundant and possibly inaccurate reporting of data, and a better foundation for integrated data analysis. 3. Data Warehouse & Decision Support System /Tools A repository for State and district reporting and analysis even at student-level data. Allows for more meaningful system-level questions and answers by legislators or policy makers. Greater system accessibility to all users with relevant security access privileges will mean greater acceptance and use, and ultimately better decision. 4. System Wide Communication Portal Provide a focused location for access to information, analytical/reporting tools, and the necessary training and support. Also, maximization of effective system-wide use of state’s data warehouse. Rapid and wide dissemination of integrated and proven instructional and administrative practices.
  • 22. KIDS Phase II Project High-level Project Work Plan, Time-line, & Resource requirements Project Phase: Time-line Resource Requirements Requirements Validation January 10, 2006 Mojo & Gary Scheduled trips to all Districts & ESDs 3. “ Test Site ” DW/ODS Modeling, June 30, 2006 Database Administrator integration, ETL, Data Quality, Data Modeler/Analysts Meta Data Repository, and Data Quality Analysts Vendor “Bake-off” contracting Business User (Client) Meta Data Administrator ETL & BI developers Data Warehouse Project Manager Inauguration of Governance & January 30, 2006 Doug Kosty & Mojo Nwokoma Project Team committee members 4. OLAP & Portal Development, October 30, 2006 End-user Business Analyst including Training, and Web Developer vendor “Bake-off” Contracting Portal Dashboard developer Business User (Client) BI OLAP Report Developer “ Train-the-trainer” Data Security Officer
  • 23. BI - OLAP Data Warehouse Architecture
  • 24. User expectations Expectations must be managed in terms of: Schedule Budget Scope Performance Availability Simplicity (ease of use) Tool functionality Data cleanliness Users’ roles and responsibilities
  • 25. User responsibilities Be a full-time member of the Core Team Participate in all data modeling sessions Co-manage the DW project Make decisions and escalate disputes to the Steering Committee Provide meta data for business objects Identify data security requirements Participate in BI tool selection Participate in all review sessions Participate in all testing activities
  • 26. IT Staffing DW roles & responsibilities Dedicated IT team New skill set – beyond tools, discipline Contractors & consultants Knowledge transfer Training Just in time Just enough Knowledge transfer through collaboration
  • 27. Risks to be mitigated  Low management commitment  Low user commitment  Unrealistic schedule  Unrealistic user expectations  Budget too small  Untrained or unavailable staff
  • 28. Risks to be mitigated  Unclear or changing requirements  Poor project management Creeping scope Initial project too large Wrong project  Changing priorities  Data cleansing not addressed early  Vendors out of control
  • 29. Risks to be mitigated  Not architected properly (wrong design)  Inappropriate organization structure  Lost or changed sponsor  New technology not understood  No procedure to resolve disputes  Exceeding platform capabilities

Editor's Notes

  • #17: Data Integration Layer – ODS: ODS is a non-queryable centralized staging areas for storing extracted, cleansed, and transformed data, and for gathering centralized metadata for implementing an Enterprise Data Mart Architecture (EDMA), eliminating the need for another non-queryable staging area called data warehouse. Needed is a dimensionally modeled data warehouse for enterprise DSS, prepared to provide the best in query response performance and to support the most advanced OLAP functionalities.