SlideShare a Scribd company logo
IBM ICE (Innovation Centre for Education)
Welcome to:
Introduction to Data Warehousing
© Copyright IBM Corporation 2017 9.1
Unit objectives IBM ICE (Innovation Centre for Education)
IBM Power Systems
After completing this unit, you should be able to:
• Know what data warehouse is
• Describe the architecture of data warehouse.
• Explain about data staging and ETL.
• Understand the need of content management.
© Copyright IBM Corporation 2017
An introduction to data warehousing IBM ICE (Innovation Centre for Education)
IBM Power Systems
• Data warehousing is a processing a huge amount of electronic data stored in recent
years and use that data to accomplish goals that go beyond the routine tasks linked
to daily processing.
• A data warehouse is a collection of data that supports decision-making processes. It
provides the following features
–It is subject-oriented.
–It is integrated and consistent.
–It shows its evolution over time and it is not volatile.
• It used to support management
– Decision-making processes
– Business intelligence.
– Emerge the information and knowledge needed to effectively manage the organization.
– Investigation of key challenges and research directions for this discipline.
© Copyright IBM Corporation 2017
Data warehouse architectures IBM ICE (Innovation Centre for Education)
IBM Power Systems
The following are the architecture properties of data warehouse system
• Separation
• Scalability
• Extensibility
• Security
• Administerability
© Copyright IBM Corporation 2017
Data warehouse architectures IBM ICE (Innovation Centre for Education)
IBM Power Systems
The figure represents a Single-Layer Architecture of a Data Warehouse.
© Copyright IBM Corporation 2017
Data warehouse architectures IBM ICE (Innovation Centre for Education)
IBM Power Systems
• Two-Layer Architecture
• Two-layer architecture to highlight a separation between
– Physically available sources
– Data warehouses, it actually consists of four subsequent data flow stages
• Source layer
• It uses heterogeneous sources of data.
• Data is originally stored to
–Corporate relational databases
–Legacy databases
–Information systems outside the corporate walls.
© Copyright IBM Corporation 2017
Data warehouse architectures IBM ICE (Innovation Centre for Education)
IBM Power Systems
• Data staging
• Data warehouse layer
• Analysis
© Copyright IBM Corporation 2017
Data warehouse architectures IBM ICE (Innovation Centre for Education)
IBM Power Systems
The Figure represents the
Data warehouse on a non
single tier architecture.
© Copyright IBM Corporation 2017
Data warehouse architectures IBM ICE (Innovation Centre for Education)
IBM Power Systems
• Three-Layer Architecture :
• In this architecture, the third layer is the reconciled data layer or operational data
store.
• This layer materializes operational data obtained after integrating and cleansing
source data.
– So those data are integrated, consistent, correct, current, and detailed.
– It sharply separates the problems of source data extraction and integration from those of data
warehouse population.
– Reconciled data leads to more redundancy of operational source data
© Copyright IBM Corporation 2017
Data warehouse architectures IBM ICE (Innovation Centre for Education)
IBM Power Systems
Data Staging and ETL
• ETL consists of four separate phases:
– extraction (or capture),
– cleansing (or cleaning or scrubbing),
– transformation,
– loading.
• In a three-layer architecture, ETL processes actually feed the reconciled data layer
in to:
– A single ,
– Detailed ,
– Comprehensive ,
– Top-quality data source that in its turn feeds the data warehouse.
© Copyright IBM Corporation 2017
Decision support applications IBM ICE (Innovation Centre for Education)
IBM Power Systems
• Successfully supporting managerial decision making has become critically
dependent upon the
– Availability of integrated.
– High quality information organized .
– Presented to managers in a timely and easily understood manner.
• Data warehouses have emerged to meet this need.
– Surrounded by analytical tools and models.
– data warehouses have the potential to transform operational data into business intelligence.
• Content Management
© Copyright IBM Corporation 2017
Unit summary IBM ICE (Innovation Centre for Education)
IBM Power Systems
Having completed this unit, you should be able to
• Know what data warehouse is
• Describe the architecture of data warehouse.
• Explain about data staging and ETL.
• Understand the need of content management.

More Related Content

PDF
Technical Comuting Solutions Made Simple - ISC13 IBM System x Solution
PPTX
Datawarehouse org
PPTX
Data center architure ppts
PPTX
Data warehouseold
PDF
DWH Concepts
PPTX
DATA WAREHOUSING
PPTX
Data warehousing and data mart
PPTX
data mining
Technical Comuting Solutions Made Simple - ISC13 IBM System x Solution
Datawarehouse org
Data center architure ppts
Data warehouseold
DWH Concepts
DATA WAREHOUSING
Data warehousing and data mart
data mining

What's hot (8)

PPTX
data mining
PDF
Eight styles of data integration
PPT
Data warehouse presentation
PPT
Ch1 data-warehousing
PPT
Ch1 data-warehousing
PDF
Enterprise Information Integration at LondonMet
PDF
Data integration
PDF
A Framework for Data Collection, Transformation and Processing in industrial...
data mining
Eight styles of data integration
Data warehouse presentation
Ch1 data-warehousing
Ch1 data-warehousing
Enterprise Information Integration at LondonMet
Data integration
A Framework for Data Collection, Transformation and Processing in industrial...
Ad

Similar to Chapter 1 (20)

PPTX
UNIT 2 DATA WAREHOUSING AND DATA MINING PRESENTATION.pptx
PPT
Data Warehouse
PPTX
Business intelligence-sharda-dss10-ppt-03-pptx.pptx
PPT
Data Warehouse By Piyush
PPTX
Data Lake Overview
PPTX
module 1 DWDM (complete) chapter ppt.pptx
DOCX
Business Intelligence, Analytics, and Data Science A Managerial
PPTX
Data Lakehouse, Data Mesh, and Data Fabric (r1)
PPTX
Is the traditional data warehouse dead?
PPTX
DATA WAREHOUSE.pptx
PPTX
sharda_dss10_ppt_03_GE-211566.pptx000000000
PPT
Data Warehouse Basic Guide
PPTX
Data Warehousing about data ware house.pptx
PPTX
Hadoop and Your Data Warehouse
PPT
SUPERB DATA WAREHOUSE.ppt
PPT
DATASTAGE AND QUALITY STAGE 9.1 ONLINE TRAINING
PPT
Dwh basics datastage online training
PDF
Rando Veizi: Data warehouse and Pentaho suite
PPTX
04OLAPV2 from the course data warehousing
PDF
SALES BASED DATA EXTRACTION FOR BUSINESS INTELLIGENCE
UNIT 2 DATA WAREHOUSING AND DATA MINING PRESENTATION.pptx
Data Warehouse
Business intelligence-sharda-dss10-ppt-03-pptx.pptx
Data Warehouse By Piyush
Data Lake Overview
module 1 DWDM (complete) chapter ppt.pptx
Business Intelligence, Analytics, and Data Science A Managerial
Data Lakehouse, Data Mesh, and Data Fabric (r1)
Is the traditional data warehouse dead?
DATA WAREHOUSE.pptx
sharda_dss10_ppt_03_GE-211566.pptx000000000
Data Warehouse Basic Guide
Data Warehousing about data ware house.pptx
Hadoop and Your Data Warehouse
SUPERB DATA WAREHOUSE.ppt
DATASTAGE AND QUALITY STAGE 9.1 ONLINE TRAINING
Dwh basics datastage online training
Rando Veizi: Data warehouse and Pentaho suite
04OLAPV2 from the course data warehousing
SALES BASED DATA EXTRACTION FOR BUSINESS INTELLIGENCE
Ad

Recently uploaded (20)

PPT
Chapter 3 METAL JOINING.pptnnnnnnnnnnnnn
PPTX
Business Acumen Training GuidePresentation.pptx
PPTX
advance b rammar.pptxfdgdfgdfsgdfgsdgfdfgdfgsdfgdfgdfg
PPTX
IBA_Chapter_11_Slides_Final_Accessible.pptx
PDF
Lecture1 pattern recognition............
PPTX
Business Ppt On Nestle.pptx huunnnhhgfvu
PPTX
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
PDF
Introduction to Business Data Analytics.
PDF
Clinical guidelines as a resource for EBP(1).pdf
PPTX
Computer network topology notes for revision
PPT
Miokarditis (Inflamasi pada Otot Jantung)
PPTX
Introduction-to-Cloud-ComputingFinal.pptx
PDF
.pdf is not working space design for the following data for the following dat...
PDF
TRAFFIC-MANAGEMENT-AND-ACCIDENT-INVESTIGATION-WITH-DRIVING-PDF-FILE.pdf
PDF
Launch Your Data Science Career in Kochi – 2025
PPT
Quality review (1)_presentation of this 21
PPTX
climate analysis of Dhaka ,Banglades.pptx
PPTX
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...
PPTX
IB Computer Science - Internal Assessment.pptx
PPT
Reliability_Chapter_ presentation 1221.5784
Chapter 3 METAL JOINING.pptnnnnnnnnnnnnn
Business Acumen Training GuidePresentation.pptx
advance b rammar.pptxfdgdfgdfsgdfgsdgfdfgdfgsdfgdfgdfg
IBA_Chapter_11_Slides_Final_Accessible.pptx
Lecture1 pattern recognition............
Business Ppt On Nestle.pptx huunnnhhgfvu
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
Introduction to Business Data Analytics.
Clinical guidelines as a resource for EBP(1).pdf
Computer network topology notes for revision
Miokarditis (Inflamasi pada Otot Jantung)
Introduction-to-Cloud-ComputingFinal.pptx
.pdf is not working space design for the following data for the following dat...
TRAFFIC-MANAGEMENT-AND-ACCIDENT-INVESTIGATION-WITH-DRIVING-PDF-FILE.pdf
Launch Your Data Science Career in Kochi – 2025
Quality review (1)_presentation of this 21
climate analysis of Dhaka ,Banglades.pptx
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...
IB Computer Science - Internal Assessment.pptx
Reliability_Chapter_ presentation 1221.5784

Chapter 1

  • 1. IBM ICE (Innovation Centre for Education) Welcome to: Introduction to Data Warehousing © Copyright IBM Corporation 2017 9.1
  • 2. Unit objectives IBM ICE (Innovation Centre for Education) IBM Power Systems After completing this unit, you should be able to: • Know what data warehouse is • Describe the architecture of data warehouse. • Explain about data staging and ETL. • Understand the need of content management. © Copyright IBM Corporation 2017
  • 3. An introduction to data warehousing IBM ICE (Innovation Centre for Education) IBM Power Systems • Data warehousing is a processing a huge amount of electronic data stored in recent years and use that data to accomplish goals that go beyond the routine tasks linked to daily processing. • A data warehouse is a collection of data that supports decision-making processes. It provides the following features –It is subject-oriented. –It is integrated and consistent. –It shows its evolution over time and it is not volatile. • It used to support management – Decision-making processes – Business intelligence. – Emerge the information and knowledge needed to effectively manage the organization. – Investigation of key challenges and research directions for this discipline. © Copyright IBM Corporation 2017
  • 4. Data warehouse architectures IBM ICE (Innovation Centre for Education) IBM Power Systems The following are the architecture properties of data warehouse system • Separation • Scalability • Extensibility • Security • Administerability © Copyright IBM Corporation 2017
  • 5. Data warehouse architectures IBM ICE (Innovation Centre for Education) IBM Power Systems The figure represents a Single-Layer Architecture of a Data Warehouse. © Copyright IBM Corporation 2017
  • 6. Data warehouse architectures IBM ICE (Innovation Centre for Education) IBM Power Systems • Two-Layer Architecture • Two-layer architecture to highlight a separation between – Physically available sources – Data warehouses, it actually consists of four subsequent data flow stages • Source layer • It uses heterogeneous sources of data. • Data is originally stored to –Corporate relational databases –Legacy databases –Information systems outside the corporate walls. © Copyright IBM Corporation 2017
  • 7. Data warehouse architectures IBM ICE (Innovation Centre for Education) IBM Power Systems • Data staging • Data warehouse layer • Analysis © Copyright IBM Corporation 2017
  • 8. Data warehouse architectures IBM ICE (Innovation Centre for Education) IBM Power Systems The Figure represents the Data warehouse on a non single tier architecture. © Copyright IBM Corporation 2017
  • 9. Data warehouse architectures IBM ICE (Innovation Centre for Education) IBM Power Systems • Three-Layer Architecture : • In this architecture, the third layer is the reconciled data layer or operational data store. • This layer materializes operational data obtained after integrating and cleansing source data. – So those data are integrated, consistent, correct, current, and detailed. – It sharply separates the problems of source data extraction and integration from those of data warehouse population. – Reconciled data leads to more redundancy of operational source data © Copyright IBM Corporation 2017
  • 10. Data warehouse architectures IBM ICE (Innovation Centre for Education) IBM Power Systems Data Staging and ETL • ETL consists of four separate phases: – extraction (or capture), – cleansing (or cleaning or scrubbing), – transformation, – loading. • In a three-layer architecture, ETL processes actually feed the reconciled data layer in to: – A single , – Detailed , – Comprehensive , – Top-quality data source that in its turn feeds the data warehouse. © Copyright IBM Corporation 2017
  • 11. Decision support applications IBM ICE (Innovation Centre for Education) IBM Power Systems • Successfully supporting managerial decision making has become critically dependent upon the – Availability of integrated. – High quality information organized . – Presented to managers in a timely and easily understood manner. • Data warehouses have emerged to meet this need. – Surrounded by analytical tools and models. – data warehouses have the potential to transform operational data into business intelligence. • Content Management © Copyright IBM Corporation 2017
  • 12. Unit summary IBM ICE (Innovation Centre for Education) IBM Power Systems Having completed this unit, you should be able to • Know what data warehouse is • Describe the architecture of data warehouse. • Explain about data staging and ETL. • Understand the need of content management.