DWH-Ahsan Abdullah
Data Warehousing
Lecture-3
Introduction and Background
Virtual University of Pakistan
Ahsan Abdullah
Assoc. Prof. & Head
Center for Agro-Informatics Research
www.nu.edu.pk/cairindex.asp
FAST National University of Computers & Emerging Sciences, Islamabad
What is a Data Warehouse?
It is a blend of many technologies, the basic concept being:
• Take all data from different operational systems.
• If necessary, add relevant data from industry.
• Transform all data and bring it into a uniform format.
• Integrate all data as a single entity.
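The four steps above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the two source systems (`billing_rows`, `branch_rows`), their field names, and their date and currency formats are hypothetical and do not come from the lecture.

```python
from datetime import date, datetime

# Hypothetical extracts from two operational systems that record the same
# kind of fact (a customer payment) in different formats.
billing_rows = [
    {"cust_id": "C001", "amount": "1,250.50", "paid_on": "03/01/2024"},  # DD/MM/YYYY
]
branch_rows = [
    {"customer": "c002", "amt_paisa": 99900, "date": "2024-01-05"},  # ISO date
]


def from_billing(row):
    """Transform a billing-system row into the uniform warehouse format."""
    return {
        "customer_id": row["cust_id"].upper(),
        "amount": float(row["amount"].replace(",", "")),
        "paid_on": datetime.strptime(row["paid_on"], "%d/%m/%Y").date(),
        "source": "billing",
    }


def from_branch(row):
    """Transform a branch-system row into the uniform warehouse format."""
    return {
        "customer_id": row["customer"].upper(),
        "amount": row["amt_paisa"] / 100,  # paisa -> rupees
        "paid_on": date.fromisoformat(row["date"]),
        "source": "branch",
    }


def integrate(billing, branch):
    """Combine both sources into a single, consistently ordered data set."""
    rows = [from_billing(r) for r in billing] + [from_branch(r) for r in branch]
    return sorted(rows, key=lambda r: (r["paid_on"], r["customer_id"]))


warehouse_rows = integrate(billing_rows, branch_rows)
```

After integration, every row has the same keys, the same date type, and the same currency unit, whichever system it came from.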
What is a Data Warehouse? (Cont…)
It is a blend of many technologies, the basic concept being:
• Store data in a format supporting easy access for decision support.
• Create performance-enhancing indices.
• Implement performance-enhancing joins.
• Run ad-hoc queries with low selectivity.
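As a rough sketch of these points, the following uses Python's built-in `sqlite3` module with an in-memory database. The `customer` and `sales` tables, their columns, and the index name are hypothetical, and SQLite is chosen only because it ships with Python. The plain join here stands in for a decision-support query; it does not implement the specialized join techniques the slide alludes to.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE customer (customer_id INTEGER PRIMARY KEY, city TEXT);
    CREATE TABLE sales (
        sale_id     INTEGER PRIMARY KEY,
        customer_id INTEGER REFERENCES customer(customer_id),
        sale_year   INTEGER,
        amount      REAL
    );
    -- Index on the columns used for joining and grouping.
    CREATE INDEX idx_sales_customer_year ON sales (customer_id, sale_year);
""")
conn.executemany(
    "INSERT INTO customer VALUES (?, ?)",
    [(1, "Islamabad"), (2, "Lahore")],
)
conn.executemany(
    "INSERT INTO sales (customer_id, sale_year, amount) VALUES (?, ?, ?)",
    [(1, 2023, 100.0), (1, 2024, 150.0), (2, 2024, 200.0), (2, 2024, 50.0)],
)

# An ad-hoc analytical query: it touches every sales row rather than
# looking up a single record, which is typical of decision support.
query = """
    SELECT c.city, s.sale_year, SUM(s.amount) AS total
    FROM sales AS s
    JOIN customer AS c ON c.customer_id = s.customer_id
    GROUP BY c.city, s.sale_year
    ORDER BY c.city, s.sale_year
"""
totals = conn.execute(query).fetchall()
```

Whether a given index actually speeds up such a query depends on the database engine and the data volume, so treat the index above as a placeholder for whatever the chosen platform recommends.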
How is it Different?
• Fundamentally different
[Figure: the traditional reporting cycle. Business user needs info → user requests IT people → IT people do system analysis and design → IT people create reports → IT people send reports to business user → business user may get answers → answers result in more questions.]
How is it Different?
• Different patterns of hardware utilization
[Figure: hardware utilization (0% to 100%) for Operational vs. DWH systems.]
Bus Service vs. Train
How is it Different?
• Combines operational and historical data.
  • Data is not entered directly into a DWH; OLTP or ERP systems are the source systems.
  • OLTP systems don't keep history; for example, you can't get a balance statement more than a year old.
  • A DWH keeps historical data, even of bygone customers. Why?
  • In the context of a bank, we want to know why the customer left.
  • What were the events that led to his/her leaving? Why?
  • Customer retention.
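The contrast between the two kinds of system can be sketched as follows. This is a minimal illustration, assuming a hypothetical bank account `A-1` and made-up balances: the operational store keeps only the current row, while the warehouse appends a dated record for every change.

```python
from datetime import date

# Operational (OLTP) view: one current row per account, overwritten in place.
oltp_accounts = {}
# Warehouse view: an append-only list of dated records.
dwh_history = []


def apply_change(account_id, status, balance, as_of):
    """Overwrite the operational row and append a record to the warehouse."""
    oltp_accounts[account_id] = {"status": status, "balance": balance}
    dwh_history.append(
        {"account_id": account_id, "status": status,
         "balance": balance, "as_of": as_of}
    )


def close_account(account_id, as_of):
    """The operational system drops the account; the warehouse keeps its trail."""
    oltp_accounts.pop(account_id, None)
    dwh_history.append(
        {"account_id": account_id, "status": "closed",
         "balance": 0.0, "as_of": as_of}
    )


apply_change("A-1", "active", 5000.0, date(2023, 1, 31))
apply_change("A-1", "active", 1200.0, date(2023, 6, 30))
close_account("A-1", date(2023, 7, 15))

# The customer is gone from the operational system, but the events that
# led up to the closure can still be analyzed in the warehouse.
events_before_leaving = [r for r in dwh_history if r["account_id"] == "A-1"]
```

Here the falling balance before closure is exactly the kind of signal a retention analysis would look for, and it survives only in `dwh_history`.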
How much history?
• Depends on:
  • Industry.
  • Cost of storing historical data.
  • Economic value of historical data.
How much history?
• Industries and history
  • Telecom: call volumes are far higher than bank transaction volumes, so 18 months.
  • Retailers: interested in analyzing yearly seasonal patterns, so 65 weeks.
  • Insurance: companies want to do actuarial analysis, using the historical data to predict risk, so 7 years.
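The retention windows quoted above can be turned into a simple cutoff check. This is a sketch only: the `RETENTION_DAYS` table and both function names are hypothetical, and months and years are approximated as 30 and 365 days.

```python
from datetime import date, timedelta

# Retention windows from the slide, converted to days.
RETENTION_DAYS = {
    "telecom": 18 * 30,     # 18 months, approximating a month as 30 days
    "retail": 65 * 7,       # 65 weeks = one year (52 weeks) plus 13 weeks
    "insurance": 7 * 365,   # 7 years, ignoring leap days
}


def retention_cutoff(industry, today):
    """Return the oldest date still inside the retention window."""
    return today - timedelta(days=RETENTION_DAYS[industry])


def within_retention(record_date, industry, today):
    """True if a record of this date should still be kept in the warehouse."""
    return record_date >= retention_cutoff(industry, today)


today = date(2024, 12, 31)
keep_in_retail = within_retention(date(2023, 1, 1), "retail", today)
keep_in_insurance = within_retention(date(2023, 1, 1), "insurance", today)
```

The same record from 1 January 2023 falls outside the retail window but well inside the insurance window, which is the point of the slide: the right amount of history depends on the industry.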
How much history?
Economic value of data vs. storage cost
Is a data warehouse a complete repository of data?
How is it Different?
• Usually (but not always) periodic or batch updates rather than real-time.
  • The boundary is blurring with active data warehousing.
  • For an ATM, if updates are not in real time, there is a lot of real trouble.
  • A DWH is for strategic decision making based on historical data. It won't hurt if the transactions of the last hour or day are absent.
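A periodic load can be sketched as a staging buffer that is flushed on a schedule. The class below is a hypothetical illustration, not a description of any particular product: transactions are captured as they happen but only become visible to analytical queries after the next batch load.

```python
from datetime import datetime


class BatchLoadedWarehouse:
    """Toy warehouse that receives data in periodic batches."""

    def __init__(self):
        self.pending = []  # transactions captured since the last load
        self.loaded = []   # transactions visible to analytical queries

    def capture(self, txn):
        """Stage a transaction; it is not yet visible to queries."""
        self.pending.append(txn)

    def run_batch_load(self):
        """Move all staged transactions into the warehouse; return the count."""
        count = len(self.pending)
        self.loaded.extend(self.pending)
        self.pending.clear()
        return count

    def total_amount(self):
        """An analytical query over loaded data only."""
        return sum(t["amount"] for t in self.loaded)


dwh = BatchLoadedWarehouse()
dwh.capture({"account": "A-1", "amount": 100.0, "at": datetime(2024, 1, 5, 9, 30)})
dwh.capture({"account": "A-2", "amount": 200.0, "at": datetime(2024, 1, 5, 9, 45)})

total_before_load = dwh.total_amount()  # the last hour's data is absent
rows_loaded = dwh.run_batch_load()      # e.g. run nightly
total_after_load = dwh.total_amount()
```

For strategic analysis the gap between capture and load is usually acceptable; an ATM, by contrast, would need the balance updated at capture time.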
How is it Different?
• Rate of update depends on:
  • volume of data,
  • nature of business,
  • cost of keeping historical data,
  • benefit of keeping historical data.

