SlideShare a Scribd company logo
BY
Kushal Singh
Acute Informatics Pvt
What is Business
Intelligence?
BI is an abbreviation of the two words     
Business Intelligence, bringing the right 
information at the right time to the right 
people in the right format.
What is Data
Warehousing?
Data Warehouse is a subject-oriented,
integrated, nonvolatile and timevariant collection of data in support of
management’s decisions.
Kushal Data Warehousing PPT
What is Business Intelligence?
 The architecture 
Operational
data source1

High
summarized data

Meta-data
Operational
data source 2

Reporting, query,
application development,
and EIS(executive
information system) tools

Query Manage
Lightly
summarized
data

Load Manager

Operational
data source n

Operational
data store (ods)

DBMS

Detailed data

OLAP(online
analytical processing) tools

Warehouse Manager

Operational data store (ODS)
Data mining

Archive/backup
data

Typical architecture of a data warehouse

End-user
access tools
 The benefits of data
warehousing
• The potential benefits of data warehousing
are high returns on investment..
• substantial competitive advantage..
• increased productivity of corporate
decision-makers..
Data Warehouse
Characteristics
 Key Characteristics of a Data Warehouse
 Subject-oriented
 Integrated
 Time-variant
 Non-volatile

8
Subject Oriented
• Example for an insurance company :
Applications Area

Data Warehouse
Auto and Fire
Auto and Fire
Policy
Policy
Processing
Processing
Systems
Systems

Commercial
Commercial
and Life
and Life
Insurance
Insurance
Systems
Systems

Data

Data
Accounting
Accounting
System
System

Billing
Billing
System
System

Policy
Policy

Customer
Customer

Claims
Claims
Processing
Processing
System
System

Losses
Losses

Premium
Premium

9
Integrated
• Data is stored once in a single integrated location
(e.g. insurance company)
Auto Policy
Auto Policy
Processing
Processing
System
System

Customer
data
stored
in several
databases

Data Warehouse
Database

Fire Policy
Fire Policy
Processing
Processing
System
System
FACTS, LIFE
FACTS, LIFE
Commercial, Accounting
Commercial, Accounting
Applications
Applications

Subject = Customer

10
Time - Variant
Data is stored as a series of snapshots or views which record how it is
collected across time.
Data Warehouse Data

Time

Data

 
{

•

Key




Data is tagged with some element of time -  creation date, as of 
date, etc.
Data is available on-line for long periods of time for trend 
analysis and forecasting. For example, five or more years
11
Non-Volatile
• Existing data in the warehouse is not overwritten or
updated.

External
Sources
Production
Databases
Data
Data
Warehouse
Warehouse
Environment
Environment

Production
Production
Applications
Applications

• Update
• Insert
• Delete

Data
Warehouse
Database

• Load
• Read-Only

12
Comparision of OLTP systems and data
warehousing system
OLTP systems
Hold current data
Stores detailed data
Data is dynamic
Repetitive processing
High level of transaction throughput
Predictable pattern of usage
Transaction-driven
Application-orented
Supports day-to-day decisions
Serves large number of clerical/operation users

Data warehousing systems
Holds historical data
Stores detailed, lightly, and highly summarized
data
Data is largely static
Ad hoc, unstructured, and heuristic processing
Medium to how level of transaction throughput
Unpredictable pattern of usage
Analysis driven
Subject-oriented
supports strategic decisions
Serves relatively how number of managerial
users
OLTP
Online Transaction
Processing
On Line Transaction
Processing
• What is a Transaction ?
– A Logical unit of work
–
–
–

Examples:
Drawing Money from a bank account
Booking a seat on an airline
Transactions

• It is a unit of program execution that

accesses & possibly updates various data
items.
• A transaction is a logical unit of work that
performs some useful function for a user.
• In end of the transaction the system must
be:
– in the prior state (if the transaction fails) or
– the status of the system should reflect the
successful completion (if the transaction
succeeded).

• May take a database from one consistent
Characteristics of Transactions
A tomicity
C onsistency
I solation
D urability
OLAP
Online Analytical Processing
Types of OLAP
• ROLAP (Relational Online Analytical
Processing)
• MOLAP (Multidimensional Online
Analytical Processing)
• HOLAP (Hybrid Online Analytical
Processing)
ROLAP
• ROLAP (Relational online analytical
Processing)
• Used for reporting
• Tools: Report studio
MOLAP
• MOLAP (Multidimensional online
Analytical processing)
• Used to build cubes
• Tools: Powerplay, Transformer
HOLAP
• HOLAP (Hybrid online analytical

Processing)
• Used for Data modeling
• This will support both MOLAP and ROLAP
• Tools: Framework manager, Query Studio.
Dimensions
• It’s descriptive information about a

measures like product, location, customer
etc.
Types of Dimensions
• Confirmed Dimensions
• Degenerated Dimensions
• Junk Dimensions
Facts
• Fact is containing measures and IDs.
• Ex; Revenue, Cost, Amount etc
Measure Types
• Additive Measures: Which can be added

across all the dimensions
• Non Additive Measures: Which can not be
added across all the dimensions
• Semi Additive Measures: Which can be
added across some dimensions and which
can not be added across some other
dimensions
Schema’s In Data warehousing
•
•
•

STAR SHEMA
SNOW-FLAKE SCHEMA
STAR-FLAKE SCHEMA
Star Schema
Dimension Tables

Region_Dimension_Table
region _id
NE
NW
SE
SW

Product_Dimension_Table
prod_grp_id

prod_id

prod_grp_desc

prod_desc

10
20
30

100
140
220

Fewer devices
Circuit boards
Components

region _doc
Northeast
Northwest
Southeast
Southwest

account _id

Power supply
Motherboard
Co-processor

100000
110000
120000
130000
140000

account _doc
ABC Electronics
Midway Electric
Victor Components
Washburn, Inc.
Zerox

Account_Dimension_Table
month

prod_id

region_id

account_id

vend_id

net-sales

gross_sales

01-1996
02-1996
03-1996

100
140
220

SW
NE
SW

100000
110000
100000

100
200
300

30,000
23,000
32,000

50,000
42,000
49,000

Fact Table
Monthly_Sales_Summary_Table
month
01-1996
02-1996
03-1996

mo_in_fiscal_yr
4
5
6

month_name
January
February
March

Time_Dimension_Table

Vendor_Dimension_Table
vend_id
100
200
300

vendor_desc
PowerAge, Inc.
Advanced Micro Devices
Farad Incorporated

28
SNOW-FLAKE SCHEMA
Factless Fact Table
• It’s just a bridge between table where we used to join
tables.

• In this scenario we can only track the event.
SCD
(Slowly Changing Dimensions)
•
•
•
•

TYPE 0
TYPE 1
TYPE 2
TYPE 3
ETL
(Extract, Transform and Loading)

INFORMATICA
Kushal Data Warehousing PPT
Kushal Data Warehousing PPT
Kushal Data Warehousing PPT
Kushal Data Warehousing PPT
Kushal Data Warehousing PPT
Kushal Data Warehousing PPT
Kushal Data Warehousing PPT
Designing
FRAMEWORK
MANAGER
Relational Database
&
DMR
Kushal Data Warehousing PPT
Kushal Data Warehousing PPT
Kushal Data Warehousing PPT
REPORTING

IBM COGNOS
Kushal Data Warehousing PPT
Kushal Data Warehousing PPT
Kushal Data Warehousing PPT
Kushal Data Warehousing PPT
Kushal Data Warehousing PPT
Kushal Data Warehousing PPT
Kushal Data Warehousing PPT
Kushal Data Warehousing PPT
Kushal Data Warehousing PPT

More Related Content

PPT
Basics of Microsoft Business Intelligence and Data Integration Techniques
PPTX
BI Introduction
PPT
Bi presentation Designing and Implementing Business Intelligence Systems
PDF
OLTP vs OLAP
PPTX
Data warehousing
PDF
Delivering fast, powerful and scalable analytics
PPTX
Data warehouse
PPT
Datawarehouse & bi introduction
Basics of Microsoft Business Intelligence and Data Integration Techniques
BI Introduction
Bi presentation Designing and Implementing Business Intelligence Systems
OLTP vs OLAP
Data warehousing
Delivering fast, powerful and scalable analytics
Data warehouse
Datawarehouse & bi introduction

What's hot (20)

PPTX
Online analytical processing
PDF
Cognos datawarehouse
PDF
Business Intelligence Data Warehouse System
PPT
Choosing the Right Business Intelligence Tools for Your Data and Architectura...
PPTX
Role of Database Management System in A Data Warehouse
PDF
Introduction to Business Intelligence
PPT
1.4 data warehouse
PPTX
Data warehouse
PPT
Olap, oltp and data mining
PPTX
DATA MART APPROCHES TO ARCHITECTURE
PPTX
Data warehouse system and its concepts
PDF
Prague data management meetup 2017-02-28
PDF
Business Intelligence Presentation 1 (15th March'16)
PPT
OLAP Cubes in Datawarehousing
PPTX
Keys toSuccess: Business Intelligence Proven, Practical Strategies That Work
PPT
Retail Data Warehouse
PPT
Date warehousing concepts
PPTX
Data warehouse
PDF
Types of business intelligence tools
Online analytical processing
Cognos datawarehouse
Business Intelligence Data Warehouse System
Choosing the Right Business Intelligence Tools for Your Data and Architectura...
Role of Database Management System in A Data Warehouse
Introduction to Business Intelligence
1.4 data warehouse
Data warehouse
Olap, oltp and data mining
DATA MART APPROCHES TO ARCHITECTURE
Data warehouse system and its concepts
Prague data management meetup 2017-02-28
Business Intelligence Presentation 1 (15th March'16)
OLAP Cubes in Datawarehousing
Keys toSuccess: Business Intelligence Proven, Practical Strategies That Work
Retail Data Warehouse
Date warehousing concepts
Data warehouse
Types of business intelligence tools
Ad

Viewers also liked (20)

PPTX
DATA WAREHOUSING
PPT
Data Warehousing and Data Mining
PDF
It6601 mobile computing unit1questions
PPT
datamining and warehousing ppt
PDF
It6601 mobile computing unit 3 questions
PPSX
KOKPIT CPM for IT - Kurumsal Performans Yönetim Platformu
PDF
It6601 mobile computing unit 5 questions
PDF
It6601 mobile computing unit 2 questions
PDF
It6601 mobile computing unit 4 questions
PPT
An example of discovering simple patterns using basic data mining
PDF
IT6601 Mobile Computing
PDF
mechanics of solids
PPTX
Query decomposition in data base
PDF
Data mining & data warehousing (ppt)
PDF
8 query processing and optimization
PPT
Warehouse components
PPT
Query processing-and-optimization
PPTX
It6601 mobile computing unit 5
PDF
IT6601 MOBILE COMPUTING
PPT
Data Warehousing, Data Mining & Data Visualisation
DATA WAREHOUSING
Data Warehousing and Data Mining
It6601 mobile computing unit1questions
datamining and warehousing ppt
It6601 mobile computing unit 3 questions
KOKPIT CPM for IT - Kurumsal Performans Yönetim Platformu
It6601 mobile computing unit 5 questions
It6601 mobile computing unit 2 questions
It6601 mobile computing unit 4 questions
An example of discovering simple patterns using basic data mining
IT6601 Mobile Computing
mechanics of solids
Query decomposition in data base
Data mining & data warehousing (ppt)
8 query processing and optimization
Warehouse components
Query processing-and-optimization
It6601 mobile computing unit 5
IT6601 MOBILE COMPUTING
Data Warehousing, Data Mining & Data Visualisation
Ad

Similar to Kushal Data Warehousing PPT (20)

PPTX
DWDM Unit 1 (1).pptx
PPTX
Data wharehousing and OLAP
PPT
Lecture1
PDF
Business Intelligence: Data Warehouses
PPT
Introduction to Business Intelligence and Data warehousing - ppt
PPTX
Data warehouse introduction
PPS
Bi Dw Presentation
PPT
Chapter 2
PPTX
3 OLAP.pptx
PPTX
OLAP & Data Warehouse
PDF
05_Decision Support and OLAP.pdf
PDF
data warehousing
PPT
Data ware housing - Introduction to data ware housing process.
PPT
Msbi by quontra us
DOCX
us it recruiter
PPTX
OLAP & DATA WAREHOUSE
PPT
Data Warehouse
PPT
Data Warehouse
PPT
kalyani.ppt
PPT
kalyani.ppt
DWDM Unit 1 (1).pptx
Data wharehousing and OLAP
Lecture1
Business Intelligence: Data Warehouses
Introduction to Business Intelligence and Data warehousing - ppt
Data warehouse introduction
Bi Dw Presentation
Chapter 2
3 OLAP.pptx
OLAP & Data Warehouse
05_Decision Support and OLAP.pdf
data warehousing
Data ware housing - Introduction to data ware housing process.
Msbi by quontra us
us it recruiter
OLAP & DATA WAREHOUSE
Data Warehouse
Data Warehouse
kalyani.ppt
kalyani.ppt

Recently uploaded (20)

PDF
Chapter 2 Heredity, Prenatal Development, and Birth.pdf
PDF
ANTIBIOTICS.pptx.pdf………………… xxxxxxxxxxxxx
PPTX
The Healthy Child – Unit II | Child Health Nursing I | B.Sc Nursing 5th Semester
PDF
Mark Klimek Lecture Notes_240423 revision books _173037.pdf
PDF
3rd Neelam Sanjeevareddy Memorial Lecture.pdf
PPTX
PPT- ENG7_QUARTER1_LESSON1_WEEK1. IMAGERY -DESCRIPTIONS pptx.pptx
PDF
TR - Agricultural Crops Production NC III.pdf
PDF
Anesthesia in Laparoscopic Surgery in India
PDF
Abdominal Access Techniques with Prof. Dr. R K Mishra
PPTX
Introduction to Child Health Nursing – Unit I | Child Health Nursing I | B.Sc...
PDF
Module 4: Burden of Disease Tutorial Slides S2 2025
PDF
Microbial disease of the cardiovascular and lymphatic systems
PDF
The Lost Whites of Pakistan by Jahanzaib Mughal.pdf
PDF
Origin of periodic table-Mendeleev’s Periodic-Modern Periodic table
PPTX
PPH.pptx obstetrics and gynecology in nursing
PPTX
IMMUNITY IMMUNITY refers to protection against infection, and the immune syst...
PDF
Business Ethics Teaching Materials for college
PPTX
Renaissance Architecture: A Journey from Faith to Humanism
PPTX
Cell Structure & Organelles in detailed.
PPTX
Institutional Correction lecture only . . .
Chapter 2 Heredity, Prenatal Development, and Birth.pdf
ANTIBIOTICS.pptx.pdf………………… xxxxxxxxxxxxx
The Healthy Child – Unit II | Child Health Nursing I | B.Sc Nursing 5th Semester
Mark Klimek Lecture Notes_240423 revision books _173037.pdf
3rd Neelam Sanjeevareddy Memorial Lecture.pdf
PPT- ENG7_QUARTER1_LESSON1_WEEK1. IMAGERY -DESCRIPTIONS pptx.pptx
TR - Agricultural Crops Production NC III.pdf
Anesthesia in Laparoscopic Surgery in India
Abdominal Access Techniques with Prof. Dr. R K Mishra
Introduction to Child Health Nursing – Unit I | Child Health Nursing I | B.Sc...
Module 4: Burden of Disease Tutorial Slides S2 2025
Microbial disease of the cardiovascular and lymphatic systems
The Lost Whites of Pakistan by Jahanzaib Mughal.pdf
Origin of periodic table-Mendeleev’s Periodic-Modern Periodic table
PPH.pptx obstetrics and gynecology in nursing
IMMUNITY IMMUNITY refers to protection against infection, and the immune syst...
Business Ethics Teaching Materials for college
Renaissance Architecture: A Journey from Faith to Humanism
Cell Structure & Organelles in detailed.
Institutional Correction lecture only . . .

Kushal Data Warehousing PPT

  • 3. What is Data Warehousing? Data Warehouse is a subject-oriented, integrated, nonvolatile and timevariant collection of data in support of management’s decisions.
  • 5. What is Business Intelligence?
  • 6.  The architecture  Operational data source1 High summarized data Meta-data Operational data source 2 Reporting, query, application development, and EIS(executive information system) tools Query Manage Lightly summarized data Load Manager Operational data source n Operational data store (ods) DBMS Detailed data OLAP(online analytical processing) tools Warehouse Manager Operational data store (ODS) Data mining Archive/backup data Typical architecture of a data warehouse End-user access tools
  • 7.  The benefits of data warehousing • The potential benefits of data warehousing are high returns on investment.. • substantial competitive advantage.. • increased productivity of corporate decision-makers..
  • 8. Data Warehouse Characteristics  Key Characteristics of a Data Warehouse  Subject-oriented  Integrated  Time-variant  Non-volatile 8
  • 9. Subject Oriented • Example for an insurance company : Applications Area Data Warehouse Auto and Fire Auto and Fire Policy Policy Processing Processing Systems Systems Commercial Commercial and Life and Life Insurance Insurance Systems Systems Data Data Accounting Accounting System System Billing Billing System System Policy Policy Customer Customer Claims Claims Processing Processing System System Losses Losses Premium Premium 9
  • 10. Integrated • Data is stored once in a single integrated location (e.g. insurance company) Auto Policy Auto Policy Processing Processing System System Customer data stored in several databases Data Warehouse Database Fire Policy Fire Policy Processing Processing System System FACTS, LIFE FACTS, LIFE Commercial, Accounting Commercial, Accounting Applications Applications Subject = Customer 10
  • 11. Time - Variant Data is stored as a series of snapshots or views which record how it is collected across time. Data Warehouse Data Time Data   { • Key   Data is tagged with some element of time -  creation date, as of  date, etc. Data is available on-line for long periods of time for trend  analysis and forecasting. For example, five or more years 11
  • 12. Non-Volatile • Existing data in the warehouse is not overwritten or updated. External Sources Production Databases Data Data Warehouse Warehouse Environment Environment Production Production Applications Applications • Update • Insert • Delete Data Warehouse Database • Load • Read-Only 12
  • 13. Comparision of OLTP systems and data warehousing system OLTP systems Hold current data Stores detailed data Data is dynamic Repetitive processing High level of transaction throughput Predictable pattern of usage Transaction-driven Application-orented Supports day-to-day decisions Serves large number of clerical/operation users Data warehousing systems Holds historical data Stores detailed, lightly, and highly summarized data Data is largely static Ad hoc, unstructured, and heuristic processing Medium to how level of transaction throughput Unpredictable pattern of usage Analysis driven Subject-oriented supports strategic decisions Serves relatively how number of managerial users
  • 15. On Line Transaction Processing • What is a Transaction ? – A Logical unit of work – – – Examples: Drawing Money from a bank account Booking a seat on an airline
  • 16. Transactions • It is a unit of program execution that accesses & possibly updates various data items. • A transaction is a logical unit of work that performs some useful function for a user. • In end of the transaction the system must be: – in the prior state (if the transaction fails) or – the status of the system should reflect the successful completion (if the transaction succeeded). • May take a database from one consistent
  • 17. Characteristics of Transactions A tomicity C onsistency I solation D urability
  • 19. Types of OLAP • ROLAP (Relational Online Analytical Processing) • MOLAP (Multidimensional Online Analytical Processing) • HOLAP (Hybrid Online Analytical Processing)
  • 20. ROLAP • ROLAP (Relational online analytical Processing) • Used for reporting • Tools: Report studio
  • 21. MOLAP • MOLAP (Multidimensional online Analytical processing) • Used to build cubes • Tools: Powerplay, Transformer
  • 22. HOLAP • HOLAP (Hybrid online analytical Processing) • Used for Data modeling • This will support both MOLAP and ROLAP • Tools: Framework manager, Query Studio.
  • 23. Dimensions • It’s descriptive information about a measures like product, location, customer etc.
  • 24. Types of Dimensions • Confirmed Dimensions • Degenerated Dimensions • Junk Dimensions
  • 25. Facts • Fact is containing measures and IDs. • Ex; Revenue, Cost, Amount etc
  • 26. Measure Types • Additive Measures: Which can be added across all the dimensions • Non Additive Measures: Which can not be added across all the dimensions • Semi Additive Measures: Which can be added across some dimensions and which can not be added across some other dimensions
  • 27. Schema’s In Data warehousing • • • STAR SHEMA SNOW-FLAKE SCHEMA STAR-FLAKE SCHEMA
  • 28. Star Schema Dimension Tables Region_Dimension_Table region _id NE NW SE SW Product_Dimension_Table prod_grp_id prod_id prod_grp_desc prod_desc 10 20 30 100 140 220 Fewer devices Circuit boards Components region _doc Northeast Northwest Southeast Southwest account _id Power supply Motherboard Co-processor 100000 110000 120000 130000 140000 account _doc ABC Electronics Midway Electric Victor Components Washburn, Inc. Zerox Account_Dimension_Table month prod_id region_id account_id vend_id net-sales gross_sales 01-1996 02-1996 03-1996 100 140 220 SW NE SW 100000 110000 100000 100 200 300 30,000 23,000 32,000 50,000 42,000 49,000 Fact Table Monthly_Sales_Summary_Table month 01-1996 02-1996 03-1996 mo_in_fiscal_yr 4 5 6 month_name January February March Time_Dimension_Table Vendor_Dimension_Table vend_id 100 200 300 vendor_desc PowerAge, Inc. Advanced Micro Devices Farad Incorporated 28
  • 30. Factless Fact Table • It’s just a bridge between table where we used to join tables. • In this scenario we can only track the event.
  • 32. ETL (Extract, Transform and Loading) INFORMATICA

Editor's Notes

  • #18: Let us look at a transaction which transfer money from one account to another. The transaction has to do two updates. But this should be transparent to the end user. To the user either the transfer goes thru or it doesn’t. Before and after the transaction the database should be in a consistent state Each transaction should be made to feel that it is the only transaction executing at that instant After the transaction completes the changes made to the db should be visible to other transactions