Business Intelligence
“You can’t manage what you can’t measure.
You can’t measure what you can’t describe”
Ahsan Kabir
“A broad category of applications and technologies for gathering, storing,
analyzing, sharing and providing access to data to help enterprise users make
better business decisions” -Gartner
Why BI?
Performance management
Identify trends
Cash flow trend
Fine-tune operations
Sales pipeline analysis
Future projections
Business forecasting
Decision Making Tools
Convert data into information
How to Think?
• What happened?
• What is happening?
• Why did it happen?
• What will happen?
• What do I want to happen?
Data: ERP, CRM, 3Pty, SCM, black books
Past, Present, Future
Major Players in BI Market
Improving organizations by providing business
insights to all employees leading to better, faster,
more relevant decisions
Advanced Analytics
Self Service Reporting
End-User Analysis
Business Performance Management
Operational Applications
Microsoft Business Intelligence Vision
BI implementations
– Corporate BI
Corporate BI teams commonly design, implement and maintain data warehouses, data models, and integrated reporting and analytics. This requires significant time, expertise and money, yet it does not cover the whole business.
– Self-service BI (SSBI)
SSBI empowers analysts to design, customize and maintain their own BI solutions. It combines corporate BI with extensions that let analysts exploit it more fully.
– Managed BI
Ensures responsible BI by managing the review, approval and auditing of solutions. Data is delivered in a compliant, responsive and secure way, and access permissions are enforced.
Analysis Services
(SSAS)
Reporting Services
(SSRS)
Integration Services
(SSIS)
Master Data Services
(MDS)
SharePoint
Collaboration
Excel
Workbooks
PowerPivot
Applications
SharePoint
Dashboards &
Scorecards
Microsoft Business Intelligence Components
DQS
ERP/CRM DB Cloud Born Data Social Network
Step 1 : Business Analysis
Step 2 : SSIS
Different sources of data (RDBMS, FTP, web services, XML, CSV, Excel, etc.)
DQS (Data Quality Services): integration, cleansing, profiling
MDS (Master Data Services): centrally managing organizational master data
ETL (Extraction, Transformation and Loading) framework
Step 3 : SSAS
Create an OLAP multidimensional structure that makes data available for analytics and reporting
SSAS can pre-calculate, summarize and store the data in a highly compressed format
Reports draw their data from SSAS cubes
Step 4 : SSRS
SSRS (SQL Server Reporting Services) allows creating formatted and interactive reports
Step 5 :
PowerPivot, Power View and Excel Services provide a rapid data exploration, visualization and presentation experience for users. They let users interrogate the data from various angles using charts, graphs, drill-down paths, etc.
Excel and PowerPivot services can be used to deploy Excel or PowerPivot workbooks to SharePoint, making them available to other people and turning personal BI into organizational BI.
Microsoft Business Intelligence Road Map
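The ETL step of the road map can be illustrated outside SSIS. The following is only a minimal Python sketch of the extract, transform, load pattern, not an SSIS package: the CSV contents, the `sales` table and the function names are all hypothetical, and SQLite stands in for the target database.

```python
import csv
import io
import sqlite3

# Hypothetical source extract; a real package would read from an RDBMS, FTP, web service, etc.
SOURCE_CSV = """order_id,product,quantity,unit_price
1,Widget,2,10.50
2,widget ,1,10.50
3,Gadget,5,3.00
"""

def extract(text):
    """Extract: parse raw CSV text into a list of dictionaries."""
    return list(csv.DictReader(io.StringIO(text)))

def transform(rows):
    """Transform: trim and normalize product names, convert types, derive the sale amount."""
    cleaned = []
    for row in rows:
        quantity = int(row["quantity"])
        unit_price = float(row["unit_price"])
        product = row["product"].strip().title()
        cleaned.append((int(row["order_id"]), product, quantity, quantity * unit_price))
    return cleaned

def load(conn, rows):
    """Load: write the transformed rows into the (assumed) target table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS sales "
        "(order_id INTEGER PRIMARY KEY, product TEXT, quantity INTEGER, amount REAL)"
    )
    conn.executemany("INSERT INTO sales VALUES (?, ?, ?, ?)", rows)
    conn.commit()

conn = sqlite3.connect(":memory:")
load(conn, transform(extract(SOURCE_CSV)))

# A summary of the kind an analysis layer would pre-calculate.
totals = conn.execute(
    "SELECT product, SUM(quantity), SUM(amount) FROM sales GROUP BY product ORDER BY product"
).fetchall()
print(totals)
```

Keeping the three stages as separate functions mirrors the way an ETL package separates source, transformation and destination components, so each stage can be tested on its own.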
“…was designed specifically to be a central repository for all data
in a company disparate data from transactional systems”
Data Warehouse
Characteristics of a DW
• A DW is a relational database designed for query and analysis
• Ships and integrates data from different sources to the analyst
• Contains data derived from transactions, internal and external data, and archived data
• But it is not a copy of a source database
• High query performance
• Analysis queries would otherwise place extra load on transactional systems
• Query optimization is hard to do well
• Queries are not visible outside the warehouse
• Local processing at the sources is unaffected
• Can operate when sources are unavailable
• Can query data not stored in a DBMS
• Summarized and external data are available at the warehouse
Advantages of Warehousing
Data is kept per specific business line.
Before entering the warehouse, data is processed (cleansed and transformed).
DW Architecture
Warehouse
Data Marts
Users query
the data
warehouse
Data Warehouse
Corporate/Enterprise-wide
Union of all data marts
Organized on E-R model*
Data Mart
Departmental
Single business process
Star-join*
DW vs. Data Mart
Transactional Databases vs. Data Warehouse
Transactional databases
1. ER modeling is used
2. Normalized to 3NF
3. Data is split across many tables
4. Hard to visualize
5. Slower response time for queries and reports
Warehouse database
1. Dimensional modeling is used
2. Denormalized
3. Data is kept in fact and dimension tables
4. Flexible from the user's perspective
5. Faster response time and better performance
Requisition
RID(PK)
CID (FK)
WID (FK)
UID (FK)
Requestion_Date
Warehouse
WID (PK)
Location
Address
district
WU_Code
User_Profile
UId (PK)
Name
Address
Email
CellNo
Product_Profile
PID (PK)
description
brand
category
Client_Information
CID (PK)
Name
Address
Credit_Limit
Requisition_Details
RID (PK)
RDD (FK)
PID (FK)
promotion_key (FK)
dollars_sold
units_sold
dollars_cost
Entity Relationship Diagram
TIME
time_key (PK)
SQL_date
day_of_week
month
STORE
store_key (PK)
store_ID
store_name
address
district
floor_type
CLERK
clerk_key (PK)
clerk_id
clerk_name
clerk_grade
PRODUCT
product_key (PK)
SKU
description
brand
category
CUSTOMER
customer_key (PK)
customer_name
purchase_profile
credit_profile
Address
City
country
PROMOTION
promotion_key (PK)
promotion_name
price_type
ad_type
Sales - FACT
time_key (FK)
store_key (FK)
clerk_key (FK)
product_key (FK)
customer_key (FK)
promotion_key (FK)
dollars_sold
units_sold
dollars_cost
DIMENSIONAL MODEL
Federated Databases vs. Data Warehouse
• Data warehouse
Creates a copy of all the data and executes queries against the copy
• Federated database
Pulls data from source systems as needed to answer queries
Data Warehouse Federated Database
Query
Answer
Query
Extraction Rewritten Queries
Answer
Source
Systems
Warehouse
Mediator
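The contrast between the two approaches can be sketched in a few lines of Python. This is an illustrative toy under stated assumptions, not how any particular product works: the two source systems, the `Warehouse` and `Mediator` classes and the sample rows are all hypothetical.

```python
# Two hypothetical source systems, each holding part of the sales data.
SOURCES = {
    "erp": [{"region": "CTG", "amount": 120.0}, {"region": "DHK", "amount": 80.0}],
    "crm": [{"region": "CTG", "amount": 30.0}],
}

class Warehouse:
    """Warehouse approach: extract a copy once, then answer queries from the copy."""

    def __init__(self, sources):
        # Copy every row so later changes in the sources do not affect the warehouse.
        self.rows = [dict(row) for rows in sources.values() for row in rows]

    def total_for(self, region):
        return sum(r["amount"] for r in self.rows if r["region"] == region)

class Mediator:
    """Federated approach: keep no copy; query each source when a question arrives."""

    def __init__(self, sources):
        self.sources = sources

    def total_for(self, region):
        # The original query is rewritten into one partial query per source system.
        partials = (
            sum(r["amount"] for r in rows if r["region"] == region)
            for rows in self.sources.values()
        )
        return sum(partials)

warehouse = Warehouse(SOURCES)
mediator = Mediator(SOURCES)

# A new transaction arrives in a source after the warehouse load.
SOURCES["crm"].append({"region": "CTG", "amount": 50.0})

print(warehouse.total_for("CTG"))  # answered from the copy taken at load time
print(mediator.total_for("CTG"))   # reflects the current state of the sources
```

The warehouse keeps answering from its copy without touching the sources, while the mediator always sees current data but depends on every source being reachable at query time.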
Data Quality problems
Before
Name   Address    City  House No  DoB         State  Country
Ahsan  CDAAvenue  CTG   181/1     05/11/1978         BD
Kabir  RB Avn     CTG   41/6      23/04/1991  DHK    Bangladesh
After
Name   Address    City  House No  DoB         State  Country
Ahsan  CDAAvenue  CTG   181/1     05/11/1978  CT     Bangladesh
Kabir  RB Avenue  DHK   41/6      23/04/1991  DHK    Bangladesh
Indicators: completeness, accuracy, conformity, consistency
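Part of the before/after change can be reproduced with simple lookup rules. The sketch below is a hedged illustration in Python, not the DQS engine: the lookup tables and the functions `cleanse` and `missing_fields` are hypothetical. It covers only the conformity and consistency fixes (address terms, country names) and flags incomplete fields; filling a missing state or correcting a city would need reference data that the slide does not give.

```python
# Hypothetical lookup tables standing in for a data quality knowledge base.
ADDRESS_TERMS = {"Avn": "Avenue"}
COUNTRY_NAMES = {"BD": "Bangladesh"}

def cleanse(record):
    """Return a cleansed copy of one customer record."""
    cleaned = dict(record)
    # Conformity: expand abbreviated address terms word by word.
    cleaned["address"] = " ".join(
        ADDRESS_TERMS.get(word, word) for word in record["address"].split()
    )
    # Consistency: replace country codes with the full country name.
    cleaned["country"] = COUNTRY_NAMES.get(record["country"], record["country"])
    return cleaned

def missing_fields(record):
    """Completeness: list the fields that are empty and need attention."""
    return [field for field, value in record.items() if not value]

before = [
    {"name": "Ahsan", "address": "CDAAvenue", "city": "CTG", "state": "", "country": "BD"},
    {"name": "Kabir", "address": "RB Avn", "city": "CTG", "state": "DHK", "country": "Bangladesh"},
]
after = [cleanse(record) for record in before]

for record in after:
    print(record, "missing:", missing_fields(record))
```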
Data Quality Issues
Standard: Are data elements consistently defined and understood?
Sample data problem: Gender code = M, F, U in one system and gender code = 0, 1, 2 in another system
Complete: Is all necessary data present?
Sample data problem: 20% of customers' last names are blank; 50% of zip codes are 99999
Accurate: Does the data accurately represent reality or a verifiable source?
Sample data problem: A supplier is listed as 'Active' but went out of business six years ago
Valid: Do data values fall within acceptable ranges?
Sample data problem: Salary values should be between 60,000 and 120,000
Unique: Does data appear several times?
Sample data problem: Both John Ryan and Jack Ryan appear in the system; are they the same person?
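Several of these checks are easy to express as small rules. The following Python sketch is illustrative only: the function names are hypothetical, the mapping from numeric to letter gender codes is an assumption (the slide does not say which number means which letter), and the duplicate rule (same last name and first initial) is just one simple heuristic. Accuracy is left out because it needs an external source of truth.

```python
# Assumed mapping between the two gender code systems; the real mapping is not given.
GENDER_MAP = {"0": "M", "1": "F", "2": "U"}

def standardize_gender(code):
    """Standard: bring both code systems to the single set M, F, U."""
    code = str(code).strip().upper()
    return GENDER_MAP.get(code, code)

def is_complete(record, required=("last_name", "zip_code")):
    """Complete: treat blanks and the placeholder zip code 99999 as missing."""
    return all(record.get(field) not in (None, "", "99999") for field in required)

def is_valid_salary(salary, low=60_000, high=120_000):
    """Valid: the value must fall within the acceptable range."""
    return low <= salary <= high

def possible_duplicates(records):
    """Unique: group records sharing a last name and first initial for manual review."""
    groups = {}
    for record in records:
        key = (record["last_name"].lower(), record["first_name"][0].lower())
        groups.setdefault(key, []).append(record["first_name"])
    return {key: names for key, names in groups.items() if len(names) > 1}

people = [
    {"first_name": "John", "last_name": "Ryan"},
    {"first_name": "Jack", "last_name": "Ryan"},
    {"first_name": "Ann", "last_name": "Lee"},
]
print(standardize_gender(1), is_valid_salary(50_000), possible_duplicates(people))
```

The duplicate check only produces candidates; deciding whether John Ryan and Jack Ryan are the same person still needs a human or additional matching attributes.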
Data Quality Services (DQS)
Data Quality Services (DQS) is a knowledge-driven data quality solution that enables users to easily improve the quality of their data
DW Design Considerations
• Simplicity
  – Users should understand the design
  – Data model should match users’ conceptual model
  – Queries should be easy and intuitive to write
• Expressiveness
  – Include enough information to answer all important queries
  – Include all relevant data (without irrelevant data)
• Performance
  – An efficient physical design should be possible
Component of Data Warehousing
A DW consists of fact tables and dimension tables. The relationship between a fact table and its dimensions is based on foreign keys and primary keys.
Facts are numeric, typically additive, measurements that represent a specific business aspect or activity.
Examples:
• Unit cost
• Sale amount
• Quantity sold
• Salary amount
• Purchase amount
Each dimension has a primary key, called the surrogate key. The primary key of the source system is stored in the dimension table as the business key.
Dimension tables contain descriptive information, held in a list of columns.
Example, in the case of Product:
• Product name
• Origin
• Category
• Manufacture date
• Sales date
The fact table holds foreign keys that point to the surrogate keys of the dimension tables.
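The relationship between surrogate keys, business keys and fact rows can be sketched with SQLite from Python. This is a minimal illustration under assumptions: the table names `dim_product` and `fact_sales`, the helper `product_key_for` and the sample rows are hypothetical, and an auto-incrementing integer is only one common way to generate surrogate keys.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE dim_product (
    product_key  INTEGER PRIMARY KEY AUTOINCREMENT,  -- surrogate key
    product_id   TEXT UNIQUE,                        -- business key from the source system
    product_name TEXT
);
CREATE TABLE fact_sales (
    product_key  INTEGER REFERENCES dim_product(product_key),
    units_sold   INTEGER,
    dollars_sold REAL
);
""")

def product_key_for(conn, product_id, product_name):
    """Return the surrogate key for a business key, inserting the dimension row if new."""
    row = conn.execute(
        "SELECT product_key FROM dim_product WHERE product_id = ?", (product_id,)
    ).fetchone()
    if row:
        return row[0]
    cursor = conn.execute(
        "INSERT INTO dim_product (product_id, product_name) VALUES (?, ?)",
        (product_id, product_name),
    )
    return cursor.lastrowid

# Hypothetical source rows: (business key, name, units, dollars).
source_rows = [
    ("SKU-9", "Widget", 2, 21.0),
    ("SKU-4", "Gadget", 5, 15.0),
    ("SKU-9", "Widget", 1, 10.5),
]
for product_id, name, units, dollars in source_rows:
    key = product_key_for(conn, product_id, name)
    # The fact row stores only the surrogate key, never the business key.
    conn.execute("INSERT INTO fact_sales VALUES (?, ?, ?)", (key, units, dollars))

dim_rows = conn.execute(
    "SELECT product_key, product_id FROM dim_product ORDER BY product_key"
).fetchall()
fact_keys = [row[0] for row in conn.execute("SELECT product_key FROM fact_sales ORDER BY rowid")]
print(dim_rows, fact_keys)
```

Because fact rows reference the surrogate key, the warehouse stays stable even if a source system later renumbers or reuses its own keys.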
TIME
time_key (PK)
SQL_date
day_of_week
month
STORE
store_key (PK)
store_ID
store_name
address
district
floor_type
CLERK
clerk_key (PK)
clerk_id
clerk_name
clerk_grade
PRODUCT
product_key (PK)
SKU
description
brand
category
CUSTOMER
customer_key (PK)
customer_name
purchase_profile
credit_profile
Address
City
country
PROMOTION
promotion_key (PK)
promotion_name
price_type
ad_type
Sales - FACT
time_key (FK)
store_key (FK)
clerk_key (FK)
product_key (FK)
customer_key (FK)
promotion_key (FK)
dollars_sold
units_sold
dollars_cost
Dimensional Modeling
• The diagram resembles a star
• The center of the star consists of one fact table
• The points of the star are the dimension tables
• Optimizes performance by keeping queries simple and providing fast response times
Star schema
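A query against a star schema typically joins the central fact table to a few dimensions and aggregates the measures. The sketch below uses SQLite from Python as a stand-in database; the table names `time_dim`, `product_dim` and `sales_fact` and the sample rows are assumptions, while the column names follow the dimensional model shown earlier in the deck.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE time_dim    (time_key INTEGER PRIMARY KEY, month TEXT);
CREATE TABLE product_dim (product_key INTEGER PRIMARY KEY, brand TEXT);
CREATE TABLE sales_fact  (
    time_key     INTEGER REFERENCES time_dim(time_key),
    product_key  INTEGER REFERENCES product_dim(product_key),
    dollars_sold REAL,
    units_sold   INTEGER
);
INSERT INTO time_dim VALUES (1, 'Jan'), (2, 'Feb');
INSERT INTO product_dim VALUES (10, 'Acme'), (11, 'Zenith');
INSERT INTO sales_fact VALUES
    (1, 10, 100.0, 4), (1, 11, 50.0, 1), (2, 10, 25.0, 1), (2, 10, 75.0, 3);
""")

# Star join: one fact table in the center, one join per dimension "point".
STAR_JOIN = """
SELECT t.month, p.brand, SUM(f.dollars_sold) AS dollars, SUM(f.units_sold) AS units
FROM sales_fact AS f
JOIN time_dim    AS t ON t.time_key = f.time_key
JOIN product_dim AS p ON p.product_key = f.product_key
GROUP BY t.time_key, t.month, p.brand
ORDER BY t.time_key, p.brand
"""

sales_by_month_and_brand = conn.execute(STAR_JOIN).fetchall()
for row in sales_by_month_and_brand:
    print(row)
```

Every dimension is one join away from the fact table, which is what keeps such queries simple to write and predictable to optimize.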
Star Schema for the retailer’s DW
Fact table: Sales
Dimension tables: Date, Product, Store, Promotion
One fact table, four dimension tables
Goals for Logical Design
• Simplicity
  – Users should understand the design
  – Data model should match users’ conceptual model
  – Queries should be easy and intuitive to write
• Expressiveness
  – Include enough information to answer all important queries
  – Include all relevant data (without irrelevant data)
• Performance
  – An efficient physical design should be possible
Step 1 : Identify business subjects and the fields of information for the relevant subjects
Step 2 : Discover entities, attributes and relationships
Step 3 : Identify which information belongs in a central fact table
Step 4 : Identify which information belongs in its associated dimension tables
Step 5 : Identify cleansing points
Step 6 : Identify which data needs to be managed centrally
Step 7 : Define surrogate keys and business keys
Step 8 : Build the ETL package
Step 9 : Organize data structures on disk
Steps of DW Implementation
Thanks

Overview of business intelligence
