SlideShare a Scribd company logo
Unit-1
Submitted by
Javed Akhtar
M.Tech part time(CSE)
• In computing, a data warehouse (DW, DWH), or an
enterprise data warehouse (EDW), is a database
used for reporting (1) and data analysis (2).
• Integrating data from one or more disparate sources
creates a central repository of data, a data
warehouse (DW).
• The typical extract transform load (ETL)-based
data warehouse uses staging, data integration,
and access layers to house its key functions.
• The staging layer or staging database stores raw
data extracted from each of the disparate source
data systems.
• The integration layer integrates the disparate
data sets by transforming the data from the
staging layer often storing this transformed data
in an operational data store (ODS) database.
• The integrated data are then moved to yet
another database, often called the data
warehouse database, where the data is
arranged into hierarchical groups often called
dimensions and into facts and aggregate facts.
• The combination of facts and dimensions is
sometimes called a star schema. The access
layer helps users retrieve data.
Unit-1.pptx final unit new mtech unit thre
• The integrated source data systems and the data
warehouse are all integrated since there is no
transformation of dimensional or reference data.
• This integrated data warehouse architecture supports the
drill down from the aggregate data of the data warehouse
to the transactional data of the integrated source data
systems.
• A data mart is a small data warehouse focused on a specific
area of interest. Data warehouses can be subdivided into
data marts for improved performance and ease of use
within that area.
1. Keep things focused.
• "Try not to create a global solution." Kramer suggests that a good practice is
to "focus on what you need. A small data warehouse or data mart which
addresses a single subject or that is focused on a single department is much
more efficient than a large data warehouse.
• You will see measurable results much faster from a data mart than a data
warehouse. A focused data mart will get funding and gain organizational
consensus a lot easier, too."
2. Don't worry about integration, keep things small.
• "Integration can be an issue, but it has always been a problem when
organizations try to take a small filing system and integrate it into an
organizational system. There are always
3. Spend the extra money if you need help
designing your system.
Kramer commented, "Systems designing is the
best place to spend the money on hiring
consultants. They know the problems, and
know how to deal with them. It is possible to
design your own data warehouse system, but it
is a lot less frustrating to hire out the design
process."
4.minimizes, or possibly eliminates any tool integration
issues," Kramer advised.
5. Be in tune with the users.
• "Know your users," Kramer warned. "If you are not
careful, you will wind up giving the right users the
wrong tools, and that only leads one place -
frustration.
• Find out who your end-users are, and work
backward to the operational data. This will tell you
what tools your data warehouse needs."
• 6. Consider your platforms.
• Kramer said "there really are no right platforms out
there. You can start with a UNIX system or NT. Keep in
mind that the NT has a ceiling in terms of scalability,
but it works well with data marts, and most other small
warehouses, just not global data warehouses."
• 7. Think before you data mine.
"Data mining is a solution in search of a problem," Kramer
said. "Know what you want to find before you select
the tool. Data mining software simply relieves some of
the burden from the analyst."

More Related Content

DOC
Oracle sql plsql & dw
PPT
Data Warehouse
PDF
Unit 3 part 2
PDF
DATA WAREHOUSING AND DATA MINING (R18A0524).pdf
PPT
20IT501_DWDM_PPT_Unit_I.ppt
DOC
Data mining notes
DOC
Dwdm unit 1-2016-Data ingarehousing
PPT
20IT501_DWDM_PPT_Unit_I.ppt
Oracle sql plsql & dw
Data Warehouse
Unit 3 part 2
DATA WAREHOUSING AND DATA MINING (R18A0524).pdf
20IT501_DWDM_PPT_Unit_I.ppt
Data mining notes
Dwdm unit 1-2016-Data ingarehousing
20IT501_DWDM_PPT_Unit_I.ppt

Similar to Unit-1.pptx final unit new mtech unit thre (20)

PPT
Chapter 2-data-warehousingppt2517 vero
PDF
Data warehousing interview questions
PPT
20IT501_DWDM_PPT_Unit_I.ppt
PPTX
Business Intelligence Module 3_Datawarehousing.pptx
PPTX
module 1 DWDM (complete) chapter ppt.pptx
DOCX
Unit 1
DOC
Data Mining
PPTX
MIS and Business Functions, TPS/DSS/ESS, MIS and Business Processes, Impact o...
PPTX
ETL processes , Datawarehouse and Datamarts.pptx
PPTX
Data warehousing Concepts and Design.pptx
PPTX
introduction & conceptsdatawarehousing.pptx
PPTX
Data Warehouse
PDF
Chapter-6_BasicsOfDataIntegrationbibibibini.pdf
PPTX
Manish tripathi-ea-dw-bi
 
PPTX
Datawarehouse
PPS
Data Warehouse 101
PPTX
Data Warehouse Design on Cloud ,A Big Data approach Part_One
PPT
DWIntro.ppt
PPT
DWIntro.ppt
PPT
DWIntro.ppt
Chapter 2-data-warehousingppt2517 vero
Data warehousing interview questions
20IT501_DWDM_PPT_Unit_I.ppt
Business Intelligence Module 3_Datawarehousing.pptx
module 1 DWDM (complete) chapter ppt.pptx
Unit 1
Data Mining
MIS and Business Functions, TPS/DSS/ESS, MIS and Business Processes, Impact o...
ETL processes , Datawarehouse and Datamarts.pptx
Data warehousing Concepts and Design.pptx
introduction & conceptsdatawarehousing.pptx
Data Warehouse
Chapter-6_BasicsOfDataIntegrationbibibibini.pdf
Manish tripathi-ea-dw-bi
 
Datawarehouse
Data Warehouse 101
Data Warehouse Design on Cloud ,A Big Data approach Part_One
DWIntro.ppt
DWIntro.ppt
DWIntro.ppt
Ad

More from javed75 (11)

PPT
javed_prethesis2608 on predcition of heart disease
PPTX
presentationfinal-090714235255-phpapp01 (1) (2).pptx
PPTX
algocomplexity cost effective tradeoff in
PPT
Section 7.5 version 2 AM new ppt for every
PPTX
Cyber_Security_Awareness_Presentation (1).pptx
PPTX
anand ethics ppt for phd scholar integral
PPTX
Data Science.pptx NEW COURICUUMN IN DATA
PPT
1 Basic E-Commerce Concepts for it 2ndt year
PPT
UNIT-IV WT web technology for 1st year cs
PPTX
training about android installation and usa
PPTX
Phd2023-2024cIntegralUniversitynida.pptx
javed_prethesis2608 on predcition of heart disease
presentationfinal-090714235255-phpapp01 (1) (2).pptx
algocomplexity cost effective tradeoff in
Section 7.5 version 2 AM new ppt for every
Cyber_Security_Awareness_Presentation (1).pptx
anand ethics ppt for phd scholar integral
Data Science.pptx NEW COURICUUMN IN DATA
1 Basic E-Commerce Concepts for it 2ndt year
UNIT-IV WT web technology for 1st year cs
training about android installation and usa
Phd2023-2024cIntegralUniversitynida.pptx
Ad

Recently uploaded (20)

PPTX
Cell Types and Its function , kingdom of life
PDF
Chapter 2 Heredity, Prenatal Development, and Birth.pdf
PDF
Supply Chain Operations Speaking Notes -ICLT Program
PDF
Abdominal Access Techniques with Prof. Dr. R K Mishra
PPTX
master seminar digital applications in india
PDF
01-Introduction-to-Information-Management.pdf
PPTX
Institutional Correction lecture only . . .
PDF
FourierSeries-QuestionsWithAnswers(Part-A).pdf
PDF
Black Hat USA 2025 - Micro ICS Summit - ICS/OT Threat Landscape
PPTX
school management -TNTEU- B.Ed., Semester II Unit 1.pptx
PDF
OBE - B.A.(HON'S) IN INTERIOR ARCHITECTURE -Ar.MOHIUDDIN.pdf
PDF
O7-L3 Supply Chain Operations - ICLT Program
PDF
ANTIBIOTICS.pptx.pdf………………… xxxxxxxxxxxxx
PPTX
GDM (1) (1).pptx small presentation for students
PPTX
Introduction-to-Literarature-and-Literary-Studies-week-Prelim-coverage.pptx
PDF
Module 4: Burden of Disease Tutorial Slides S2 2025
PPTX
Pharma ospi slides which help in ospi learning
PPTX
Microbial diseases, their pathogenesis and prophylaxis
PPTX
PPT- ENG7_QUARTER1_LESSON1_WEEK1. IMAGERY -DESCRIPTIONS pptx.pptx
PDF
grade 11-chemistry_fetena_net_5883.pdf teacher guide for all student
Cell Types and Its function , kingdom of life
Chapter 2 Heredity, Prenatal Development, and Birth.pdf
Supply Chain Operations Speaking Notes -ICLT Program
Abdominal Access Techniques with Prof. Dr. R K Mishra
master seminar digital applications in india
01-Introduction-to-Information-Management.pdf
Institutional Correction lecture only . . .
FourierSeries-QuestionsWithAnswers(Part-A).pdf
Black Hat USA 2025 - Micro ICS Summit - ICS/OT Threat Landscape
school management -TNTEU- B.Ed., Semester II Unit 1.pptx
OBE - B.A.(HON'S) IN INTERIOR ARCHITECTURE -Ar.MOHIUDDIN.pdf
O7-L3 Supply Chain Operations - ICLT Program
ANTIBIOTICS.pptx.pdf………………… xxxxxxxxxxxxx
GDM (1) (1).pptx small presentation for students
Introduction-to-Literarature-and-Literary-Studies-week-Prelim-coverage.pptx
Module 4: Burden of Disease Tutorial Slides S2 2025
Pharma ospi slides which help in ospi learning
Microbial diseases, their pathogenesis and prophylaxis
PPT- ENG7_QUARTER1_LESSON1_WEEK1. IMAGERY -DESCRIPTIONS pptx.pptx
grade 11-chemistry_fetena_net_5883.pdf teacher guide for all student

Unit-1.pptx final unit new mtech unit thre

  • 2. • In computing, a data warehouse (DW, DWH), or an enterprise data warehouse (EDW), is a database used for reporting (1) and data analysis (2). • Integrating data from one or more disparate sources creates a central repository of data, a data warehouse (DW).
  • 3. • The typical extract transform load (ETL)-based data warehouse uses staging, data integration, and access layers to house its key functions. • The staging layer or staging database stores raw data extracted from each of the disparate source data systems. • The integration layer integrates the disparate data sets by transforming the data from the staging layer often storing this transformed data in an operational data store (ODS) database.
  • 4. • The integrated data are then moved to yet another database, often called the data warehouse database, where the data is arranged into hierarchical groups often called dimensions and into facts and aggregate facts. • The combination of facts and dimensions is sometimes called a star schema. The access layer helps users retrieve data.
  • 6. • The integrated source data systems and the data warehouse are all integrated since there is no transformation of dimensional or reference data. • This integrated data warehouse architecture supports the drill down from the aggregate data of the data warehouse to the transactional data of the integrated source data systems. • A data mart is a small data warehouse focused on a specific area of interest. Data warehouses can be subdivided into data marts for improved performance and ease of use within that area.
  • 7. 1. Keep things focused. • "Try not to create a global solution." Kramer suggests that a good practice is to "focus on what you need. A small data warehouse or data mart which addresses a single subject or that is focused on a single department is much more efficient than a large data warehouse. • You will see measurable results much faster from a data mart than a data warehouse. A focused data mart will get funding and gain organizational consensus a lot easier, too." 2. Don't worry about integration, keep things small. • "Integration can be an issue, but it has always been a problem when organizations try to take a small filing system and integrate it into an organizational system. There are always
  • 8. 3. Spend the extra money if you need help designing your system. Kramer commented, "Systems designing is the best place to spend the money on hiring consultants. They know the problems, and know how to deal with them. It is possible to design your own data warehouse system, but it is a lot less frustrating to hire out the design process."
  • 9. 4.minimizes, or possibly eliminates any tool integration issues," Kramer advised. 5. Be in tune with the users. • "Know your users," Kramer warned. "If you are not careful, you will wind up giving the right users the wrong tools, and that only leads one place - frustration. • Find out who your end-users are, and work backward to the operational data. This will tell you what tools your data warehouse needs."
  • 10. • 6. Consider your platforms. • Kramer said "there really are no right platforms out there. You can start with a UNIX system or NT. Keep in mind that the NT has a ceiling in terms of scalability, but it works well with data marts, and most other small warehouses, just not global data warehouses." • 7. Think before you data mine. "Data mining is a solution in search of a problem," Kramer said. "Know what you want to find before you select the tool. Data mining software simply relieves some of the burden from the analyst."