data mining
Technical considerations
What Is a Data Warehouse?
• Definition: A data warehouse is the data repository of
an enterprise. It is generally used for research and
decision support.
• By comparison: an OLTP (on-line transaction processing)
or operational system handles the everyday
running of one aspect of an enterprise.
• OLTP systems are usually designed independently of
each other, which makes it difficult for them to share
information.
Why Do We Need Data Warehouses?
• Consolidation of information resources
• Improved query performance
• Separate research and decision support
functions from the operational systems
• Foundation for data mining, data
visualization, advanced reporting and
OLAP tools
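
The consolidation point above can be illustrated with a minimal sketch. It assumes two hypothetical operational tables (`store_sales` and `web_orders`) that were designed independently and use different column names, and it loads both into one hypothetical warehouse table (`sales_fact`) using Python's built-in `sqlite3` module. All table names, column names and values are illustrative assumptions, not part of the original slides.

```python
import sqlite3


def build_warehouse():
    """Consolidate two hypothetical operational tables into one warehouse table."""
    conn = sqlite3.connect(":memory:")
    cur = conn.cursor()

    # Two independently designed operational (OLTP) tables.
    # Names and schemas are illustrative assumptions.
    cur.execute("CREATE TABLE store_sales (sale_id INTEGER, product TEXT, amount REAL)")
    cur.execute("CREATE TABLE web_orders (order_no TEXT, item TEXT, total REAL)")
    cur.executemany(
        "INSERT INTO store_sales VALUES (?, ?, ?)",
        [(1, "widget", 10.0), (2, "gadget", 25.0)],
    )
    cur.executemany(
        "INSERT INTO web_orders VALUES (?, ?, ?)",
        [("W-1", "widget", 12.0), ("W-2", "widget", 8.0)],
    )

    # The warehouse table gives both sources one shared schema.
    cur.execute("CREATE TABLE sales_fact (channel TEXT, product TEXT, amount REAL)")
    cur.execute("INSERT INTO sales_fact SELECT 'store', product, amount FROM store_sales")
    cur.execute("INSERT INTO sales_fact SELECT 'web', item, total FROM web_orders")
    conn.commit()
    return conn


def revenue_by_product(conn):
    """Decision-support style query: total revenue per product across all channels."""
    rows = conn.execute(
        "SELECT product, SUM(amount) FROM sales_fact GROUP BY product ORDER BY product"
    ).fetchall()
    return dict(rows)


if __name__ == "__main__":
    print(revenue_by_product(build_warehouse()))
```

Once the data sits in one schema, a cross-channel question becomes a single query against the warehouse and does not touch the operational tables.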
Building a Data Warehouse
1. Business Considerations (Return on
Investment)
2. Design Considerations
3. Technical Considerations
4. Implementation Considerations
5. Integrated Solutions
6. Benefits of Data Warehousing
Technical Considerations
• A number of technical issues must be considered when
designing and implementing a data warehouse environment:
1. The hardware platform that will house the data
warehouse, chosen for parallel query scalability
(uniprocessor, multiprocessor, etc.)
2. The DBMS that supports the warehouse database
3. The communication infrastructure that connects the
warehouse, data marts, operational systems, and end users
4. The hardware platform and software to support the
metadata repository
5. The systems management framework that enables
centralized management and administration of the entire
environment
HARDWARE PLATFORMS
Data warehouse implementations are
typically developed into already existing
environments.
This section looks at hardware
platform selection from an architectural
viewpoint.
A mainframe system, however, is not as
open and flexible as a contemporary
client/server system, and is not optimized
for ad hoc query processing.
In addition, the platform has to be scalable, since the data
warehouse is never finished: new user
requirements, new data sources, and more
historical data are continuously incorporated
into the warehouse.
Often the platform choice is a choice
between a mainframe and a non-MVS (UNIX or
Windows NT) server.
BALANCED APPROACH
An important design point when selecting
a scalable computing platform is the right
balance between all computing
components, for example between the
number of processors in a multiprocessor
system and the I/O bandwidth. Remember
that a lack of balance in a system inevitably
results in a bottleneck.
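
The balance argument can be made concrete with a back-of-the-envelope sketch. It assumes a deliberately simple model in which each processor can consume data at a fixed rate and the I/O subsystem can deliver data at a fixed rate, so the slower of the two limits the whole system. The function names and all figures are hypothetical.

```python
def effective_scan_rate(num_cpus, mb_per_sec_per_cpu, io_bandwidth_mb_per_sec):
    """Return the scan rate (MB/s) the system can sustain.

    Simplified model: the system runs no faster than its slowest component.
    """
    cpu_capacity = num_cpus * mb_per_sec_per_cpu
    return min(cpu_capacity, io_bandwidth_mb_per_sec)


def bottleneck(num_cpus, mb_per_sec_per_cpu, io_bandwidth_mb_per_sec):
    """Name the limiting component: 'io', 'cpu', or 'balanced'."""
    cpu_capacity = num_cpus * mb_per_sec_per_cpu
    if cpu_capacity > io_bandwidth_mb_per_sec:
        return "io"
    if cpu_capacity < io_bandwidth_mb_per_sec:
        return "cpu"
    return "balanced"


if __name__ == "__main__":
    # Eight processors that could each consume 50 MB/s, fed by 200 MB/s of I/O:
    # half of the processing capacity sits idle.
    print(effective_scan_rate(8, 50, 200), bottleneck(8, 50, 200))
    # Four such processors match the same I/O bandwidth exactly.
    print(effective_scan_rate(4, 50, 200), bottleneck(4, 50, 200))
```

In this model, adding processors beyond the balance point raises cost without raising throughput, which is the bottleneck the slide warns about.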
OPTIMAL HARDWARE ARCHITECTURE
FOR PARALLEL QUERY SCALABILITY
An important consideration when selecting a
hardware platform for a data warehouse is
that of scalability.
Architecture-induced data skew is more
severe in low-density asymmetric
connection architectures.
When selecting a hardware platform for a
data warehouse, take into account the fact
that architecture-induced data skew can
overpower even the best data
layout for parallel query.
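
Why skew matters for parallel query can be shown with a small sketch. It assumes a simplified model in which a query is split across workers that all start together, so the query finishes only when the most heavily loaded worker finishes. The model does not distinguish the cause of the skew (hardware architecture or data layout); the function names and numbers are hypothetical.

```python
def parallel_query_time(rows_per_worker, rows_per_sec):
    """Elapsed time of a parallel scan: the slowest (largest) worker decides."""
    return max(rows_per_worker) / rows_per_sec


def skew_factor(rows_per_worker):
    """Ratio of the largest worker load to the mean load (1.0 means no skew)."""
    mean_load = sum(rows_per_worker) / len(rows_per_worker)
    return max(rows_per_worker) / mean_load


if __name__ == "__main__":
    even = [250, 250, 250, 250]    # 1000 rows spread evenly over 4 workers
    skewed = [700, 100, 100, 100]  # same 1000 rows, one overloaded worker
    for load in (even, skewed):
        print(load, parallel_query_time(load, 50), skew_factor(load))
```

With the same total work and the same number of workers, the skewed case takes 14 time units against 5 for the even case, so most of the parallel capacity is wasted.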