SlideShare a Scribd company logo
Neha Malhotra, International Journal of Advance research, Ideas and Innovations in Technology.
© 2015, IJARIIT All Rights Reserved
ISSN: 2454-132X
(Volume1, Issue 2)
Implementation of Data Marts in Data ware house
Neha Malhotra
nehamalhotra588@gmail.com
Abstract: A data mart is a persistent physical store of operational and aggregated data statistically processed data
that supports businesspeople in making decisions based primarily on analyses of past activities and results. A data
mart contains a predefined subset of enterprise data organized for rapid analysis and reporting. Data warehousing
has come into being because the file structure of the large mainframe core business systems is inimical to
information retrieval. The purpose of the data warehouse is to combine core business and data from other sources in
a format that facilitates reporting and decision support. In just a few years, data warehouses have evolved from
large, centralized data repositories to subject specific, but independent, data marts and now to dependent marts that
load data from a central repository of Data Staging files that has previously extracted data from the institution’s
operational business systems (e.g., student record, finance and human resource systems, etc.).
I. INTRODUCTION
To understand what a data mart is, we must first know a little about data warehouses. The concept of data
warehousing first appeared in the late 1980s in articles published by Bill Inmon and others. Essentially, a data
warehouse involves a systematic approach to gathering, organizing, and storing data–generally from internal
production sources but also from external providers–to create a definitive source of information for analysis and
decision support [1]. A data mart is basically a condensed and more focused version of a data warehouse that
reflects the regulations and process specifications of each business unit within an organization. Each data mart is
dedicated to a specific business function or region. This subset of data may span across many or all of an
enterprise’s functional subject areas [2].
II. DEPENDENT AND INDEPENDENT DATAMARTS
There are two basic types of data marts: dependent and independent. The categorization is based primarily on the
data source that feeds the data mart [3]. Dependent data marts draw data from a central data warehouse that has
already been created. Independent data marts, in contrast, are standalone systems built by drawing data directly
from operational or external sources of data, or both.
The main difference between independent and dependent data marts is how you populate the data mart; that is, how
you get data out of the sources and into the data mart [4]. This step, called the Extraction-Transformation-and
Loading (ETL) process, involves moving data from operational systems, filtering it, and loading it into the data
mart.
With dependent data marts, this process is somewhat simplified because formatted and summarized (clean) data has
already been loaded into the central data warehouse [5]. The ETL process for dependent data marts is mostly a
Neha Malhotra, International Journal of Advance research, Ideas and Innovations in Technology.
© 2015, IJARIIT All Rights Reserved
process of identifying the right subset of data relevant to the chosen data mart subject and moving a copy of it,
perhaps in a summarized form.
With independent data marts, however, you must deal with all aspects of the ETL process, much as you do with a
central data warehouse. The number of sources is likely to be fewer and the amount of data associated with the data
mart is less than the warehouse, given your focus on a single subject [3].
The motivations behind the creation of these two types of data marts are also typically different. Dependent data
marts are usually built to achieve improved performance and availability, better control, and lower
telecommunication costs resulting from local access of data relevant to a specific department. The creation of
independent data marts is often driven by the need to have a solution within a shorter time [6].
III. DIFFERENCE BETWEEN DATA WAREHOUSING AND DATA MART
It is important to note that there are huge differences between these two tools though they may serve same purpose.
Firstly, data mart contains programs, data, software and hardware of a specific department of a company. There can
be separate data marts for finance, sales, production or marketing. All these data marts are different but they can be
coordinated. Data mart of one department is different from data mart of another department, and though indexed,
this system is not suitable for a huge data base as it is designed to meet the requirements of a particular department.
Data Warehousing is not limited to a particular department and it represents the database of a complete
organization. The data stored in data warehouse is more detailed though indexing is light as it has to store huge
amounts of information. It is also difficult to manage and takes a long time to process. It implies then that data
marts are quick and easy to use, as they make use of small amounts of data. Data warehousing is also more
expensive because of the same reason.
IV. FUNCTIONAL OVERVIEW
While it is recognized that Data Warehouse systems each have their own unique characteristics, there are certain
generic characteristics shared by the family of systems known as Data Warehouses. These include:
1. The prime purpose of a Data Warehouse is to store, in one system, data and information that originates
from multiple applications within, or across, organizations. The data may be stored ‘as received’ from the
source application, or it may be processed upon input to validate, translate, aggregate or derive new
data/information.
2. Most of the data load functions are processed in batch. There are few on-line data maintenance functions.
The on-line functions that do exist tend to update the reference files and data translation tables.
3. A database alone does not constitute a Data Warehouse system. At a minimum, a Data Warehouse system
must include the database and corresponding load functions. Data reporting functions are optional. They
may or may not be an integral part of the Data Warehouse system.
4. The prime purpose of storing the data is to support the information reporting requirements of an
organization, i.e. multiple users and multiple applications.
5. The Data Warehouse may or may not provide the required reporting functions. In some cases, external
applications access the Data Warehouse files to generate their own reports and queries.
6. Data Warehouse functions are often based upon packaged software. An example is the Business Objects
product.
7. Where the data within the Warehouse supports the reporting requirements of multiple applications and
users, the data may be physically stored based upon the user requirements. Separate database segments may
store the ‘user views’ for a particular user. This results in the physical storage of a considerable amount of
redundant data. The data storage approach is designed to optimize data/information retrieval.
Neha Malhotra, International Journal of Advance research, Ideas and Innovations in Technology.
© 2015, IJARIIT All Rights Reserved
V. CONTRASTING OLTP AND DATA WAREHOUSING ENVIRONMENTS
Figure 1-1 illustrates key differences between an OLTP system and a data warehouse. One major difference between
the types of system is that data warehouses are not usually in third normal form (3NF), a type of data normalization
common in OLTP environments. Data warehouses and OLTP systems have very different requirements. Here are
some examples of differences between typical data warehouses and OLTP systems:
 Workload
Data warehouses are designed to accommodate ad hoc queries and data analysis. You might not know the
workload of your data warehouse in advance, so a data warehouse should be optimized to perform well for
a wide variety of possible query and analytical operations.
OLTP systems support only predefined operations. Your applications might be specifically tuned or
designed to support only these operations.
 Data modifications
A data warehouse is updated on a regular basis by the ETL process (run nightly or weekly) using bulk data
modification techniques. The end users of a data warehouse do not directly update the data warehouse
except when using analytical tools, such as data mining, to make predictions with associated probabilities,
assign customers to market segments, and develop customer profiles [7].
In OLTP systems, end users routinely issue individual data modification statements to the database. The
OLTP database is always up to date, and reflects the current state of each business transaction.
 Schema design
Data warehouses often use denormalized or partially denormalized schemas (such as a star schema) to
optimize query and analytical performance..
 Typical operations [5]
A typical data warehouse query scans thousands or millions of rows. For example, "Find the total sales for
all customers last month."
A typical OLTP operation accesses only a handful of records. For example, "Retrieve the current order for
this customer."
 Historical data
Data warehouses usually store many months or years of data. This is to support historical analysis and
reporting.
OLTP systems usually store data from only a few weeks or months. The OLTP system stores only
historical data as needed to successfully meet the requirements of the current transaction.
Neha Malhotra, International Journal of Advance research, Ideas and Innovations in Technology.
© 2015, IJARIIT All Rights Reserved
Figure 1-1 Contrasting OLTP and Data Warehousing Environments
VI. DATA WAREHOUSE ARCHITECTURES
Data warehouses and their architectures vary depending upon the specifics of an organization's situation. Three
common architectures are [3]:
 Data Warehouse Architecture: Basic
 Data Warehouse Architecture: with a Staging Area
 Data Warehouse Architecture: with a Staging Area and Data Marts
Data Warehouse Architecture: Basic
Figure 1-2 shows a simple architecture for a data warehouse. End users directly access data derived from several
source systems through the data warehouse.
Figure 1-2 Architecture of a Data Warehouse
Neha Malhotra, International Journal of Advance research, Ideas and Innovations in Technology.
© 2015, IJARIIT All Rights Reserved
In Figure 1-2, the metadata and raw data of a traditional OLTP system is present, as is an additional type of data,
summary data. Summaries are very valuable in data warehouses because they pre-compute long operations in
advance. For example, a typical data warehouse query is to retrieve something such as August sales. A summary in
an Oracle database is called a materialized view [8].
Data Warehouse Architecture: with a Staging Area
You need to clean and process your operational data before putting it into the warehouse, as shown in Figure 1-2.
You can do this programmatically, although most data warehouses use a staging area instead. A staging area
simplifies building summaries and general warehouse management. Figure 1-3 illustrates this typical architecture.
Figure 1-3 Architecture of a Data Warehouse with a Staging Area
Data Warehouse Architecture: with a Staging Area and Data Marts
Although the architecture in Figure 1-3 is quite common, you may want to customize your warehouse's architecture
for different groups within your organization. You can do this by adding data marts, which are systems designed for
a particular line of business. Figure 1-4 illustrates an example where purchasing, sales, and inventories are separated
[7]. In this example, a financial analyst might want to analyze historical data for purchases and sales or mine
historical data to make predictions about customer behavior.
Neha Malhotra, International Journal of Advance research, Ideas and Innovations in Technology.
© 2015, IJARIIT All Rights Reserved
Figure 1-4 Architecture of a Data Warehouse with a Staging Area and Data Marts
VII. CONCLUSION
Data mart and data warehousing are tools to assist management to come up with relevant information about the
organization at any point of time. While data marts are limited for use of a department only, data warehousing
applies to an entire organization. Data marts are easy to design and use while data warehousing is complex and
difficult to manage. Data warehousing is more useful as it can come up with information from any department
VIII. REFRENCES
1. Berry, M., J., A., & Linoff, G., S., (2000). Mastering data mining. New York: Wiley.
2. Edelstein, H., A. (1999). Introduction to data mining and knowledge discovery (3rd ed). Potomac, MD: Two
Crows Corp.
3. Fayyad, U. M., Piatetsky-Shapiro, G., Smyth, P., & Uthurusamy, R. (1996). Advances in knowledge discovery
& data mining. Cambridge, MA: MIT Press.
4. Han, J., Kamber, M. (2000). Data mining: Concepts and Techniques. New York: Morgan-Kaufman.
5. Hastie, T., Tibshirani, R., & Friedman, J. H. (2001). The elements of statistical learning: Data mining,
inference, and prediction. New York: Springer.
6. Pregibon, D. (1997). Data Mining. Statistical Computing and Graphics, 7, 8.
7. Weiss, S. M., & Indurkhya, N. (1997). Predictive data mining: A practical guide. New York: Morgan-Kaufman.
8. Westphal, C., Blaxton, T. (1998). Data mining solutions. New York: Wiley.
9. Witten, I. H., & Frank, E. (2000). Data mining. New York: Morgan-Kaufmann.
10. http://www.kron.com/nc4/contact4/stories/computer_privacy.html
11. http://www.privacyrights.org
12. http://www.cfp.org

More Related Content

PDF
[IJET-V1I5P5] Authors: T.Jalaja, M.Shailaja
PPTX
Issue in Data warehousing and OLAP in E-business
PPT
data warehousing
PPT
data warehousing
PPT
Data warehousing
PPT
Data Mining and Data Warehousing
PPTX
Data warehouse architecture
PDF
Data Warehousing & Basic Architectural Framework
[IJET-V1I5P5] Authors: T.Jalaja, M.Shailaja
Issue in Data warehousing and OLAP in E-business
data warehousing
data warehousing
Data warehousing
Data Mining and Data Warehousing
Data warehouse architecture
Data Warehousing & Basic Architectural Framework

What's hot (20)

PPTX
Data warehouse,data mining & Big Data
DOCX
Data warehousing
PPS
Data Warehouse 101
PPTX
Data warehousing ppt
PDF
Data mining & data warehousing (ppt)
PPT
Datawarehousing
PPT
Seminar datawarehousing
DOC
Oracle sql plsql & dw
PPT
Data Warehouse
DOCX
Components of a Data-Warehouse
PPT
Unit 5
PDF
Data Warehouses & Deployment By Ankita dubey
PPTX
Data warehouse presentaion
PPTX
3 tier data warehouse
 
PDF
Introduction to Data Warehousing
PDF
Data Warehouse
PDF
CTP Data Warehouse
PPT
Lecture 03 - The Data Warehouse and Design
PPT
7 data warehouse & marts
PPTX
Data warehouse logical design
Data warehouse,data mining & Big Data
Data warehousing
Data Warehouse 101
Data warehousing ppt
Data mining & data warehousing (ppt)
Datawarehousing
Seminar datawarehousing
Oracle sql plsql & dw
Data Warehouse
Components of a Data-Warehouse
Unit 5
Data Warehouses & Deployment By Ankita dubey
Data warehouse presentaion
3 tier data warehouse
 
Introduction to Data Warehousing
Data Warehouse
CTP Data Warehouse
Lecture 03 - The Data Warehouse and Design
7 data warehouse & marts
Data warehouse logical design
Ad

Viewers also liked (14)

DOCX
Implement Data Ware House
PDF
Trích dẫn Cẩm nang Marketing
PPTX
Fact table design for data ware house
PPTX
Data ware house design
PPT
Data ware house
PDF
Bài 1: Làm quen với SQL Server 2008 - Giáo trình FPT
PPTX
Data ware house architecture
PDF
Bài 2: Các khái niệm trong CSDL quan hệ - Giáo trình FPT
PDF
Data warehouse architecture
PPTX
Imagenes
DOC
Escaleras manuales CSS
PDF
Artam 2014
PDF
161201
PDF
海賊王
Implement Data Ware House
Trích dẫn Cẩm nang Marketing
Fact table design for data ware house
Data ware house design
Data ware house
Bài 1: Làm quen với SQL Server 2008 - Giáo trình FPT
Data ware house architecture
Bài 2: Các khái niệm trong CSDL quan hệ - Giáo trình FPT
Data warehouse architecture
Imagenes
Escaleras manuales CSS
Artam 2014
161201
海賊王
Ad

Similar to Implementation of Data Marts in Data ware house (20)

PPTX
Data warehouse
PDF
DMDW 1st module.pdf
DOCX
Data Mining and Warehousing -Unit III & Unit IV Notes
DOC
Data warehouse concepts
PPTX
UNIT 2 DATA WAREHOUSING AND DATA MINING PRESENTATION.pptx
PDF
(Lecture 2)Data Warehouse Architecture.pdf
DOC
Informatica and datawarehouse Material
PDF
Top 60+ Data Warehouse Interview Questions and Answers.pdf
PPTX
WEEK 1 - Data mining and Warehouse.pptx
DOCX
Data Warehose and Data Mining Unit I.docx
PPTX
Data Warehouse
PPTX
Module 1_Data Warehousing Fundamentals.pptx
PDF
Introduction to Data Warehouse
PPT
DW 101
PDF
data warehousing and data mining (1).pdf
PDF
TOPIC 9 data warehousing and data mining.pdf
PPTX
ETL processes , Datawarehouse and Datamarts.pptx
PDF
Chapter-6_BasicsOfDataIntegrationbibibibini.pdf
PPTX
CHAPTER 1 - Introdution to Datawarehousing.pptx
DOCX
UNIT-5 DATA WAREHOUSING.docx
Data warehouse
DMDW 1st module.pdf
Data Mining and Warehousing -Unit III & Unit IV Notes
Data warehouse concepts
UNIT 2 DATA WAREHOUSING AND DATA MINING PRESENTATION.pptx
(Lecture 2)Data Warehouse Architecture.pdf
Informatica and datawarehouse Material
Top 60+ Data Warehouse Interview Questions and Answers.pdf
WEEK 1 - Data mining and Warehouse.pptx
Data Warehose and Data Mining Unit I.docx
Data Warehouse
Module 1_Data Warehousing Fundamentals.pptx
Introduction to Data Warehouse
DW 101
data warehousing and data mining (1).pdf
TOPIC 9 data warehousing and data mining.pdf
ETL processes , Datawarehouse and Datamarts.pptx
Chapter-6_BasicsOfDataIntegrationbibibibini.pdf
CHAPTER 1 - Introdution to Datawarehousing.pptx
UNIT-5 DATA WAREHOUSING.docx

Recently uploaded (20)

DOCX
573137875-Attendance-Management-System-original
PPTX
FINAL REVIEW FOR COPD DIANOSIS FOR PULMONARY DISEASE.pptx
PDF
Well-logging-methods_new................
PDF
Model Code of Practice - Construction Work - 21102022 .pdf
PPTX
bas. eng. economics group 4 presentation 1.pptx
PDF
Mohammad Mahdi Farshadian CV - Prospective PhD Student 2026
PPTX
MET 305 2019 SCHEME MODULE 2 COMPLETE.pptx
PPTX
OOP with Java - Java Introduction (Basics)
PDF
composite construction of structures.pdf
PPTX
Artificial Intelligence
PPTX
CYBER-CRIMES AND SECURITY A guide to understanding
PDF
keyrequirementskkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkk
PPTX
Engineering Ethics, Safety and Environment [Autosaved] (1).pptx
PPTX
Foundation to blockchain - A guide to Blockchain Tech
PDF
Enhancing Cyber Defense Against Zero-Day Attacks using Ensemble Neural Networks
PPT
Project quality management in manufacturing
PPTX
Safety Seminar civil to be ensured for safe working.
PPTX
additive manufacturing of ss316l using mig welding
PPTX
Construction Project Organization Group 2.pptx
PDF
SM_6th-Sem__Cse_Internet-of-Things.pdf IOT
573137875-Attendance-Management-System-original
FINAL REVIEW FOR COPD DIANOSIS FOR PULMONARY DISEASE.pptx
Well-logging-methods_new................
Model Code of Practice - Construction Work - 21102022 .pdf
bas. eng. economics group 4 presentation 1.pptx
Mohammad Mahdi Farshadian CV - Prospective PhD Student 2026
MET 305 2019 SCHEME MODULE 2 COMPLETE.pptx
OOP with Java - Java Introduction (Basics)
composite construction of structures.pdf
Artificial Intelligence
CYBER-CRIMES AND SECURITY A guide to understanding
keyrequirementskkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkk
Engineering Ethics, Safety and Environment [Autosaved] (1).pptx
Foundation to blockchain - A guide to Blockchain Tech
Enhancing Cyber Defense Against Zero-Day Attacks using Ensemble Neural Networks
Project quality management in manufacturing
Safety Seminar civil to be ensured for safe working.
additive manufacturing of ss316l using mig welding
Construction Project Organization Group 2.pptx
SM_6th-Sem__Cse_Internet-of-Things.pdf IOT

Implementation of Data Marts in Data ware house

  • 1. Neha Malhotra, International Journal of Advance research, Ideas and Innovations in Technology. © 2015, IJARIIT All Rights Reserved ISSN: 2454-132X (Volume1, Issue 2) Implementation of Data Marts in Data ware house Neha Malhotra nehamalhotra588@gmail.com Abstract: A data mart is a persistent physical store of operational and aggregated data statistically processed data that supports businesspeople in making decisions based primarily on analyses of past activities and results. A data mart contains a predefined subset of enterprise data organized for rapid analysis and reporting. Data warehousing has come into being because the file structure of the large mainframe core business systems is inimical to information retrieval. The purpose of the data warehouse is to combine core business and data from other sources in a format that facilitates reporting and decision support. In just a few years, data warehouses have evolved from large, centralized data repositories to subject specific, but independent, data marts and now to dependent marts that load data from a central repository of Data Staging files that has previously extracted data from the institution’s operational business systems (e.g., student record, finance and human resource systems, etc.). I. INTRODUCTION To understand what a data mart is, we must first know a little about data warehouses. The concept of data warehousing first appeared in the late 1980s in articles published by Bill Inmon and others. Essentially, a data warehouse involves a systematic approach to gathering, organizing, and storing data–generally from internal production sources but also from external providers–to create a definitive source of information for analysis and decision support [1]. A data mart is basically a condensed and more focused version of a data warehouse that reflects the regulations and process specifications of each business unit within an organization. Each data mart is dedicated to a specific business function or region. This subset of data may span across many or all of an enterprise’s functional subject areas [2]. II. DEPENDENT AND INDEPENDENT DATAMARTS There are two basic types of data marts: dependent and independent. The categorization is based primarily on the data source that feeds the data mart [3]. Dependent data marts draw data from a central data warehouse that has already been created. Independent data marts, in contrast, are standalone systems built by drawing data directly from operational or external sources of data, or both. The main difference between independent and dependent data marts is how you populate the data mart; that is, how you get data out of the sources and into the data mart [4]. This step, called the Extraction-Transformation-and Loading (ETL) process, involves moving data from operational systems, filtering it, and loading it into the data mart. With dependent data marts, this process is somewhat simplified because formatted and summarized (clean) data has already been loaded into the central data warehouse [5]. The ETL process for dependent data marts is mostly a
  • 2. Neha Malhotra, International Journal of Advance research, Ideas and Innovations in Technology. © 2015, IJARIIT All Rights Reserved process of identifying the right subset of data relevant to the chosen data mart subject and moving a copy of it, perhaps in a summarized form. With independent data marts, however, you must deal with all aspects of the ETL process, much as you do with a central data warehouse. The number of sources is likely to be fewer and the amount of data associated with the data mart is less than the warehouse, given your focus on a single subject [3]. The motivations behind the creation of these two types of data marts are also typically different. Dependent data marts are usually built to achieve improved performance and availability, better control, and lower telecommunication costs resulting from local access of data relevant to a specific department. The creation of independent data marts is often driven by the need to have a solution within a shorter time [6]. III. DIFFERENCE BETWEEN DATA WAREHOUSING AND DATA MART It is important to note that there are huge differences between these two tools though they may serve same purpose. Firstly, data mart contains programs, data, software and hardware of a specific department of a company. There can be separate data marts for finance, sales, production or marketing. All these data marts are different but they can be coordinated. Data mart of one department is different from data mart of another department, and though indexed, this system is not suitable for a huge data base as it is designed to meet the requirements of a particular department. Data Warehousing is not limited to a particular department and it represents the database of a complete organization. The data stored in data warehouse is more detailed though indexing is light as it has to store huge amounts of information. It is also difficult to manage and takes a long time to process. It implies then that data marts are quick and easy to use, as they make use of small amounts of data. Data warehousing is also more expensive because of the same reason. IV. FUNCTIONAL OVERVIEW While it is recognized that Data Warehouse systems each have their own unique characteristics, there are certain generic characteristics shared by the family of systems known as Data Warehouses. These include: 1. The prime purpose of a Data Warehouse is to store, in one system, data and information that originates from multiple applications within, or across, organizations. The data may be stored ‘as received’ from the source application, or it may be processed upon input to validate, translate, aggregate or derive new data/information. 2. Most of the data load functions are processed in batch. There are few on-line data maintenance functions. The on-line functions that do exist tend to update the reference files and data translation tables. 3. A database alone does not constitute a Data Warehouse system. At a minimum, a Data Warehouse system must include the database and corresponding load functions. Data reporting functions are optional. They may or may not be an integral part of the Data Warehouse system. 4. The prime purpose of storing the data is to support the information reporting requirements of an organization, i.e. multiple users and multiple applications. 5. The Data Warehouse may or may not provide the required reporting functions. In some cases, external applications access the Data Warehouse files to generate their own reports and queries. 6. Data Warehouse functions are often based upon packaged software. An example is the Business Objects product. 7. Where the data within the Warehouse supports the reporting requirements of multiple applications and users, the data may be physically stored based upon the user requirements. Separate database segments may store the ‘user views’ for a particular user. This results in the physical storage of a considerable amount of redundant data. The data storage approach is designed to optimize data/information retrieval.
  • 3. Neha Malhotra, International Journal of Advance research, Ideas and Innovations in Technology. © 2015, IJARIIT All Rights Reserved V. CONTRASTING OLTP AND DATA WAREHOUSING ENVIRONMENTS Figure 1-1 illustrates key differences between an OLTP system and a data warehouse. One major difference between the types of system is that data warehouses are not usually in third normal form (3NF), a type of data normalization common in OLTP environments. Data warehouses and OLTP systems have very different requirements. Here are some examples of differences between typical data warehouses and OLTP systems:  Workload Data warehouses are designed to accommodate ad hoc queries and data analysis. You might not know the workload of your data warehouse in advance, so a data warehouse should be optimized to perform well for a wide variety of possible query and analytical operations. OLTP systems support only predefined operations. Your applications might be specifically tuned or designed to support only these operations.  Data modifications A data warehouse is updated on a regular basis by the ETL process (run nightly or weekly) using bulk data modification techniques. The end users of a data warehouse do not directly update the data warehouse except when using analytical tools, such as data mining, to make predictions with associated probabilities, assign customers to market segments, and develop customer profiles [7]. In OLTP systems, end users routinely issue individual data modification statements to the database. The OLTP database is always up to date, and reflects the current state of each business transaction.  Schema design Data warehouses often use denormalized or partially denormalized schemas (such as a star schema) to optimize query and analytical performance..  Typical operations [5] A typical data warehouse query scans thousands or millions of rows. For example, "Find the total sales for all customers last month." A typical OLTP operation accesses only a handful of records. For example, "Retrieve the current order for this customer."  Historical data Data warehouses usually store many months or years of data. This is to support historical analysis and reporting. OLTP systems usually store data from only a few weeks or months. The OLTP system stores only historical data as needed to successfully meet the requirements of the current transaction.
  • 4. Neha Malhotra, International Journal of Advance research, Ideas and Innovations in Technology. © 2015, IJARIIT All Rights Reserved Figure 1-1 Contrasting OLTP and Data Warehousing Environments VI. DATA WAREHOUSE ARCHITECTURES Data warehouses and their architectures vary depending upon the specifics of an organization's situation. Three common architectures are [3]:  Data Warehouse Architecture: Basic  Data Warehouse Architecture: with a Staging Area  Data Warehouse Architecture: with a Staging Area and Data Marts Data Warehouse Architecture: Basic Figure 1-2 shows a simple architecture for a data warehouse. End users directly access data derived from several source systems through the data warehouse. Figure 1-2 Architecture of a Data Warehouse
  • 5. Neha Malhotra, International Journal of Advance research, Ideas and Innovations in Technology. © 2015, IJARIIT All Rights Reserved In Figure 1-2, the metadata and raw data of a traditional OLTP system is present, as is an additional type of data, summary data. Summaries are very valuable in data warehouses because they pre-compute long operations in advance. For example, a typical data warehouse query is to retrieve something such as August sales. A summary in an Oracle database is called a materialized view [8]. Data Warehouse Architecture: with a Staging Area You need to clean and process your operational data before putting it into the warehouse, as shown in Figure 1-2. You can do this programmatically, although most data warehouses use a staging area instead. A staging area simplifies building summaries and general warehouse management. Figure 1-3 illustrates this typical architecture. Figure 1-3 Architecture of a Data Warehouse with a Staging Area Data Warehouse Architecture: with a Staging Area and Data Marts Although the architecture in Figure 1-3 is quite common, you may want to customize your warehouse's architecture for different groups within your organization. You can do this by adding data marts, which are systems designed for a particular line of business. Figure 1-4 illustrates an example where purchasing, sales, and inventories are separated [7]. In this example, a financial analyst might want to analyze historical data for purchases and sales or mine historical data to make predictions about customer behavior.
  • 6. Neha Malhotra, International Journal of Advance research, Ideas and Innovations in Technology. © 2015, IJARIIT All Rights Reserved Figure 1-4 Architecture of a Data Warehouse with a Staging Area and Data Marts VII. CONCLUSION Data mart and data warehousing are tools to assist management to come up with relevant information about the organization at any point of time. While data marts are limited for use of a department only, data warehousing applies to an entire organization. Data marts are easy to design and use while data warehousing is complex and difficult to manage. Data warehousing is more useful as it can come up with information from any department VIII. REFRENCES 1. Berry, M., J., A., & Linoff, G., S., (2000). Mastering data mining. New York: Wiley. 2. Edelstein, H., A. (1999). Introduction to data mining and knowledge discovery (3rd ed). Potomac, MD: Two Crows Corp. 3. Fayyad, U. M., Piatetsky-Shapiro, G., Smyth, P., & Uthurusamy, R. (1996). Advances in knowledge discovery & data mining. Cambridge, MA: MIT Press. 4. Han, J., Kamber, M. (2000). Data mining: Concepts and Techniques. New York: Morgan-Kaufman. 5. Hastie, T., Tibshirani, R., & Friedman, J. H. (2001). The elements of statistical learning: Data mining, inference, and prediction. New York: Springer. 6. Pregibon, D. (1997). Data Mining. Statistical Computing and Graphics, 7, 8. 7. Weiss, S. M., & Indurkhya, N. (1997). Predictive data mining: A practical guide. New York: Morgan-Kaufman. 8. Westphal, C., Blaxton, T. (1998). Data mining solutions. New York: Wiley. 9. Witten, I. H., & Frank, E. (2000). Data mining. New York: Morgan-Kaufmann. 10. http://www.kron.com/nc4/contact4/stories/computer_privacy.html 11. http://www.privacyrights.org 12. http://www.cfp.org