SlideShare a Scribd company logo
Yogesh Benawat Sameer Deshmukh
Outline Data Mining  Data Warehousing  Q ‘n’ A Conclusion
Historical Perspective 1960s: Data collection, database creation, IMS and network DBMS 1970s:  Relational data model, relational DBMS implementation 1980s:  RDBMS, advanced data models (extended-relational, OO, deductive, etc.) and application-oriented DBMS (spatial, scientific, engineering, etc.) 1990s —2000s :  Data mining and data warehousing, multimedia databases, and Web databases
Data Mining
Definition Data mining automates the process of locating and extracting the hidden patterns and knowledge   In simple words Searching for new knowledge
Why we need data mining Data explosion problem  Automated data collection tools and mature database technology lead to tremendous amounts of data stored in databases, data warehouses and other information repositories  We are drowning in data, but starving for knowledge!  Solution: Data mining Data warehousing and on-line analytical processing Extraction of interesting knowledge (rules, regularities,  patterns, constraints) from data in large databases
Data Mining Models Predictive Model Descriptive Model
Predictive Model Prediction determining how certain attributes will behave in the future Regression mapping of data item to real valued prediction variable Classification categorization of data based on combinations of attributes   Time Series analysis examining values of attributes with respect to time
Descriptive Model Clustering  most closely data clubbed together into clusters Data Summarization  extracting representative information about database Association Rules  associativity defined between data items to form relationship Sequence Discovery it is used to determine sequential patterns in data based on time sequence of action
Data mining process Fig. General Phases of Data Mining Process Problem Definition Creating Database Exploring database Preparation for creating a data mining model Building Data Mining Model Evaluation Phase Deploying the Data Mining model
Who needs data mining? Whoever has information fastest and uses it wins Don McKeough former president of Coke Cola   Businesses are looking for new ways to let end users find the data they need to:  make decisions  Serve customers Gain the competitive edge
Applications Business analysis and management  Computer security  Customer relationships analysis and management  Telecommunication analysis and management  News and entertainment  Bioinformatics and Healthcare analysis
Summary Need of data mining Data mining models Process of data mining Some applications
Data Warehousing
Data Warehousing  Data Warehouse What is Data Warehouse? Database & Data Warehouse. How to distinguish? Purpose Database : Transactional Data Warehouse :Intended for Decision Supporting    Applications. Functionality Optimized for data retrieval, not routine transaction processing.  Structure Performance
Data Warehousing Modern Organization’s needs ? Companies spread world wide. Have  So many  Data Sources Different  Operational Systems Different  Schemas Need Data for  Complex Analysis Knowledge Discovery   Decision Making . Solution ???
Data Warehousing  Solution … Data Warehouse. Data Warehouse .  Definition ?? No single definition…. Data Warehouse Collection of Information gathered from  multiple sources , stored under  unified schema , at a  single site  & mainly intended  for  decision support  applications.  A subject oriented, integrated, nonvolatile, time-variant, collection of data in support of management’s decision.   ~  W.H. Inmon
Warehouses are Very Large Databases 35% 30% 25% 20% 15% 10% 5% 0% 5GB 5-9GB 10-19GB 50-99GB 250-499GB 20-49GB 100-249GB 500GB-1TB Initial Projected 2Q96 Source: META Group, Inc. Respondents
Data Warehousing  Data Warehouse - Architecture
Data Warehousing Data Warehouse building When & how to gather data Source-driven architecture   Destination-driven architecture What schema to use  Data Cleansing Task of correcting and processing data How to propagate updates What data to summarize And many more……
Summary  What is Data Warehousing? Data Warehouse. Data Warehouse – Architecture Data Warehouse vs. Data Mining
Conclusion Your data is full of undiscovered gems; start digging!
References  Data Mining Introductory and advanced Topics Margaret H. Dunham Modern Data Warehousing, Mining, and visualization   George M. Marakas Data Mining    BPB Publications  Database System Concepts   Silbershatz, Korth, Sudarshan www.statoo.info/ www.crm2day.com/ www.trilliumsoftware.com/
Q ‘n’ A
Thank You!

More Related Content

PPTX
Data warehousing and Data mining
PDF
Modern data warehouse
PPTX
Data warehouse
PPTX
Solution architecture for big data projects
PPT
Data Warehousing and Mining
PPTX
DATA MART APPROCHES TO ARCHITECTURE
PPTX
Data mining and data warehousing
PPTX
From Traditional Data Warehouse To Real Time Data Warehouse
Data warehousing and Data mining
Modern data warehouse
Data warehouse
Solution architecture for big data projects
Data Warehousing and Mining
DATA MART APPROCHES TO ARCHITECTURE
Data mining and data warehousing
From Traditional Data Warehouse To Real Time Data Warehouse

What's hot (20)

PPTX
Data warehousing
PDF
Enterprise Data Lake - Scalable Digital
PDF
Data Warehouse
PPTX
introduction to data warehousing and mining
PDF
Big Data Day LA 2015 - Data Lake - Re Birth of Enterprise Data Thinking by Ra...
PPTX
Prcn 2019 stage 1264-question-presentation_poster file_id-15
PPTX
Business intelligence and data warehousing
PDF
Are You Killing the Benefits of Your Data Lake?
PDF
Data Integration Alternatives: When to use Data Virtualization, ETL, and ESB
PPTX
Real World Business Intelligence and Data Warehousing
PPTX
data warehouse , data mart, etl
ODP
Introduction To Data Warehousing
PDF
Performance Acceleration: Summaries, Recommendation, MPP and more
PPTX
Building a Big Data Solution
PDF
Splunk Business Analytics
PDF
Big Data and Data Virtualization
PDF
Understanding Metadata: Why it's essential to your big data solution and how ...
PPTX
DW Appliance
PDF
An introduction to data virtualization in business intelligence
PDF
Building a Modern Data Architecture by Ben Sharma at Strata + Hadoop World Sa...
Data warehousing
Enterprise Data Lake - Scalable Digital
Data Warehouse
introduction to data warehousing and mining
Big Data Day LA 2015 - Data Lake - Re Birth of Enterprise Data Thinking by Ra...
Prcn 2019 stage 1264-question-presentation_poster file_id-15
Business intelligence and data warehousing
Are You Killing the Benefits of Your Data Lake?
Data Integration Alternatives: When to use Data Virtualization, ETL, and ESB
Real World Business Intelligence and Data Warehousing
data warehouse , data mart, etl
Introduction To Data Warehousing
Performance Acceleration: Summaries, Recommendation, MPP and more
Building a Big Data Solution
Splunk Business Analytics
Big Data and Data Virtualization
Understanding Metadata: Why it's essential to your big data solution and how ...
DW Appliance
An introduction to data virtualization in business intelligence
Building a Modern Data Architecture by Ben Sharma at Strata + Hadoop World Sa...
Ad

Similar to Data Mining and Data Warehousing (20)

PPT
Data Warehouse and Data Mining
PPT
dwdm unit 1.ppt
PPT
E06WarehouseDesign.pptxkjhjkljhlkjhlkhlkj
PPT
Dwdmunit1 a
PPT
E06WarehouseDesignissuesindatawarehousedesign.ppt
PPT
hanjia chapter_1.ppt data mining chapter 1
PPTX
Business Intelligence Module 3_Datawarehousing.pptx
PPTX
Big Data Session 1.pptx
DOCX
Abstract
PPT
Introduction of Data Mining - Concept and techniques
PPTX
Lect 1 introduction
PPTX
Introduction to data mining and data warehousing
PPTX
Big data by Mithlesh sadh
PPT
Chapter 1. Introduction.ppt
PPTX
DATA MINING AND WAREHOUSING_MBA_MIS_BMB208
PPT
01Intro.ppt data analytics r language slide 1
PPTX
Data mining and Data Warehousing in Databases.pptx
PPTX
dataminingintroductionpptpptpptptro.pptx
PPTX
DWDM 3rd EDITION TEXT BOOK SLIDES24.pptx
PPT
DATA MINING: INTRODUCTION TO DATA MINING
Data Warehouse and Data Mining
dwdm unit 1.ppt
E06WarehouseDesign.pptxkjhjkljhlkjhlkhlkj
Dwdmunit1 a
E06WarehouseDesignissuesindatawarehousedesign.ppt
hanjia chapter_1.ppt data mining chapter 1
Business Intelligence Module 3_Datawarehousing.pptx
Big Data Session 1.pptx
Abstract
Introduction of Data Mining - Concept and techniques
Lect 1 introduction
Introduction to data mining and data warehousing
Big data by Mithlesh sadh
Chapter 1. Introduction.ppt
DATA MINING AND WAREHOUSING_MBA_MIS_BMB208
01Intro.ppt data analytics r language slide 1
Data mining and Data Warehousing in Databases.pptx
dataminingintroductionpptpptpptptro.pptx
DWDM 3rd EDITION TEXT BOOK SLIDES24.pptx
DATA MINING: INTRODUCTION TO DATA MINING
Ad

Recently uploaded (20)

PPTX
Big Data Technologies - Introduction.pptx
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PDF
CIFDAQ's Market Insight: SEC Turns Pro Crypto
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PDF
Unlocking AI with Model Context Protocol (MCP)
PPTX
A Presentation on Artificial Intelligence
PPTX
Cloud computing and distributed systems.
PDF
Spectral efficient network and resource selection model in 5G networks
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PPTX
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication
PDF
Modernizing your data center with Dell and AMD
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PDF
NewMind AI Weekly Chronicles - August'25 Week I
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PDF
Encapsulation_ Review paper, used for researhc scholars
Big Data Technologies - Introduction.pptx
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Understanding_Digital_Forensics_Presentation.pptx
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
Reach Out and Touch Someone: Haptics and Empathic Computing
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
CIFDAQ's Market Insight: SEC Turns Pro Crypto
Diabetes mellitus diagnosis method based random forest with bat algorithm
Unlocking AI with Model Context Protocol (MCP)
A Presentation on Artificial Intelligence
Cloud computing and distributed systems.
Spectral efficient network and resource selection model in 5G networks
Building Integrated photovoltaic BIPV_UPV.pdf
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication
Modernizing your data center with Dell and AMD
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
NewMind AI Weekly Chronicles - August'25 Week I
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
Encapsulation_ Review paper, used for researhc scholars

Data Mining and Data Warehousing

  • 2. Outline Data Mining Data Warehousing Q ‘n’ A Conclusion
  • 3. Historical Perspective 1960s: Data collection, database creation, IMS and network DBMS 1970s: Relational data model, relational DBMS implementation 1980s: RDBMS, advanced data models (extended-relational, OO, deductive, etc.) and application-oriented DBMS (spatial, scientific, engineering, etc.) 1990s —2000s : Data mining and data warehousing, multimedia databases, and Web databases
  • 5. Definition Data mining automates the process of locating and extracting the hidden patterns and knowledge In simple words Searching for new knowledge
  • 6. Why we need data mining Data explosion problem Automated data collection tools and mature database technology lead to tremendous amounts of data stored in databases, data warehouses and other information repositories We are drowning in data, but starving for knowledge! Solution: Data mining Data warehousing and on-line analytical processing Extraction of interesting knowledge (rules, regularities, patterns, constraints) from data in large databases
  • 7. Data Mining Models Predictive Model Descriptive Model
  • 8. Predictive Model Prediction determining how certain attributes will behave in the future Regression mapping of data item to real valued prediction variable Classification categorization of data based on combinations of attributes Time Series analysis examining values of attributes with respect to time
  • 9. Descriptive Model Clustering most closely data clubbed together into clusters Data Summarization extracting representative information about database Association Rules associativity defined between data items to form relationship Sequence Discovery it is used to determine sequential patterns in data based on time sequence of action
  • 10. Data mining process Fig. General Phases of Data Mining Process Problem Definition Creating Database Exploring database Preparation for creating a data mining model Building Data Mining Model Evaluation Phase Deploying the Data Mining model
  • 11. Who needs data mining? Whoever has information fastest and uses it wins Don McKeough former president of Coke Cola Businesses are looking for new ways to let end users find the data they need to: make decisions Serve customers Gain the competitive edge
  • 12. Applications Business analysis and management Computer security Customer relationships analysis and management Telecommunication analysis and management News and entertainment Bioinformatics and Healthcare analysis
  • 13. Summary Need of data mining Data mining models Process of data mining Some applications
  • 15. Data Warehousing Data Warehouse What is Data Warehouse? Database & Data Warehouse. How to distinguish? Purpose Database : Transactional Data Warehouse :Intended for Decision Supporting Applications. Functionality Optimized for data retrieval, not routine transaction processing. Structure Performance
  • 16. Data Warehousing Modern Organization’s needs ? Companies spread world wide. Have So many Data Sources Different Operational Systems Different Schemas Need Data for Complex Analysis Knowledge Discovery Decision Making . Solution ???
  • 17. Data Warehousing Solution … Data Warehouse. Data Warehouse . Definition ?? No single definition…. Data Warehouse Collection of Information gathered from multiple sources , stored under unified schema , at a single site & mainly intended for decision support applications. A subject oriented, integrated, nonvolatile, time-variant, collection of data in support of management’s decision. ~ W.H. Inmon
  • 18. Warehouses are Very Large Databases 35% 30% 25% 20% 15% 10% 5% 0% 5GB 5-9GB 10-19GB 50-99GB 250-499GB 20-49GB 100-249GB 500GB-1TB Initial Projected 2Q96 Source: META Group, Inc. Respondents
  • 19. Data Warehousing Data Warehouse - Architecture
  • 20. Data Warehousing Data Warehouse building When & how to gather data Source-driven architecture Destination-driven architecture What schema to use Data Cleansing Task of correcting and processing data How to propagate updates What data to summarize And many more……
  • 21. Summary What is Data Warehousing? Data Warehouse. Data Warehouse – Architecture Data Warehouse vs. Data Mining
  • 22. Conclusion Your data is full of undiscovered gems; start digging!
  • 23. References Data Mining Introductory and advanced Topics Margaret H. Dunham Modern Data Warehousing, Mining, and visualization George M. Marakas Data Mining BPB Publications Database System Concepts Silbershatz, Korth, Sudarshan www.statoo.info/ www.crm2day.com/ www.trilliumsoftware.com/