SlideShare a Scribd company logo
MCA535 Data Mining and Data Warehousing
Total teaching Hours/Semester: 60 No of Lecture Hours/Week: 04
Unit I. (12)
Introduction
What Is Data Mining? Data Mining—On What Kind of Data? Data Mining
Functionalities, Classification of Data Mining Systems, Data Mining Task Primitives,
Integration of a Data Mining System with a Database or DataWarehouse System, Major
Issues in Data Mining.
Data Preprocessing
Why Preprocess the Data? Descriptive Data Summarization – Measuring the central
tendency- Measuring the dispersion of data.
Unit II. (12)
Data Preprocessing (cont.)
Data Cleaning-Missing Values-Noisy Data-Data Cleaning as a Process, Data Integration
and Transformation, Data Reduction-Data Cube Aggregation-Attribute Subset Selection-
Dimensionality Reduction-Numerosity Reduction.
DataWarehouse and OLAP Technology
What Is a DataWarehouse? A Multidimensional Data Model, DataWarehouse
Architecture, DataWarehouse Implementation, From DataWarehousing to Data Mining.
Unit III. (12)
Data Cube Computation and Data Generalization
Efficient Methods for Data Cube Computation – Road map - Multiway array aggregation
– Star cubing, Further Development of Data Cube and OLAP Technology.
Mining Frequent Patterns and Associations
Basic Concepts, Efficient and Scalable Frequent Itemset Mining Methods – Apriori
algorithm, Generating Rules – Improving efficiency – Mining frequent itemset without
candidate generation.
Unit IV. (14)
Classification and Prediction
What Is Classification? What Is Prediction? Issues Regarding Classification and
Prediction, Classification by Decision Tree – Decision tree induction – Attribute
selection, Bayesian Classification – Bayesian Theorem - naïve Bayesian, Rule-Based
Classification, Prediction, Accuracy and Error Measures.
Cluster Analysis
What Is Cluster Analysis? Types of Data in Cluster Analysis, A Categorization of Major
Clustering Methods, Partitioning Methods – K-Means and K-Medoids, Hierarchical
Methods – Agglomerative and Divisive, Density Based Methods - DBSCAN, Outlier
Analysis – Statistical based.
Syllabus 2009 MCA 84
Christ University, Bangalore, India
Unit V. (10)
Mining Time-Series and Spatial Data
Mining Time-Series Data – Trend analysis – Similarity search, Spatial Data Mining-
Spatial Data Cube Construction and Spatial OLAP-Mining Spatial Association and Colocation
Patterns-Spatial Clustering, Classification Methods-Mining Raster Databases
Applications and Trends in Data Mining
Data Mining Applications, Data Mining System Products and Research Prototypes,
Social Impacts of Data Mining.
Text Book:
1. Jiawei Han and Micheline Kamber, Data Mining: Concepts and Techniques,
Morgan Kaufmann Publishers, San Francisco, USA, 2nd edition, 2006.
Reference Books:
1. Claudia Imhoff, Nicholas & et al, Mastering Data warehouse Design, J Wiley,
2003
2. Berson A & Smith S J, Data warehousing, Data Mining & OLAP, Mc Graw
Hall, 1997.
3. Margaret H. Dunham, Data mining-Introductory and Advanced topics Pearson
Education, 2003
4. Inmon W H, Building the Data Warehouse, John Wiley & Sons, 3rd edition,
2005

More Related Content

PDF
PATTERN GENERATION FOR COMPLEX DATA USING HYBRID MINING
PPT
Chapter - 7 Data Mining Concepts and Techniques 2nd Ed slides Han & Kamber
PDF
Combined mining approach to generate patterns for complex data
PDF
COMBINED MINING APPROACH TO GENERATE PATTERNS FOR COMPLEX DATA
DOCX
knowledge discovery and data mining approach in databases (2)
PDF
An Efficient Compressed Data Structure Based Method for Frequent Item Set Mining
PPT
Knowledge Discovery Using Data Mining
PDF
Introduction to Data Mining
PATTERN GENERATION FOR COMPLEX DATA USING HYBRID MINING
Chapter - 7 Data Mining Concepts and Techniques 2nd Ed slides Han & Kamber
Combined mining approach to generate patterns for complex data
COMBINED MINING APPROACH TO GENERATE PATTERNS FOR COMPLEX DATA
knowledge discovery and data mining approach in databases (2)
An Efficient Compressed Data Structure Based Method for Frequent Item Set Mining
Knowledge Discovery Using Data Mining
Introduction to Data Mining

What's hot (20)

PPTX
Additional themes of data mining for Msc CS
PPT
introduction to data mining tutorial
PPTX
Project 0th Review
PPT
Data mining in agriculture
PPT
Chapter - 5 Data Mining Concepts and Techniques 2nd Ed slides Han & Kamber
PPTX
R project
PPT
Chapter - 8.1 Data Mining Concepts and Techniques 2nd Ed slides Han & Kamber
PPT
Data Mining Overview
PPTX
Introduction to Data Mining
PDF
A CONCEPTUAL METADATA FRAMEWORK FOR SPATIAL DATA WAREHOUSE
PPTX
Knowledge Discovery and Data Mining
PPT
Chaper 13 trend
PPTX
Data mining primitives
PPTX
Data Mining: Application and trends in data mining
PPTX
Data mining concepts
PPT
Data mining and knowledge Discovery
PPTX
Data mining and knowledge discovery
PPT
Introduction to Data Mining
PPTX
DM
PDF
Improved K-mean Clustering Algorithm for Prediction Analysis using Classifica...
Additional themes of data mining for Msc CS
introduction to data mining tutorial
Project 0th Review
Data mining in agriculture
Chapter - 5 Data Mining Concepts and Techniques 2nd Ed slides Han & Kamber
R project
Chapter - 8.1 Data Mining Concepts and Techniques 2nd Ed slides Han & Kamber
Data Mining Overview
Introduction to Data Mining
A CONCEPTUAL METADATA FRAMEWORK FOR SPATIAL DATA WAREHOUSE
Knowledge Discovery and Data Mining
Chaper 13 trend
Data mining primitives
Data Mining: Application and trends in data mining
Data mining concepts
Data mining and knowledge Discovery
Data mining and knowledge discovery
Introduction to Data Mining
DM
Improved K-mean Clustering Algorithm for Prediction Analysis using Classifica...
Ad

Similar to Mca535 data mining and data warehousing (20)

PDF
17 cs002
PPT
Data mining techniques unit 1
PPT
Dwdmunit1 a
DOC
MIS 542 Syllabus 08.doc
PPT
Data Mining Xuequn Shang NorthWestern Polytechnical University
DOCX
Maintenance for university course out line.docx
PPTX
PowerPoint Template
PDF
Chapter 1 Handoutfffffffffffffffffffffffffffffffffffff.pdf
PPTX
Human resource assignment help
PDF
Lect 1 introduction
PDF
PDF
Dbm630 Lecture01
PPT
Data mining - GDi Techno Solutions
PPT
Data Mining- Unit-I PPT (1).ppt
PPTX
Introduction to dm and dw
PPT
Data Warehouse and Data Mining
PPTX
PPT
dwdm unit 1.ppt
PPT
DMML1_overview.ppt
17 cs002
Data mining techniques unit 1
Dwdmunit1 a
MIS 542 Syllabus 08.doc
Data Mining Xuequn Shang NorthWestern Polytechnical University
Maintenance for university course out line.docx
PowerPoint Template
Chapter 1 Handoutfffffffffffffffffffffffffffffffffffff.pdf
Human resource assignment help
Lect 1 introduction
Dbm630 Lecture01
Data mining - GDi Techno Solutions
Data Mining- Unit-I PPT (1).ppt
Introduction to dm and dw
Data Warehouse and Data Mining
dwdm unit 1.ppt
DMML1_overview.ppt
Ad

Mca535 data mining and data warehousing

  • 1. MCA535 Data Mining and Data Warehousing Total teaching Hours/Semester: 60 No of Lecture Hours/Week: 04 Unit I. (12) Introduction What Is Data Mining? Data Mining—On What Kind of Data? Data Mining Functionalities, Classification of Data Mining Systems, Data Mining Task Primitives, Integration of a Data Mining System with a Database or DataWarehouse System, Major Issues in Data Mining. Data Preprocessing Why Preprocess the Data? Descriptive Data Summarization – Measuring the central tendency- Measuring the dispersion of data. Unit II. (12) Data Preprocessing (cont.) Data Cleaning-Missing Values-Noisy Data-Data Cleaning as a Process, Data Integration and Transformation, Data Reduction-Data Cube Aggregation-Attribute Subset Selection- Dimensionality Reduction-Numerosity Reduction. DataWarehouse and OLAP Technology What Is a DataWarehouse? A Multidimensional Data Model, DataWarehouse Architecture, DataWarehouse Implementation, From DataWarehousing to Data Mining. Unit III. (12) Data Cube Computation and Data Generalization Efficient Methods for Data Cube Computation – Road map - Multiway array aggregation – Star cubing, Further Development of Data Cube and OLAP Technology. Mining Frequent Patterns and Associations Basic Concepts, Efficient and Scalable Frequent Itemset Mining Methods – Apriori algorithm, Generating Rules – Improving efficiency – Mining frequent itemset without candidate generation. Unit IV. (14) Classification and Prediction What Is Classification? What Is Prediction? Issues Regarding Classification and Prediction, Classification by Decision Tree – Decision tree induction – Attribute selection, Bayesian Classification – Bayesian Theorem - naïve Bayesian, Rule-Based Classification, Prediction, Accuracy and Error Measures. Cluster Analysis What Is Cluster Analysis? Types of Data in Cluster Analysis, A Categorization of Major Clustering Methods, Partitioning Methods – K-Means and K-Medoids, Hierarchical Methods – Agglomerative and Divisive, Density Based Methods - DBSCAN, Outlier Analysis – Statistical based. Syllabus 2009 MCA 84 Christ University, Bangalore, India Unit V. (10) Mining Time-Series and Spatial Data Mining Time-Series Data – Trend analysis – Similarity search, Spatial Data Mining- Spatial Data Cube Construction and Spatial OLAP-Mining Spatial Association and Colocation Patterns-Spatial Clustering, Classification Methods-Mining Raster Databases Applications and Trends in Data Mining
  • 2. Data Mining Applications, Data Mining System Products and Research Prototypes, Social Impacts of Data Mining. Text Book: 1. Jiawei Han and Micheline Kamber, Data Mining: Concepts and Techniques, Morgan Kaufmann Publishers, San Francisco, USA, 2nd edition, 2006. Reference Books: 1. Claudia Imhoff, Nicholas & et al, Mastering Data warehouse Design, J Wiley, 2003 2. Berson A & Smith S J, Data warehousing, Data Mining & OLAP, Mc Graw Hall, 1997. 3. Margaret H. Dunham, Data mining-Introductory and Advanced topics Pearson Education, 2003 4. Inmon W H, Building the Data Warehouse, John Wiley & Sons, 3rd edition, 2005