SlideShare a Scribd company logo
DATA MINING
TECHNIQUES
DATA WAREHOUSING
TOP DATA MINING TECHNIQUES
Data Warehousing
• A data warehouse is a collection of
databases that work together. A data
warehouse makes it possible to integrate
data from multiple databases, which can
give new insights into the data.
• The ultimate goal of a database is not just
to store data, but to help businesses make
decisions based on that data.
What is data mining?
• Queries based on SQL, a database
programming language, are used to
answer basic questions about data.
• But, as the collection of data grows in a
database, the amount of data can easily
become overwhelming.
• Data mining is the process of analyzing
data and summarizing it to produce useful
information
Data sets
• The complete set of data available to us
for an application is called a dataset. A
dataset is often depicted as a table, with
each row representing an instance. Each
column contains the value of one of the
variables (attributes) for each of the
instances.
Goal of Data Mining
• The overall goal of the data mining
process is to extract information from a
data set and transform it into an
understandable structure for further use.
• Data mining is the computing process of
discovering patterns in large data sets
involving methods.
• These patterns can be used for predictive
analytics.
Techniques used for mining
• CLASSIFICATION ANALYSIS
• ASSOCIATION RULE LEARNING
• ANOMALY OR OUTLIER DETECTION
• CLUSTERING ANALYSIS
• REGRESSION ANALYSIS
CLUSTERING ANALYSIS
• Clustering analysis is the process of
discovering groups and clusters in the
data in such a way that the degree of
association between two objects is highest
if they belong to the same group and
lowest otherwise.
• The cluster is actually a collection of data
objects.
ASSOCIATION RULE
LEARNING
• It refers to the method that can help you
identify some interesting relations between
different variables in large databases.
• Association rules are useful for examining
and forecasting customer behavior. It is
highly recommended in the retail industry
analysis.
ANOMALY OR OUTLIER
DETECTION
• This refers to the observation for data
items in a dataset that do not match an
expected pattern or an expected behavior.
• This technique can be used in a variety of
domains, such as intrusion detection.
CLASSIFICATION ANALYSIS
• This analysis is used to retrieve important
and relevant information about data. It is
used to classify different data in different
classes.
• A classic example of classification analysis
would be our Outlook email. In Outlook,
they use certain algorithms to characterize
an email as legitimate or spam.
REGRESSION ANALYSIS
• In statistical terms, a regression analysis is
the process of identifying and analyzing
the relationship among variables.
• It can help you understand the
characteristic value of the dependent
variable changes, if any one of the
independent variables is varied.
• It is generally used for prediction and
forecasting.
Real life applications
• Service providers
• Retail
• E-commerce
• Crime agencies
Thank You

More Related Content

PPTX
Data reduction
PPTX
Exploratory data analysis with Python
PPTX
What is Data analytics and it's importance ?
PDF
Data preprocessing
PPTX
Data Mining: Key definitions
PPTX
Introduction to data pre-processing and cleaning
DOCX
Exam Short Preparation on Data Analytics
PPTX
DM
Data reduction
Exploratory data analysis with Python
What is Data analytics and it's importance ?
Data preprocessing
Data Mining: Key definitions
Introduction to data pre-processing and cleaning
Exam Short Preparation on Data Analytics
DM

What's hot (16)

PDF
Statistics for data scientists
PDF
Vlaeminck iassist 2013
PPTX
MS Sql Server: Introduction To Datamining Suing Sql Server
PPTX
Data Mining: Classification and analysis
DOCX
RESEARCH DIRECTIONS FOR ENGINEERING BIG DATA ANALYTICS SOFTWARE
PPTX
02 Related Concepts
DOCX
Statistics in real life engineering
PPTX
Data analysis
PPT
File 498 Doc 4 01 Dm Intro To Dm
PPTX
Pre processing
PDF
Data Analyst Roles & Responsibilities | Edureka
PPTX
Data Mining: Applying data mining
PPTX
03 Data Mining Techniques
PPTX
Data mining
PPT
What is Graph Database
PDF
Data analysis
Statistics for data scientists
Vlaeminck iassist 2013
MS Sql Server: Introduction To Datamining Suing Sql Server
Data Mining: Classification and analysis
RESEARCH DIRECTIONS FOR ENGINEERING BIG DATA ANALYTICS SOFTWARE
02 Related Concepts
Statistics in real life engineering
Data analysis
File 498 Doc 4 01 Dm Intro To Dm
Pre processing
Data Analyst Roles & Responsibilities | Edureka
Data Mining: Applying data mining
03 Data Mining Techniques
Data mining
What is Graph Database
Data analysis
Ad

Similar to Data Mining Technniques (20)

PPTX
DWDM_UNIT4.pptx ddddddddddddddddddddddddddddd
PDF
Introduction to Artificial Intelligence_ Lec 4
PPT
Unit 3 part ii Data mining
PPTX
DM_Notes.pptx
PPTX
PPTX
CLUSTER ANALYSIS.pptx
PPTX
INTRODUCTION to datawarehouse IN DATA.pptx
PPTX
01 Introduction to Data Mining
PPTX
Data warehouse
PPTX
Data Mining Presentation.pptx
PPTX
unit 1.pptx
PDF
EXPLORING DATA MINING TECHNIQUES AND ITS APPLICATIONS
PDF
EXPLORING DATA MINING TECHNIQUES AND ITS APPLICATIONS
PPTX
Data Science presentation for explanation of numpy and pandas
PPTX
computational statistics machine learning unit 5.pptx
PPTX
2-Concept Hierarchy to Classification of DMS.pptx
PPTX
Hetrogeneous Data handling in Big Data Analysis
PPTX
data science, prior knowledge ,modeling, scatter plot
PDF
Introduction to Data Analysis for researcher.pdf
DWDM_UNIT4.pptx ddddddddddddddddddddddddddddd
Introduction to Artificial Intelligence_ Lec 4
Unit 3 part ii Data mining
DM_Notes.pptx
CLUSTER ANALYSIS.pptx
INTRODUCTION to datawarehouse IN DATA.pptx
01 Introduction to Data Mining
Data warehouse
Data Mining Presentation.pptx
unit 1.pptx
EXPLORING DATA MINING TECHNIQUES AND ITS APPLICATIONS
EXPLORING DATA MINING TECHNIQUES AND ITS APPLICATIONS
Data Science presentation for explanation of numpy and pandas
computational statistics machine learning unit 5.pptx
2-Concept Hierarchy to Classification of DMS.pptx
Hetrogeneous Data handling in Big Data Analysis
data science, prior knowledge ,modeling, scatter plot
Introduction to Data Analysis for researcher.pdf
Ad

More from Livares Technologies Pvt Ltd (20)

PPTX
Web Performance Optimization
PPTX
Supervised Machine Learning
PPTX
Software Architecture Design
PPTX
Automation using Appium
PPTX
Bubble(No code Tool)
PPTX
Unsupervised Machine Learning
PPTX
Developing Secure Apps
PPTX
Micro-Frontend Architecture
PPTX
PPTX
Introduction to Angular JS
PPTX
An Insight into Quantum Computing
PPTX
Just in Time (JIT)
PPTX
Introduction to Bitcoin
PPTX
Facade Design Pattern
PPTX
Manual Vs Automation Testing
PPT
Screenless display
PPTX
Database Overview
PPTX
An Introduction to Machine Learning
PPTX
An Introduction to Face Detection
PDF
Smart water meter solutions using LoRa WAN - Troncart
Web Performance Optimization
Supervised Machine Learning
Software Architecture Design
Automation using Appium
Bubble(No code Tool)
Unsupervised Machine Learning
Developing Secure Apps
Micro-Frontend Architecture
Introduction to Angular JS
An Insight into Quantum Computing
Just in Time (JIT)
Introduction to Bitcoin
Facade Design Pattern
Manual Vs Automation Testing
Screenless display
Database Overview
An Introduction to Machine Learning
An Introduction to Face Detection
Smart water meter solutions using LoRa WAN - Troncart

Recently uploaded (20)

PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PDF
Network Security Unit 5.pdf for BCA BBA.
PPTX
Programs and apps: productivity, graphics, security and other tools
PPT
Teaching material agriculture food technology
PPTX
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
PDF
Encapsulation theory and applications.pdf
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PDF
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
PDF
MIND Revenue Release Quarter 2 2025 Press Release
PDF
cuic standard and advanced reporting.pdf
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
DOCX
The AUB Centre for AI in Media Proposal.docx
PPTX
sap open course for s4hana steps from ECC to s4
PDF
Machine learning based COVID-19 study performance prediction
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PPTX
Cloud computing and distributed systems.
Per capita expenditure prediction using model stacking based on satellite ima...
Understanding_Digital_Forensics_Presentation.pptx
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
20250228 LYD VKU AI Blended-Learning.pptx
Network Security Unit 5.pdf for BCA BBA.
Programs and apps: productivity, graphics, security and other tools
Teaching material agriculture food technology
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
Encapsulation theory and applications.pdf
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
Chapter 3 Spatial Domain Image Processing.pdf
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
MIND Revenue Release Quarter 2 2025 Press Release
cuic standard and advanced reporting.pdf
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
The AUB Centre for AI in Media Proposal.docx
sap open course for s4hana steps from ECC to s4
Machine learning based COVID-19 study performance prediction
Dropbox Q2 2025 Financial Results & Investor Presentation
Cloud computing and distributed systems.

Data Mining Technniques

  • 2. Data Warehousing • A data warehouse is a collection of databases that work together. A data warehouse makes it possible to integrate data from multiple databases, which can give new insights into the data. • The ultimate goal of a database is not just to store data, but to help businesses make decisions based on that data.
  • 3. What is data mining? • Queries based on SQL, a database programming language, are used to answer basic questions about data. • But, as the collection of data grows in a database, the amount of data can easily become overwhelming. • Data mining is the process of analyzing data and summarizing it to produce useful information
  • 4. Data sets • The complete set of data available to us for an application is called a dataset. A dataset is often depicted as a table, with each row representing an instance. Each column contains the value of one of the variables (attributes) for each of the instances.
  • 5. Goal of Data Mining • The overall goal of the data mining process is to extract information from a data set and transform it into an understandable structure for further use. • Data mining is the computing process of discovering patterns in large data sets involving methods. • These patterns can be used for predictive analytics.
  • 6. Techniques used for mining • CLASSIFICATION ANALYSIS • ASSOCIATION RULE LEARNING • ANOMALY OR OUTLIER DETECTION • CLUSTERING ANALYSIS • REGRESSION ANALYSIS
  • 7. CLUSTERING ANALYSIS • Clustering analysis is the process of discovering groups and clusters in the data in such a way that the degree of association between two objects is highest if they belong to the same group and lowest otherwise. • The cluster is actually a collection of data objects.
  • 8. ASSOCIATION RULE LEARNING • It refers to the method that can help you identify some interesting relations between different variables in large databases. • Association rules are useful for examining and forecasting customer behavior. It is highly recommended in the retail industry analysis.
  • 9. ANOMALY OR OUTLIER DETECTION • This refers to the observation for data items in a dataset that do not match an expected pattern or an expected behavior. • This technique can be used in a variety of domains, such as intrusion detection.
  • 10. CLASSIFICATION ANALYSIS • This analysis is used to retrieve important and relevant information about data. It is used to classify different data in different classes. • A classic example of classification analysis would be our Outlook email. In Outlook, they use certain algorithms to characterize an email as legitimate or spam.
  • 11. REGRESSION ANALYSIS • In statistical terms, a regression analysis is the process of identifying and analyzing the relationship among variables. • It can help you understand the characteristic value of the dependent variable changes, if any one of the independent variables is varied. • It is generally used for prediction and forecasting.
  • 12. Real life applications • Service providers • Retail • E-commerce • Crime agencies