SlideShare a Scribd company logo
What is Data Mining
E2MATRIX Research Lab
Complete Thesis & IEEE Project Help
E2matrix 1
E2matrix
Opp Phagaara Bus Stand,
Parmar Complex, Backside Axis Bank.
Phagwara, Punjab,
Call : +91 9041262727
E2matrix 2
Introduction Outline
• Define data mining
• Data mining vs. databases
• Basic data mining tasks
• Data mining development
• Data mining issues
Goal: Provide an overview of data mining.
E2matrix 3
Introduction
• Data is produced at a phenomenal rate
• Our ability to store has grown
• Users expect more sophisticated
information
• How?
UNCOVER HIDDEN INFORMATION
DATA MINING
E2matrix 4
Data Mining
• Objective: Fit data to a model
• Potential Result: Higher-level meta information that may
not be obvious when looking at raw data
• Similar terms
– Exploratory data analysis
– Data driven discovery
– Deductive learning
E2matrix 5
Data Mining Algorithm
• Objective: Fit Data to a Model
– Descriptive
– Predictive
• Preferential Questions
– Which technique to choose?
• ARM/Classification/Clustering
• Answer: Depends on what you want to do with data?
– Search Strategy – Technique to search the data
• Interface? Query Language?
• Efficiency
E2matrix 6
Database Processing vs. Data Mining
Processing
• Query
– Well defined
– SQL
• Query
– Poorly defined
– No precise query language
 Output
– Precise
– Subset of database
 Output
– Fuzzy
– Not a subset of database
E2matrix 7
Query Examples
• Database
• Data Mining
– Find all customers who have purchased milk
– Find all items which are frequently purchased
with milk. (association rules)
– Find all credit applicants with last name of Smith.
– Identify customers who have purchased more
than $10,000 in the last month.
– Find all credit applicants who are poor credit
risks. (classification)
– Identify customers with similar buying habits.
(Clustering)
E2matrix 8
Data Mining Models and Tasks
E2matrix 9
Basic Data Mining Tasks
• Classification maps data into predefined
groups or classes
– Supervised learning
– Pattern recognition
– Prediction
• Regression is used to map a data item to a
real valued prediction variable.
• Clustering groups similar data together into
clusters.
– Unsupervised learning
– Segmentation
– Partitioning

More Related Content

PPTX
data warehousing and data mining
PPTX
Big data storages
PPTX
Tatyana Matvienko,Senior Java Developer, Big data storages
PPTX
AzureDay - Introduction Big Data Analytics.
PPT
Data Mining Introduction
PPTX
Data mining nouman javed
PDF
Domain Semantics
PPTX
Data Mining
data warehousing and data mining
Big data storages
Tatyana Matvienko,Senior Java Developer, Big data storages
AzureDay - Introduction Big Data Analytics.
Data Mining Introduction
Data mining nouman javed
Domain Semantics
Data Mining

What's hot (18)

PPTX
Custom Data Search with Stormpath
PDF
Building Knowledge Graphs in 10 steps
PDF
Data Catalog in Denodo Platform 7.0: Creating a Data Marketplace with Data Vi...
PDF
SpeedTrack Tech Overview 2015
PDF
Ghhh
PPTX
Ringgold User Group Meeting 2016 (USA)
PPTX
Big data analytics - Introduction to Big Data and Hadoop
PPTX
Emerging Standards: Data and Data Exchange in Scholarly Publishing
PDF
Consumer Data Management
PDF
ComputableFacts: a Secure System to Store Documents and Graphs
PDF
Data Discovery & Trust through Metadata
PDF
Self-service consumption Data Catalog
PPTX
Small Data, Big Benefits - Christine Orr at SSP 2016
PDF
Data Strategy for a Scalable Practice
PPTX
Machine Learning in the Data Science Context
PDF
Somuvadali 180712051740
PPTX
Overview of Oracle Database 18c Express Edition (XE)
PPT
Big data analytics
Custom Data Search with Stormpath
Building Knowledge Graphs in 10 steps
Data Catalog in Denodo Platform 7.0: Creating a Data Marketplace with Data Vi...
SpeedTrack Tech Overview 2015
Ghhh
Ringgold User Group Meeting 2016 (USA)
Big data analytics - Introduction to Big Data and Hadoop
Emerging Standards: Data and Data Exchange in Scholarly Publishing
Consumer Data Management
ComputableFacts: a Secure System to Store Documents and Graphs
Data Discovery & Trust through Metadata
Self-service consumption Data Catalog
Small Data, Big Benefits - Christine Orr at SSP 2016
Data Strategy for a Scalable Practice
Machine Learning in the Data Science Context
Somuvadali 180712051740
Overview of Oracle Database 18c Express Edition (XE)
Big data analytics
Ad

Similar to Data Mining Techniques (20)

PPTX
what is data mining
PPTX
Data mining introduction
PPTX
01 Introduction to Data Mining
PDF
2 introductory slides
PPT
Data Mining- Unit-I PPT (1).ppt
PPT
Data mining final year project in jalandhar
PPT
Data mining final year project in ludhiana
PDF
Data Mining
PDF
Overview of Data Mining
PPT
`Data mining
PPT
PPT
PPTX
DWDM_UNIT4.pptx ddddddddddddddddddddddddddddd
PPT
Data Mining Xuequn Shang NorthWestern Polytechnical University
DOCX
Seminar Report Vaibhav
PPTX
lec01-IntroductionToDataMining.pptx
PPTX
Data mining , Knowledge Discovery Process, Classification
PDF
Data Warehousing and Suitable for BCA, BSC, MCA
PPT
6 weeks summer training in data mining,jalandhar
PPT
6 weeks summer training in data mining,ludhiana
what is data mining
Data mining introduction
01 Introduction to Data Mining
2 introductory slides
Data Mining- Unit-I PPT (1).ppt
Data mining final year project in jalandhar
Data mining final year project in ludhiana
Data Mining
Overview of Data Mining
`Data mining
DWDM_UNIT4.pptx ddddddddddddddddddddddddddddd
Data Mining Xuequn Shang NorthWestern Polytechnical University
Seminar Report Vaibhav
lec01-IntroductionToDataMining.pptx
Data mining , Knowledge Discovery Process, Classification
Data Warehousing and Suitable for BCA, BSC, MCA
6 weeks summer training in data mining,jalandhar
6 weeks summer training in data mining,ludhiana
Ad

More from E2MATRIX (20)

PPTX
Electrical Training in Phagwara
PPTX
Electrical Training in Mohali
PPTX
Electrical Training in Ludhiana
PPTX
Electrical Training in Jalandhar
PPTX
Electrical Training in Chandigarh
PPTX
Electrical Training in Amritsar
PPTX
Big Data Training in Amritsar
PPTX
Big Data Training in Mohali
PPTX
Big Data Training in Ludhiana
PDF
Machine Learning Training in Phagwara
PDF
Machine Learning Training in Ludhiana
PDF
Machine Learning Training in Amritsar
PDF
Machine Learning Training in Mohali
PDF
Machine Learning Training in Jalandhar
PDF
Machine Learning Training in Chandigarh
PPTX
Raspberry Pi training in Ludhiana
PPTX
Raspberry Pi Training in Phagwara
PPTX
Raspberry Pi Training in Mohali
PPTX
Raspberry Pi Training in Chandigarh
PPTX
Raspberry Pi Training in Amritsar
Electrical Training in Phagwara
Electrical Training in Mohali
Electrical Training in Ludhiana
Electrical Training in Jalandhar
Electrical Training in Chandigarh
Electrical Training in Amritsar
Big Data Training in Amritsar
Big Data Training in Mohali
Big Data Training in Ludhiana
Machine Learning Training in Phagwara
Machine Learning Training in Ludhiana
Machine Learning Training in Amritsar
Machine Learning Training in Mohali
Machine Learning Training in Jalandhar
Machine Learning Training in Chandigarh
Raspberry Pi training in Ludhiana
Raspberry Pi Training in Phagwara
Raspberry Pi Training in Mohali
Raspberry Pi Training in Chandigarh
Raspberry Pi Training in Amritsar

Recently uploaded (20)

PPTX
Cell Types and Its function , kingdom of life
PPTX
Digestion and Absorption of Carbohydrates, Proteina and Fats
PPTX
1st Inaugural Professorial Lecture held on 19th February 2020 (Governance and...
PDF
Empowerment Technology for Senior High School Guide
PDF
What if we spent less time fighting change, and more time building what’s rig...
PDF
Supply Chain Operations Speaking Notes -ICLT Program
PDF
Paper A Mock Exam 9_ Attempt review.pdf.
PDF
RMMM.pdf make it easy to upload and study
PDF
medical_surgical_nursing_10th_edition_ignatavicius_TEST_BANK_pdf.pdf
PDF
RTP_AR_KS1_Tutor's Guide_English [FOR REPRODUCTION].pdf
PPTX
UNIT III MENTAL HEALTH NURSING ASSESSMENT
PPTX
Orientation - ARALprogram of Deped to the Parents.pptx
PPTX
CHAPTER IV. MAN AND BIOSPHERE AND ITS TOTALITY.pptx
PDF
A systematic review of self-coping strategies used by university students to ...
PDF
advance database management system book.pdf
PDF
IGGE1 Understanding the Self1234567891011
PDF
Weekly quiz Compilation Jan -July 25.pdf
PDF
Black Hat USA 2025 - Micro ICS Summit - ICS/OT Threat Landscape
PPTX
Introduction-to-Literarature-and-Literary-Studies-week-Prelim-coverage.pptx
PDF
GENETICS IN BIOLOGY IN SECONDARY LEVEL FORM 3
Cell Types and Its function , kingdom of life
Digestion and Absorption of Carbohydrates, Proteina and Fats
1st Inaugural Professorial Lecture held on 19th February 2020 (Governance and...
Empowerment Technology for Senior High School Guide
What if we spent less time fighting change, and more time building what’s rig...
Supply Chain Operations Speaking Notes -ICLT Program
Paper A Mock Exam 9_ Attempt review.pdf.
RMMM.pdf make it easy to upload and study
medical_surgical_nursing_10th_edition_ignatavicius_TEST_BANK_pdf.pdf
RTP_AR_KS1_Tutor's Guide_English [FOR REPRODUCTION].pdf
UNIT III MENTAL HEALTH NURSING ASSESSMENT
Orientation - ARALprogram of Deped to the Parents.pptx
CHAPTER IV. MAN AND BIOSPHERE AND ITS TOTALITY.pptx
A systematic review of self-coping strategies used by university students to ...
advance database management system book.pdf
IGGE1 Understanding the Self1234567891011
Weekly quiz Compilation Jan -July 25.pdf
Black Hat USA 2025 - Micro ICS Summit - ICS/OT Threat Landscape
Introduction-to-Literarature-and-Literary-Studies-week-Prelim-coverage.pptx
GENETICS IN BIOLOGY IN SECONDARY LEVEL FORM 3

Data Mining Techniques

  • 1. What is Data Mining E2MATRIX Research Lab Complete Thesis & IEEE Project Help E2matrix 1 E2matrix Opp Phagaara Bus Stand, Parmar Complex, Backside Axis Bank. Phagwara, Punjab, Call : +91 9041262727
  • 2. E2matrix 2 Introduction Outline • Define data mining • Data mining vs. databases • Basic data mining tasks • Data mining development • Data mining issues Goal: Provide an overview of data mining.
  • 3. E2matrix 3 Introduction • Data is produced at a phenomenal rate • Our ability to store has grown • Users expect more sophisticated information • How? UNCOVER HIDDEN INFORMATION DATA MINING
  • 4. E2matrix 4 Data Mining • Objective: Fit data to a model • Potential Result: Higher-level meta information that may not be obvious when looking at raw data • Similar terms – Exploratory data analysis – Data driven discovery – Deductive learning
  • 5. E2matrix 5 Data Mining Algorithm • Objective: Fit Data to a Model – Descriptive – Predictive • Preferential Questions – Which technique to choose? • ARM/Classification/Clustering • Answer: Depends on what you want to do with data? – Search Strategy – Technique to search the data • Interface? Query Language? • Efficiency
  • 6. E2matrix 6 Database Processing vs. Data Mining Processing • Query – Well defined – SQL • Query – Poorly defined – No precise query language  Output – Precise – Subset of database  Output – Fuzzy – Not a subset of database
  • 7. E2matrix 7 Query Examples • Database • Data Mining – Find all customers who have purchased milk – Find all items which are frequently purchased with milk. (association rules) – Find all credit applicants with last name of Smith. – Identify customers who have purchased more than $10,000 in the last month. – Find all credit applicants who are poor credit risks. (classification) – Identify customers with similar buying habits. (Clustering)
  • 8. E2matrix 8 Data Mining Models and Tasks
  • 9. E2matrix 9 Basic Data Mining Tasks • Classification maps data into predefined groups or classes – Supervised learning – Pattern recognition – Prediction • Regression is used to map a data item to a real valued prediction variable. • Clustering groups similar data together into clusters. – Unsupervised learning – Segmentation – Partitioning