SlideShare a Scribd company logo
Knowledge Discovery
DefinitionDefinition – “Non-trivial extraction of implicit, previously unknown and potentially useful information from data.”Data Mining – Responsible for detecting patterns from the pre-processed (prepared) data. It is only a part of Knowledge discovery process.
ApplicationsCan be divided into four major kinds:ClassificationNumerical predictionAssociationClusteringSome examples:Automatic abstractionFinancial forecastingTargeted marketingMedical diagnosisCredit card fraud detectionWeather forecasting etc.
Labeled & Unlabeled dataGeneral Terminology: Instances – Dataset of examplesAttributes – Variables in an instanceLabeled dataSpecific attribute whose value in some instances can be used to predict its value in unknown instancesUnlabeled dataNo such specific attribute that can be used to predict the value in unknown instances. Supervised learning – Data mining using labeled dataUnsupervised learning – Data mining using unlabeled data
Labeled dataAttributes can be of two types:Categorical attribute Takes a value from only a fixed set of values (like an enumeration) eg. ‘very good’, ‘good’, ‘poor’Supervised learning is called ClassificationNumerical attributeCan take a value from a continuous range of numerical valuesSupervised learning is called Regression
Unlabeled dataIt doesn’t have any specifically designated attributeUnsupervised learning Data mining using unlabeled dataPurpose - To extract as much as it is possible from the data available.
Supervised learning: ClassificationIt is based on the following three methods:Nearest neighbor matching: Identifying the classified instances that are closest (in some sense) to the unclassified oneClassification rules:Look for rules that can be used to predict the classification of an unknown instanceClassification tree:Generation of classification rules via the tree-like structure
Supervised Learning: Numerical PredictionRegression is done by using Neural NetworksNeural Network: Given a set of inputs to predict one or more outputs
Unsupervised Learning: Association RulesAssociation Rules: To find any relationship that exists amongst the values of variables within a training setExample: IF variable_1>90 and switch_6 = openTHEN variable_3 < 47.5 and switch_9 = closed(probability = 0.8)
Unsupervised Learning: ClusteringTo find groups of items that are similarExample: A company may group its customers based on income to target its policies etc.
Visit more self help tutorialsPick a tutorial of your choice and browse through it at your own pace.The tutorials section is free, self-guiding and will not involve any additional support.Visit us at www.dataminingtools.net

More Related Content

DOCX
Basic Qualitative Analysis for Extension Program Evaluation
PPTX
Classification
PPT
Dma unit 1
PPTX
Presentation_Malware Analysis.pptx
PDF
Oracle ML Cheat Sheet
PPTX
Qualitative Data Analysis
PPTX
Machine learning algorithms for data mining
PPTX
Decision tree induction
Basic Qualitative Analysis for Extension Program Evaluation
Classification
Dma unit 1
Presentation_Malware Analysis.pptx
Oracle ML Cheat Sheet
Qualitative Data Analysis
Machine learning algorithms for data mining
Decision tree induction

What's hot (18)

PPTX
Supervised learning
PPTX
Outlier analysis,Chapter-12, Data Mining: Concepts and Techniques
PPT
4.1 sequentioal search
PDF
Survey on Various Classification Techniques in Data Mining
PPTX
The 8 Step Data Mining Process
PPT
Data Analysis
PDF
Data mining chapter04and5-best
PPTX
Multivariate data analysis
PPTX
Ia for fintech
PPTX
Supervised learning and Unsupervised learning
PPTX
Anomaly Detection Technique
PPTX
Analysis Of Attribute Revelance
PPTX
04 Classification in Data Mining
PPTX
Outlier analysis and anomaly detection
PPTX
Data mining
PPT
Chapter 12 outlier
Supervised learning
Outlier analysis,Chapter-12, Data Mining: Concepts and Techniques
4.1 sequentioal search
Survey on Various Classification Techniques in Data Mining
The 8 Step Data Mining Process
Data Analysis
Data mining chapter04and5-best
Multivariate data analysis
Ia for fintech
Supervised learning and Unsupervised learning
Anomaly Detection Technique
Analysis Of Attribute Revelance
04 Classification in Data Mining
Outlier analysis and anomaly detection
Data mining
Chapter 12 outlier
Ad

Viewers also liked (20)

PPTX
Pentaho: Reporting Solution Development
PPTX
R Environment
PPTX
SPSS: Data Editor
PPTX
LISP: Declarations In Lisp
PPTX
XL-Miner: Timeseries
PPTX
Data Mining The Sky
PPTX
Data-Applied: Technology Insights
PPTX
Bernoullis Random Variables And Binomial Distribution
PPTX
Data Applied: Developer Quicklook
PPT
Wisconsin Fertility Institute: Injection Class 2011
PPTX
Txomin Hartz Txikia
PDF
Apresentação Red Advisers
PPTX
RapidMiner: Advanced Processes And Operators
PPTX
Communicating simply
PPT
PresentacióN De Quimica
PPTX
Matlab Importing Data
PPTX
RapidMiner: Setting Up A Process
PDF
Norihicodanch
PPTX
Classification Continued
Pentaho: Reporting Solution Development
R Environment
SPSS: Data Editor
LISP: Declarations In Lisp
XL-Miner: Timeseries
Data Mining The Sky
Data-Applied: Technology Insights
Bernoullis Random Variables And Binomial Distribution
Data Applied: Developer Quicklook
Wisconsin Fertility Institute: Injection Class 2011
Txomin Hartz Txikia
Apresentação Red Advisers
RapidMiner: Advanced Processes And Operators
Communicating simply
PresentacióN De Quimica
Matlab Importing Data
RapidMiner: Setting Up A Process
Norihicodanch
Classification Continued
Ad

Similar to Knowledge Discovery (20)

PPT
Data mining
PPTX
Lecturer3 by RamaKrishna SRU waranagal telanga
PDF
TTG Int.LTD Data Mining Technique
PPT
Data mining
PPT
Introduction-to-Knowledge Discovery in Database
DOC
Presentation on Machine Learning and Data Mining
PPTX
IntroToDataMining : Key Components and process
PPTX
Supervised and Unsupervised Learning.pptx
PPT
Talk
PDF
Data mining
PPTX
Machine Learning.pptx
PPT
Introduction to Data Mining
PPTX
01-data mining-introduction-bayero-u.pptx
PPTX
Data mining concepts and work
PPTX
3510-6510_Ch4.pptx
PDF
A SURVEY ON DATA MINING IN STEEL INDUSTRIES
PPTX
Data mining an introduction
PPTX
DWDM_UNIT4.pptx ddddddddddddddddddddddddddddd
PPT
Data Mining-2023 (2).ppt
PPTX
Data mining , Knowledge Discovery Process, Classification
Data mining
Lecturer3 by RamaKrishna SRU waranagal telanga
TTG Int.LTD Data Mining Technique
Data mining
Introduction-to-Knowledge Discovery in Database
Presentation on Machine Learning and Data Mining
IntroToDataMining : Key Components and process
Supervised and Unsupervised Learning.pptx
Talk
Data mining
Machine Learning.pptx
Introduction to Data Mining
01-data mining-introduction-bayero-u.pptx
Data mining concepts and work
3510-6510_Ch4.pptx
A SURVEY ON DATA MINING IN STEEL INDUSTRIES
Data mining an introduction
DWDM_UNIT4.pptx ddddddddddddddddddddddddddddd
Data Mining-2023 (2).ppt
Data mining , Knowledge Discovery Process, Classification

More from DataminingTools Inc (20)

PPTX
Terminology Machine Learning
PPTX
Techniques Machine Learning
PPTX
Machine learning Introduction
PPTX
Areas of machine leanring
PPTX
AI: Planning and AI
PPTX
AI: Logic in AI 2
PPTX
AI: Logic in AI
PPTX
AI: Learning in AI 2
PPTX
AI: Learning in AI
PPTX
AI: Introduction to artificial intelligence
PPTX
AI: Belief Networks
PPTX
AI: AI & Searching
PPTX
AI: AI & Problem Solving
PPTX
Data Mining: Text and web mining
PPTX
Data Mining: Outlier analysis
PPTX
Data Mining: Mining stream time series and sequence data
PPTX
Data Mining: Mining ,associations, and correlations
PPTX
Data Mining: Graph mining and social network analysis
PPTX
Data warehouse and olap technology
PPTX
Data Mining: Data processing
Terminology Machine Learning
Techniques Machine Learning
Machine learning Introduction
Areas of machine leanring
AI: Planning and AI
AI: Logic in AI 2
AI: Logic in AI
AI: Learning in AI 2
AI: Learning in AI
AI: Introduction to artificial intelligence
AI: Belief Networks
AI: AI & Searching
AI: AI & Problem Solving
Data Mining: Text and web mining
Data Mining: Outlier analysis
Data Mining: Mining stream time series and sequence data
Data Mining: Mining ,associations, and correlations
Data Mining: Graph mining and social network analysis
Data warehouse and olap technology
Data Mining: Data processing

Recently uploaded (20)

PDF
cuic standard and advanced reporting.pdf
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PPTX
Big Data Technologies - Introduction.pptx
PDF
CIFDAQ's Market Insight: SEC Turns Pro Crypto
PDF
Spectral efficient network and resource selection model in 5G networks
PDF
NewMind AI Monthly Chronicles - July 2025
PPTX
MYSQL Presentation for SQL database connectivity
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
PPTX
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
PDF
Network Security Unit 5.pdf for BCA BBA.
PDF
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
Empathic Computing: Creating Shared Understanding
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PDF
Modernizing your data center with Dell and AMD
PDF
Bridging biosciences and deep learning for revolutionary discoveries: a compr...
PPTX
Cloud computing and distributed systems.
PPTX
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication
cuic standard and advanced reporting.pdf
Dropbox Q2 2025 Financial Results & Investor Presentation
Big Data Technologies - Introduction.pptx
CIFDAQ's Market Insight: SEC Turns Pro Crypto
Spectral efficient network and resource selection model in 5G networks
NewMind AI Monthly Chronicles - July 2025
MYSQL Presentation for SQL database connectivity
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
Network Security Unit 5.pdf for BCA BBA.
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...
The Rise and Fall of 3GPP – Time for a Sabbatical?
Digital-Transformation-Roadmap-for-Companies.pptx
Empathic Computing: Creating Shared Understanding
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
Modernizing your data center with Dell and AMD
Bridging biosciences and deep learning for revolutionary discoveries: a compr...
Cloud computing and distributed systems.
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication

Knowledge Discovery

  • 2. DefinitionDefinition – “Non-trivial extraction of implicit, previously unknown and potentially useful information from data.”Data Mining – Responsible for detecting patterns from the pre-processed (prepared) data. It is only a part of Knowledge discovery process.
  • 3. ApplicationsCan be divided into four major kinds:ClassificationNumerical predictionAssociationClusteringSome examples:Automatic abstractionFinancial forecastingTargeted marketingMedical diagnosisCredit card fraud detectionWeather forecasting etc.
  • 4. Labeled & Unlabeled dataGeneral Terminology: Instances – Dataset of examplesAttributes – Variables in an instanceLabeled dataSpecific attribute whose value in some instances can be used to predict its value in unknown instancesUnlabeled dataNo such specific attribute that can be used to predict the value in unknown instances. Supervised learning – Data mining using labeled dataUnsupervised learning – Data mining using unlabeled data
  • 5. Labeled dataAttributes can be of two types:Categorical attribute Takes a value from only a fixed set of values (like an enumeration) eg. ‘very good’, ‘good’, ‘poor’Supervised learning is called ClassificationNumerical attributeCan take a value from a continuous range of numerical valuesSupervised learning is called Regression
  • 6. Unlabeled dataIt doesn’t have any specifically designated attributeUnsupervised learning Data mining using unlabeled dataPurpose - To extract as much as it is possible from the data available.
  • 7. Supervised learning: ClassificationIt is based on the following three methods:Nearest neighbor matching: Identifying the classified instances that are closest (in some sense) to the unclassified oneClassification rules:Look for rules that can be used to predict the classification of an unknown instanceClassification tree:Generation of classification rules via the tree-like structure
  • 8. Supervised Learning: Numerical PredictionRegression is done by using Neural NetworksNeural Network: Given a set of inputs to predict one or more outputs
  • 9. Unsupervised Learning: Association RulesAssociation Rules: To find any relationship that exists amongst the values of variables within a training setExample: IF variable_1>90 and switch_6 = openTHEN variable_3 < 47.5 and switch_9 = closed(probability = 0.8)
  • 10. Unsupervised Learning: ClusteringTo find groups of items that are similarExample: A company may group its customers based on income to target its policies etc.
  • 11. Visit more self help tutorialsPick a tutorial of your choice and browse through it at your own pace.The tutorials section is free, self-guiding and will not involve any additional support.Visit us at www.dataminingtools.net