SlideShare a Scribd company logo
Data Mining
Muhammad Farhan Arif 112
Muhammad Arslan Touqeer 117
Ahmad Yaqoob 128
Muhammad Umer Mehboob 131
Data mining
• Data mining is the process of discovering interesting
patterns (or knowledge) from large amounts of data.
• Data mining is also called knowledge discovery and data
mining (KDD)
• Wikipedia definition: “Data mining is the entire process
of applying computer-based methodology, including new
techniques for knowledge discovery, from data.”
– Process of semi-automatically analyzing large databases
to find patterns that are sources valid, novel, potentially
useful, understandable
–Source:
• Databases (most obvious)
• Text Documents
• Computer Simulations (web)
• Social Networks
• Image.
Data Mining Tasks
1_Classification
2_Regression
3_Deviation detection
4_Clustring
5_ Association Rule Discovery
6_Sequential Pattern Discovery
• Components/Functionalities:
There are two main components:
• Knowledge Discovery
Concrete information gleaned from known data. Data you
may not have known, but which is supported by recorded
facts. Also known as descriptive
Knowledge Prediction
Uses known data to forecast future trends, events, etc.
Also known as predictive.(ie: Stock market predictions)
Classification :
In this we do able to identify or predict what class we are talking about.
e.G in business we reduce the cost of mailing by electing and sending mail to those consumers which
are likely to purchase those products.
Regressions:
Predict the values of the given continuous variable based on the values of variable , by
assuming the given linear or nonlinear values.
e.g predicting the predict the financial status of company by data mining of last few years
Deviation Detection:
Detect the significant deviation from the normal behavior. E.g credit card fraud detection.
12
Clustering
Given a set of data points , each set has attribute and similarity b/w them. The fin d
cluster that , data points in one cluster are more similar to each other and in separate
cluster are less similar to one another
Association rule :
When we are given the set of records and by association , it makes the proper associated items , grouped each other.
E.gmemory card and mobile.
Sequential Pattered Discovery :
Set of objects, associated with its own timeline of events then finding the rules that strongly predict the strong
Sequential dependencies . E.g
(intro to c++)(c++ premier)---TCl, TCK
Architecture of Data Mining
DATA CLEANING
DATA INTRGRATION
DATA SELECTION
DATA TRANSFORMATION
DATA MINING
PATTEREN EVALUATION
KNOWLEDGE REPRESENTAION
• Rapid computerization of businesses produce huge amount of data
• How to make best use of data?
• A growing realization: knowledge discovered from data can be used for competitive advantage.
• Make use of your data assets
• There is a big gap from stored data to knowledge; and the transition won’t occur automatically.
• Many interesting things you want to find cannot be found using database queries
“find me people likely to buy my products”
“Who are likely to respond to my promotion”
Uses:
• Business Strategies•
Market Basket Analysis
Identify customer demographics, preferences, and
purchasing patterns.
• AI/Machine Learning•
Combinatorial/Game Data Mining
Good for analyzing winning strategies to games, and thus
developing intelligent AI opponents.
• Risk Analysis•
Product Defect Analysis
Analyze product defect rates for given plants and predict
possible complications
• Scientific Analysis:
• User Behavior Validation
Fraud Detection
In the realm of cell phones
Comparing phone activity to calling records. Can help detect
calls made on cloned phones.
Similarly, with credit cards, comparing purchases with
historical purchases. Can detect activity with stolen cards.
•
Extra-Terrestrial Intelligence
Scanning Satellite receptions for possible transmissions from
other planets.
Uses
When/Why we do data mining
• The data is abundant.
• The data is being warehoused.
• The computing power is affordable.
• The competitive pressure is strong.
• Data mining tools have become available
+
_
camera Display Battery weight durable
Mobile 1 vs Mobile 2
M1
M2
Warning :: Prevalence of Data Mining
• Your data is already being mined, whether you like it or not.
• Many web services require that you allow access to your information [for
data mining] in order to use the service.
• Google mines email data in Gmail accounts to present account owners
with ads.
• Facebook requires users to allow access to info from non-Facebook pages.
Facebook privacy policy:
"We may use information about you that we collect from other sources,
including but not limited to newspapers and Internet sources such as
blogs, instant messaging services and other users of Facebook, to
supplement your profile.

More Related Content

PPTX
Data mining
PPTX
Data mining introduction
PPTX
Data mining services
PPT
Data mining
PPTX
Data Mining
PDF
Data Mining: Future Trends and Applications
PDF
Introduction to Data Mining
Data mining
Data mining introduction
Data mining services
Data mining
Data Mining
Data Mining: Future Trends and Applications
Introduction to Data Mining

What's hot (20)

PPT
data mining
PPTX
Data Mining
PPT
Introduction-to-Knowledge Discovery in Database
PPTX
Data mining
PPT
Data mining
PDF
Data mining
PPTX
9 Data Mining Challenges From Data Scientists Like You
PPTX
Data mining
PPT
Data mining by_ashok
PPT
Data mining in agriculture
PPTX
Internet of Things: Lightning Round, Hite
PPTX
Data mining
PPTX
Data Mining: Classification and analysis
PPTX
Data Mining: Applying data mining
PPTX
Data mining
PDF
Data Mining Techniques
PPTX
Data mining and knowledge discovery
PPT
Data mining
PPT
Big Data
PPTX
Data mining and its applications!
data mining
Data Mining
Introduction-to-Knowledge Discovery in Database
Data mining
Data mining
Data mining
9 Data Mining Challenges From Data Scientists Like You
Data mining
Data mining by_ashok
Data mining in agriculture
Internet of Things: Lightning Round, Hite
Data mining
Data Mining: Classification and analysis
Data Mining: Applying data mining
Data mining
Data Mining Techniques
Data mining and knowledge discovery
Data mining
Big Data
Data mining and its applications!
Ad

Similar to Data Mining in Operating System (20)

PPT
Data Mining- Unit-I PPT (1).ppt
DOCX
Data Warehose and Data Mining Unit II.docx
PPTX
Introduction To Data Mining and Data Mining Techniques.pptx
PPTX
Yogesh Waghode Data-Mining-ppt seminar report
PPTX
Lecturedsfndskfjdsklfjldsdsfdsgmjdflgmdflmg.pptx
PDF
Lect 1 introduction
PPT
Dma unit 1
PPT
Introduction to Data Mining
PDF
Database
PDF
Data mining and Machine learning expained in jargon free & lucid language
PPTX
Lect 1 introduction
PPTX
lec01-IntroductionToDataMining.pptx
PDF
DM-Unit-1-Part 1-R.pdf
PPTX
Data mining Basics and complete description onword
PDF
Module-1-IntroductionToDataMining (Data Mining)
PDF
Data Mining and Big Data Challenges and Research Opportunities
PPTX
BAS 250 Lecture 1
DOCX
Seminar Report Vaibhav
Data Mining- Unit-I PPT (1).ppt
Data Warehose and Data Mining Unit II.docx
Introduction To Data Mining and Data Mining Techniques.pptx
Yogesh Waghode Data-Mining-ppt seminar report
Lecturedsfndskfjdsklfjldsdsfdsgmjdflgmdflmg.pptx
Lect 1 introduction
Dma unit 1
Introduction to Data Mining
Database
Data mining and Machine learning expained in jargon free & lucid language
Lect 1 introduction
lec01-IntroductionToDataMining.pptx
DM-Unit-1-Part 1-R.pdf
Data mining Basics and complete description onword
Module-1-IntroductionToDataMining (Data Mining)
Data Mining and Big Data Challenges and Research Opportunities
BAS 250 Lecture 1
Seminar Report Vaibhav
Ad

More from ITz_1 (8)

PPTX
Software designm complexity
PPTX
Linux operating system
PPTX
Embedded Software
PPT
PCI
PPTX
5 major social institutions
PPT
Java script programs
PPT
Java script
PPTX
Class selectors
Software designm complexity
Linux operating system
Embedded Software
PCI
5 major social institutions
Java script programs
Java script
Class selectors

Recently uploaded (20)

PDF
3rd Neelam Sanjeevareddy Memorial Lecture.pdf
PDF
Abdominal Access Techniques with Prof. Dr. R K Mishra
PDF
Complications of Minimal Access Surgery at WLH
PDF
Anesthesia in Laparoscopic Surgery in India
PPTX
Pharma ospi slides which help in ospi learning
PDF
2.FourierTransform-ShortQuestionswithAnswers.pdf
PPTX
Pharmacology of Heart Failure /Pharmacotherapy of CHF
PPTX
master seminar digital applications in india
PDF
RMMM.pdf make it easy to upload and study
PDF
TR - Agricultural Crops Production NC III.pdf
PDF
Module 4: Burden of Disease Tutorial Slides S2 2025
PPTX
Cell Structure & Organelles in detailed.
PDF
102 student loan defaulters named and shamed – Is someone you know on the list?
PDF
Insiders guide to clinical Medicine.pdf
PDF
Basic Mud Logging Guide for educational purpose
PPTX
Renaissance Architecture: A Journey from Faith to Humanism
PDF
Saundersa Comprehensive Review for the NCLEX-RN Examination.pdf
PPTX
IMMUNITY IMMUNITY refers to protection against infection, and the immune syst...
PPTX
PPH.pptx obstetrics and gynecology in nursing
PDF
Pre independence Education in Inndia.pdf
3rd Neelam Sanjeevareddy Memorial Lecture.pdf
Abdominal Access Techniques with Prof. Dr. R K Mishra
Complications of Minimal Access Surgery at WLH
Anesthesia in Laparoscopic Surgery in India
Pharma ospi slides which help in ospi learning
2.FourierTransform-ShortQuestionswithAnswers.pdf
Pharmacology of Heart Failure /Pharmacotherapy of CHF
master seminar digital applications in india
RMMM.pdf make it easy to upload and study
TR - Agricultural Crops Production NC III.pdf
Module 4: Burden of Disease Tutorial Slides S2 2025
Cell Structure & Organelles in detailed.
102 student loan defaulters named and shamed – Is someone you know on the list?
Insiders guide to clinical Medicine.pdf
Basic Mud Logging Guide for educational purpose
Renaissance Architecture: A Journey from Faith to Humanism
Saundersa Comprehensive Review for the NCLEX-RN Examination.pdf
IMMUNITY IMMUNITY refers to protection against infection, and the immune syst...
PPH.pptx obstetrics and gynecology in nursing
Pre independence Education in Inndia.pdf

Data Mining in Operating System

  • 1. Data Mining Muhammad Farhan Arif 112 Muhammad Arslan Touqeer 117 Ahmad Yaqoob 128 Muhammad Umer Mehboob 131
  • 2. Data mining • Data mining is the process of discovering interesting patterns (or knowledge) from large amounts of data. • Data mining is also called knowledge discovery and data mining (KDD) • Wikipedia definition: “Data mining is the entire process of applying computer-based methodology, including new techniques for knowledge discovery, from data.” – Process of semi-automatically analyzing large databases to find patterns that are sources valid, novel, potentially useful, understandable
  • 3. –Source: • Databases (most obvious) • Text Documents • Computer Simulations (web) • Social Networks • Image. Data Mining Tasks 1_Classification 2_Regression 3_Deviation detection 4_Clustring 5_ Association Rule Discovery 6_Sequential Pattern Discovery
  • 4. • Components/Functionalities: There are two main components: • Knowledge Discovery Concrete information gleaned from known data. Data you may not have known, but which is supported by recorded facts. Also known as descriptive Knowledge Prediction Uses known data to forecast future trends, events, etc. Also known as predictive.(ie: Stock market predictions)
  • 5. Classification : In this we do able to identify or predict what class we are talking about. e.G in business we reduce the cost of mailing by electing and sending mail to those consumers which are likely to purchase those products. Regressions: Predict the values of the given continuous variable based on the values of variable , by assuming the given linear or nonlinear values. e.g predicting the predict the financial status of company by data mining of last few years Deviation Detection: Detect the significant deviation from the normal behavior. E.g credit card fraud detection.
  • 6. 12 Clustering Given a set of data points , each set has attribute and similarity b/w them. The fin d cluster that , data points in one cluster are more similar to each other and in separate cluster are less similar to one another Association rule : When we are given the set of records and by association , it makes the proper associated items , grouped each other. E.gmemory card and mobile. Sequential Pattered Discovery : Set of objects, associated with its own timeline of events then finding the rules that strongly predict the strong Sequential dependencies . E.g (intro to c++)(c++ premier)---TCl, TCK
  • 7. Architecture of Data Mining DATA CLEANING DATA INTRGRATION DATA SELECTION DATA TRANSFORMATION DATA MINING PATTEREN EVALUATION KNOWLEDGE REPRESENTAION
  • 8. • Rapid computerization of businesses produce huge amount of data • How to make best use of data? • A growing realization: knowledge discovered from data can be used for competitive advantage. • Make use of your data assets • There is a big gap from stored data to knowledge; and the transition won’t occur automatically. • Many interesting things you want to find cannot be found using database queries “find me people likely to buy my products” “Who are likely to respond to my promotion”
  • 9. Uses: • Business Strategies• Market Basket Analysis Identify customer demographics, preferences, and purchasing patterns. • AI/Machine Learning• Combinatorial/Game Data Mining Good for analyzing winning strategies to games, and thus developing intelligent AI opponents. • Risk Analysis• Product Defect Analysis Analyze product defect rates for given plants and predict possible complications • Scientific Analysis:
  • 10. • User Behavior Validation Fraud Detection In the realm of cell phones Comparing phone activity to calling records. Can help detect calls made on cloned phones. Similarly, with credit cards, comparing purchases with historical purchases. Can detect activity with stolen cards. • Extra-Terrestrial Intelligence Scanning Satellite receptions for possible transmissions from other planets. Uses
  • 11. When/Why we do data mining • The data is abundant. • The data is being warehoused. • The computing power is affordable. • The competitive pressure is strong. • Data mining tools have become available
  • 12. + _ camera Display Battery weight durable Mobile 1 vs Mobile 2 M1 M2
  • 13. Warning :: Prevalence of Data Mining • Your data is already being mined, whether you like it or not. • Many web services require that you allow access to your information [for data mining] in order to use the service. • Google mines email data in Gmail accounts to present account owners with ads. • Facebook requires users to allow access to info from non-Facebook pages. Facebook privacy policy: "We may use information about you that we collect from other sources, including but not limited to newspapers and Internet sources such as blogs, instant messaging services and other users of Facebook, to supplement your profile.