Group 7
What is Data Mining?



  The mining, or discovery, of new information in the form of patterns or rules from vast amounts of data.



The process of discovering meaningful new correlations, patterns, and trends by sifting
through large amounts of data stored in repositories, using pattern recognition
technologies as well as statistical and mathematical techniques.
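
To make "patterns or rules" concrete, here is a minimal sketch of how one association rule could be scored. The slides name no specific technique; support and confidence are common rule measures used here as an illustration, and the transactions and function names are hypothetical.

```python
# Hypothetical market-basket data; each set is one transaction.
transactions = [
    {"bread", "milk"},
    {"bread", "butter", "milk"},
    {"bread", "butter"},
    {"milk", "eggs"},
    {"bread", "milk", "eggs"},
]

def support(itemset, transactions):
    """Fraction of transactions that contain every item in itemset."""
    hits = sum(1 for t in transactions if itemset <= t)
    return hits / len(transactions)

def confidence(antecedent, consequent, transactions):
    """support(antecedent and consequent together) / support(antecedent)."""
    both = support(antecedent | consequent, transactions)
    return both / support(antecedent, transactions)

# Score the candidate rule "bread -> milk".
rule_support = support({"bread", "milk"}, transactions)
rule_confidence = confidence({"bread"}, {"milk"}, transactions)
print(f"support={rule_support:.2f} confidence={rule_confidence:.2f}")
```

A rule with high support and high confidence is a candidate pattern; whether it is interesting is a separate question.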
Why do we mine data?




  Commercial viewpoint:
  • Large volumes of data are being collected and warehoused.
  • Computers have become cheaper and more powerful.
  • Competitive pressure is strong.


  Scientific viewpoint:
  • Data is collected and stored at enormous speeds (GB/hour).
  • Traditional techniques are infeasible for raw data at this scale.
  • Data mining may help scientists analyze it.
On what kinds of data?



          •   Relational databases
          •   Data warehouses
          •   Transactional databases
          •   Advanced database systems:
                   Object-relational
                   Spatial and temporal
                   Time-series
                   Multimedia, text
                   WWW
What are the goals of data mining?



    • Prediction, e.g. sales volume, earthquakes
    • Identification, e.g. existence of genes, system intrusions
    • Classification into categories, e.g. discount-seeking shoppers versus
      loyal regular shoppers in a supermarket
    • Optimization of limited resources such as time, space, money, or
      materials, and maximization of outputs such as sales or profits
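
As one illustration of the prediction goal above, the sketch below fits a straight line by ordinary least squares and uses it to forecast a sales figure. The data, the variable names, and the choice of a linear model are assumptions for the example; the slides do not prescribe a method.

```python
def fit_line(xs, ys):
    """Ordinary least squares for y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept

# Hypothetical history: advertising spend vs. sales volume.
ad_spend = [1.0, 2.0, 3.0, 4.0]
sales = [3.0, 5.0, 7.0, 9.0]

slope, intercept = fit_line(ad_spend, sales)
forecast = slope * 5.0 + intercept  # predicted sales at a spend of 5.0
print(f"slope={slope:.2f} intercept={intercept:.2f} forecast={forecast:.2f}")
```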
What are the applications of data mining?


● Marketing
  • Analysis of consumer behavior
  • Advertising campaigns
  • Targeted mailings
  • Segmentation of customers, stores, or products

● Finance
  • Creditworthiness of clients
  • Performance analysis of financial investments
  • Fraud detection

● Manufacturing
  • Optimization of resources
  • Optimization of manufacturing processes
  • Product design based on customer requirements

● Health Care
  • Discovering patterns in X-ray images
  • Analyzing side effects of drugs
  • Effectiveness of treatments
What are the current commercial tools for data mining?




  Data to knowledge
  • SAS
  • Oracle Data Miner
  • Intelligent Miner
  • Clementine
How to build a data mining model?
  An important concept is that building a mining model is part of a larger
  process.
1. Defining the problem
   Clearly define the business problem.
2. Preparing data
   Consolidate and clean the data that was identified in the
   "Defining the problem" step.
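
A minimal sketch of what consolidating and cleaning might look like, assuming two hypothetical sources keyed by customer_id. The record layout and the mean-fill policy for missing ages are assumptions chosen for illustration, not something the slides specify.

```python
# Hypothetical records from two sources; None marks a missing value.
store_a = [
    {"customer_id": 1, "age": 34, "spend": 120.0},
    {"customer_id": 2, "age": None, "spend": 80.0},
]
store_b = [
    {"customer_id": 2, "age": None, "spend": 80.0},  # duplicate of a store_a row
    {"customer_id": 3, "age": 51, "spend": 45.5},
]

def consolidate(*sources):
    """Merge sources, keeping the first record seen for each customer_id."""
    merged = {}
    for source in sources:
        for row in source:
            merged.setdefault(row["customer_id"], row)
    return list(merged.values())

def clean(rows):
    """Fill missing ages with the mean of the known ages (one simple policy)."""
    known = [r["age"] for r in rows if r["age"] is not None]
    mean_age = sum(known) / len(known)
    return [
        {**r, "age": r["age"] if r["age"] is not None else mean_age}
        for r in rows
    ]

prepared = clean(consolidate(store_a, store_b))
print(prepared)
```

Dropping incomplete rows is an equally valid policy; which one fits depends on how much data is missing and why.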
3. Exploring data
   Explore the prepared data.
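
Exploration often starts with summary statistics. The sketch below computes a few with Python's standard statistics module; the spend column is hypothetical, and which statistics matter depends on the problem defined in step 1.

```python
import statistics

# Hypothetical prepared column of monthly spend values.
spend = [120.0, 80.0, 45.5, 200.0, 95.0, 60.0]

summary = {
    "count": len(spend),
    "min": min(spend),
    "max": max(spend),
    "mean": statistics.mean(spend),
    "median": statistics.median(spend),
    "stdev": statistics.stdev(spend),  # sample standard deviation
}

for name, value in summary.items():
    print(f"{name:>6}: {value:.2f}")
```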
4. Building models
   Before you build a model, randomly split the prepared data into separate
   training and testing datasets. Use the training dataset to build the
   model, and the testing dataset to test its accuracy by running
   prediction queries.
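
The random split described above can be sketched as follows. The function name, the 30% test fraction, and the fixed seed are assumptions for the example; the slides do not give a ratio.

```python
import random

def train_test_split(rows, test_fraction=0.3, seed=42):
    """Shuffle a copy of rows and split it into (train, test) lists."""
    shuffled = list(rows)
    random.Random(seed).shuffle(shuffled)  # seeded so the split is repeatable
    n_test = int(round(len(shuffled) * test_fraction))
    return shuffled[n_test:], shuffled[:n_test]

rows = list(range(10))  # stand-in for prepared records
train, test = train_test_split(rows)
print(len(train), len(test))
```

Every record lands in exactly one of the two sets, so the model is never tested on data it was built from.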
5. Exploring and validating models
   Explore the models that you have built and test their effectiveness.
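
One simple way to test effectiveness is to compare a model's predictions against the known labels of the testing dataset. The labels below are hypothetical, and accuracy with a confusion count is only one of several possible measures.

```python
from collections import Counter

# Hypothetical held-out labels and one model's predictions for them.
actual = ["loyal", "discount", "loyal", "loyal", "discount", "discount"]
predicted = ["loyal", "discount", "discount", "loyal", "discount", "loyal"]

def accuracy(actual, predicted):
    """Fraction of predictions that match the actual label."""
    correct = sum(a == p for a, p in zip(actual, predicted))
    return correct / len(actual)

def confusion_counts(actual, predicted):
    """Count each (actual, predicted) pair."""
    return Counter(zip(actual, predicted))

acc = accuracy(actual, predicted)
matrix = confusion_counts(actual, predicted)
print(f"accuracy={acc:.2f}")
for (a, p), n in sorted(matrix.items()):
    print(f"actual={a:<8} predicted={p:<8} count={n}")
```

The confusion counts show which kind of mistake a model makes, which accuracy alone hides.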
6. Deploying and updating models
   Deploy the models that performed best to a production environment.
What are the major issues in data mining?

    • Mining different kinds of knowledge in databases
    • Interactive mining of knowledge at multiple levels of abstraction
    • Incorporation of background knowledge
    • Data mining query languages and ad hoc data mining
    • Expression and visualization of data mining results
    • Handling noise and incomplete data
    • Pattern evaluation: the interestingness problem
    • Integration of the discovered knowledge with existing knowledge:
      a knowledge fusion problem
    • Protection of data security, integrity, and privacy
What is the future of data mining?




      ● Active research is ongoing in:
        • Neural networks
        • Regression analysis
        • Genetic algorithms
      ● Data mining is used in many areas today. We cannot even begin to
        imagine what the future holds!
Thank you!

Data mining concepts
