SlideShare a Scribd company logo
4
Most read
10
Most read
12
Most read
Amarjit Kaur
Samiksha Sharma
 What is data mining ?
 Why data mining ?
 Data mining as a necessity
 Evolution of database
 Origin of data mining
 Data mining : A KDD process
 Applications
 Management areas
 Examples
 techniques
 Extration of implicit, previously unknown and
potentially useful information from data
 Exploration & analysis by automatic and semi
automatic means of large quantities of data
in order to discover meaningful patterns
 Lots of data is being collected and
warehoused
 Web data , e commerce
 Purchases at departmental store/ groceries
store
 Bank/credit card transactions
 Computers have become cheaper and more
powerful
 Competitve pressure is strong
 Provide better,customized services for an
edge (e.g in customer relationship
management)
DATA MINING –
AS A
NECESSITY
DATA EXPLOSION PROBLEM
 Automated data collection tools and mature
database technology lead to tremendous
amounts of data stored in databases, data
warehouses and other information
repositories
 We are drowning in data, but starving for
knowledge!
 Solution : data mining
 Extraction of interesting knowledge(rules,
regularities,patterns,constraints) from data
in large databases
 1960s:
Data collection, database creation, IMS and
network DBMS.
 1970s:
Relational data model, relational DBMS
implementation
 1980s:
RDBMS, advanced data models( extended
relational, OO, deductive sets) and application-
oriented DBMS(spatial,scientific,engineering etc)
 1990s-2000s:
Data mining and data warehousing, multimedia
databases, and web databases
 The term “data mining” was introduced in the
1990s. Data mining roots are traced back along
three family lines:
Classical
statistics
Artificial
intelligence
Machine
learning
 Statistical are the foundations of most technologies on
which data mining is built, e.g. regression analysis,
standard deviation etc. All these are used to study data
and data relationships.
 Artificial intelligence which is built upon heuristics as
opposed to statistics, attempts to apply human-
thoughts like processing to statistical problems. E.g.
RDBMS.
 Machine learning is to union of statistics and AI.
 DATA MINING therefore uses AI and statistical approach
together. It blends AI heuristics with advanced
statistical analysis to study data and find previously –
hidden trends or patterns within company using
statistical fundamental concepts and adding more
advanced AI algorithms to achieve the goal.
Information Technology Data Mining
 Database analysis and decision support
 Market analysis and management
 Target marketing, customer relation
management, market basket analysis, cross
selling, market segmentation
 Risk analysis and management
 Forecasting, customer retention,improved
underwriting, quality control,competitive
analysis
 Fraud detection and management
 Other applications
 Text mining(news group,email,documents) and
web analysis
 Intelligent query answering
 Sports
 IBM Advanced Scout analyzed NBA game statistics to
gain competitive advantage for New York Knicks and
Miami Heat
 Astronomy
 JPL and the Palomar Observatory discovered 22
quasars with the help of data-mining.
 Internet Web Surf-Aid
 IBM Surf-Aid applies data mining algorithms to web
access logs for market-related pages to discover
customer preference and behaviour pages, analyzing
effectiveness of web marketing, improving web site
organizations etc
 Cross-market analysis
 Associations/Co-relations between product sales
 Prediction based on the associations information
 Customer profiling
 Data mining can tell you what types of customers
buy what products.
 Identifying customer requirements
 Identifying the best products for different
customers.
 Use prediction to find what factors will attract new
customers.
 Provides summary information
 Various multidimensional summary reports
 Statistical summary information
 Finance planning and asset evaluation
i. Cash flow analysis and prediction
ii. Contingent claim analysis to evaluate assets
iii. Cross-sectional and time series analysis
 Resources planning
i. Summarize and compare the resources and
spending
 Competition
i. Monitor competition and market directions
ii. Set price strategy in the market
iii. Grouping of customer into classes
 Applications
 Widely used in health care, retail, credit card
services, telecommunications etc.
 Approach
 Use historical data to build models of fraudulent
behaviour and use data mining to help identify
similar instances.
 Examples
 Auto insurance : detect a group of people who
stage accidents to collect on insurance.
 Money laundering : detect suspicious money
transactions
Information Technology Data Mining
Information Technology Data Mining
Information Technology Data Mining
Information Technology Data Mining
Techniques
CLASSIFICATION
ASSOCIATION
SEQUENCE
CLUSTER
Information Technology Data Mining

More Related Content

PPTX
Research ayurveda
PPTX
Data mining
PPTX
RASA DRAVYA PARICHAYA I - PARADA.pptx as per ncism
PPTX
Basti dravya
PPTX
Deafness / Badhirya
PPT
Tridosha
PPTX
SAMPRAPTI VIGHATANA JWARA KAYACHIKITSA NCISM
PPTX
Medhya rasayan
Research ayurveda
Data mining
RASA DRAVYA PARICHAYA I - PARADA.pptx as per ncism
Basti dravya
Deafness / Badhirya
Tridosha
SAMPRAPTI VIGHATANA JWARA KAYACHIKITSA NCISM
Medhya rasayan

What's hot (20)

PPTX
Sheetapitta, udarda and kotha
PPTX
Uttara basti netra,Putaka & Matra
PDF
Research methodology for ayurveda MD/MS
PDF
Virechana 01 varanasi
PPT
Rajayakshma or Tuberculosis by Dr.Sandeep sharma
PPTX
Dasha vidha Pareeksha.pptx Dr Akshay Shetty
PPTX
Gall bladder stone treatment by dr.shailesh phalle
PPTX
Sikata Varga
PPTX
Paschat karma of virechana
PPTX
PPTX
Ppt aushadh sevana kala
PPTX
Ikshu.pptx
PPTX
Grahni an approach towards diagnosis and management of the
PDF
CONCEPT OF SHODHANA
PPT
Dr.Lavanya- Rasoushadhis Better than Kashtoushadhis. Substantiate
PPTX
Murivenna
PPTX
Concept of agni & ama seminar
PPTX
Ayurvedic Approach Towards Gall Stone & A Case Study
PPTX
Definitions in Rasa Shastra part 1
Sheetapitta, udarda and kotha
Uttara basti netra,Putaka & Matra
Research methodology for ayurveda MD/MS
Virechana 01 varanasi
Rajayakshma or Tuberculosis by Dr.Sandeep sharma
Dasha vidha Pareeksha.pptx Dr Akshay Shetty
Gall bladder stone treatment by dr.shailesh phalle
Sikata Varga
Paschat karma of virechana
Ppt aushadh sevana kala
Ikshu.pptx
Grahni an approach towards diagnosis and management of the
CONCEPT OF SHODHANA
Dr.Lavanya- Rasoushadhis Better than Kashtoushadhis. Substantiate
Murivenna
Concept of agni & ama seminar
Ayurvedic Approach Towards Gall Stone & A Case Study
Definitions in Rasa Shastra part 1
Ad

Viewers also liked (15)

PDF
SAS M2006 Presentation
PDF
SAS M2007 Presentation
PDF
Macquarie University Workshop on Text Mining and Health
PPT
The Do's and Don'ts of Data Mining
PPT
Using Data Mining Techniques to Analyze Crime Pattern
PPTX
Communication Research Methods
PPTX
Statistical software
PPTX
Major issues in data mining
PPTX
4 Essential Lessons for Adopting Predictive Analytics in Healthcare
DOCX
Mr. Charles Kangethe cv 2016
PDF
PfizerProposal_Final-1
PPTX
klinik medika edukasi PJK
PPTX
Flick Film FestivalInternship
PPTX
BVPresentation
PPTX
Top 8 Arquitectura europea
SAS M2006 Presentation
SAS M2007 Presentation
Macquarie University Workshop on Text Mining and Health
The Do's and Don'ts of Data Mining
Using Data Mining Techniques to Analyze Crime Pattern
Communication Research Methods
Statistical software
Major issues in data mining
4 Essential Lessons for Adopting Predictive Analytics in Healthcare
Mr. Charles Kangethe cv 2016
PfizerProposal_Final-1
klinik medika edukasi PJK
Flick Film FestivalInternship
BVPresentation
Top 8 Arquitectura europea
Ad

Similar to Information Technology Data Mining (20)

PPT
PPT
PPTX
Week-1-Introduction to Data Mining.pptx
PPT
6 weeks summer training in data mining,ludhiana
PPT
6 weeks summer training in data mining,jalandhar
PPT
6months industrial training in data mining,ludhiana
PPT
6months industrial training in data mining, jalandhar
PPTX
Business Intelligence and Analytics Unit-2 part-A .pptx
PDF
Lect 1 introduction
PPTX
Lect 1 introduction
PPT
Introduction of Data Mining - Concept and techniques
PPTX
Data mining
PDF
Data mining 1 - Introduction (cheat sheet - printable)
PPT
Data mining final year project in ludhiana
PPT
Data mining final year project in jalandhar
PPT
hanjia chapter_1.ppt data mining chapter 1
PPTX
Data mining
PPT
Data mining by_ashok
PPT
01Intro(1).ppt Introduction In computer science
PPT
Unit 1 (Chapter-1) on data mining concepts.ppt
Week-1-Introduction to Data Mining.pptx
6 weeks summer training in data mining,ludhiana
6 weeks summer training in data mining,jalandhar
6months industrial training in data mining,ludhiana
6months industrial training in data mining, jalandhar
Business Intelligence and Analytics Unit-2 part-A .pptx
Lect 1 introduction
Lect 1 introduction
Introduction of Data Mining - Concept and techniques
Data mining
Data mining 1 - Introduction (cheat sheet - printable)
Data mining final year project in ludhiana
Data mining final year project in jalandhar
hanjia chapter_1.ppt data mining chapter 1
Data mining
Data mining by_ashok
01Intro(1).ppt Introduction In computer science
Unit 1 (Chapter-1) on data mining concepts.ppt

Information Technology Data Mining

  • 2.  What is data mining ?  Why data mining ?  Data mining as a necessity  Evolution of database  Origin of data mining  Data mining : A KDD process  Applications  Management areas  Examples  techniques
  • 3.  Extration of implicit, previously unknown and potentially useful information from data  Exploration & analysis by automatic and semi automatic means of large quantities of data in order to discover meaningful patterns
  • 4.  Lots of data is being collected and warehoused  Web data , e commerce  Purchases at departmental store/ groceries store  Bank/credit card transactions  Computers have become cheaper and more powerful  Competitve pressure is strong  Provide better,customized services for an edge (e.g in customer relationship management)
  • 5. DATA MINING – AS A NECESSITY
  • 6. DATA EXPLOSION PROBLEM  Automated data collection tools and mature database technology lead to tremendous amounts of data stored in databases, data warehouses and other information repositories  We are drowning in data, but starving for knowledge!  Solution : data mining  Extraction of interesting knowledge(rules, regularities,patterns,constraints) from data in large databases
  • 7.  1960s: Data collection, database creation, IMS and network DBMS.  1970s: Relational data model, relational DBMS implementation  1980s: RDBMS, advanced data models( extended relational, OO, deductive sets) and application- oriented DBMS(spatial,scientific,engineering etc)  1990s-2000s: Data mining and data warehousing, multimedia databases, and web databases
  • 8.  The term “data mining” was introduced in the 1990s. Data mining roots are traced back along three family lines: Classical statistics Artificial intelligence Machine learning
  • 9.  Statistical are the foundations of most technologies on which data mining is built, e.g. regression analysis, standard deviation etc. All these are used to study data and data relationships.  Artificial intelligence which is built upon heuristics as opposed to statistics, attempts to apply human- thoughts like processing to statistical problems. E.g. RDBMS.  Machine learning is to union of statistics and AI.  DATA MINING therefore uses AI and statistical approach together. It blends AI heuristics with advanced statistical analysis to study data and find previously – hidden trends or patterns within company using statistical fundamental concepts and adding more advanced AI algorithms to achieve the goal.
  • 11.  Database analysis and decision support  Market analysis and management  Target marketing, customer relation management, market basket analysis, cross selling, market segmentation  Risk analysis and management  Forecasting, customer retention,improved underwriting, quality control,competitive analysis  Fraud detection and management  Other applications  Text mining(news group,email,documents) and web analysis  Intelligent query answering
  • 12.  Sports  IBM Advanced Scout analyzed NBA game statistics to gain competitive advantage for New York Knicks and Miami Heat  Astronomy  JPL and the Palomar Observatory discovered 22 quasars with the help of data-mining.  Internet Web Surf-Aid  IBM Surf-Aid applies data mining algorithms to web access logs for market-related pages to discover customer preference and behaviour pages, analyzing effectiveness of web marketing, improving web site organizations etc
  • 13.  Cross-market analysis  Associations/Co-relations between product sales  Prediction based on the associations information  Customer profiling  Data mining can tell you what types of customers buy what products.  Identifying customer requirements  Identifying the best products for different customers.  Use prediction to find what factors will attract new customers.  Provides summary information  Various multidimensional summary reports  Statistical summary information
  • 14.  Finance planning and asset evaluation i. Cash flow analysis and prediction ii. Contingent claim analysis to evaluate assets iii. Cross-sectional and time series analysis  Resources planning i. Summarize and compare the resources and spending  Competition i. Monitor competition and market directions ii. Set price strategy in the market iii. Grouping of customer into classes
  • 15.  Applications  Widely used in health care, retail, credit card services, telecommunications etc.  Approach  Use historical data to build models of fraudulent behaviour and use data mining to help identify similar instances.  Examples  Auto insurance : detect a group of people who stage accidents to collect on insurance.  Money laundering : detect suspicious money transactions