SlideShare a Scribd company logo
1
Presentation Outlines
Introduction
Similarity & Difference between Data Mining and
Web Mining
Reasons for Web Mining
Types of Web Mining
Architecture of Web Mining
Application of Web Mining
Challenges of Web Mining
Conclusion and Recommendations 2
INTROUDUCTION
Data Mining is the set of methodologies used in
analyzing data from various dimensions and perspectives,
finding previously unknown hidden patterns,
classifying and grouping the data and
summarizing the identified relationships.
3
INTROUDUCTION
Web mining can be broadly defined as the discovery and
analysis of useful information from the World Wide Web.
The data is collected from the server, client and database in
Web mining.
Web mining is a subset of data mining.
4
Difference between Datamining and Web mining
 In DM data is stored in data warehouse while data is stored in
web server database and web logos in WM.
 DM uses Structured data while WM uses Structured and
Unstructured data.
5
Similarity of Datamining and Web mining
 Their common goal is to extracting, discovering, finding and
mining hidden knowledge.
 Their concept is identification of patterns from the data
available in the system/web.
 Both are useful for decision making and prediction.
 Both follows the same process
 Both needs input/ source data to complete their process
6
REASONS FOR WEB MINING
 While dealing with the web data we face with the following
problems.
 User side Problem: Users browse or use the search service to find
a relevant information from the web.
 They face problems like:
•Low Precision,
•getting an irrelevant information and
• Low Recall .
7
Cont…
 Information Providers/Server Problem:
 What do customers do,
 what do the customers want,
 how effectively use the web data to market products
 and service to the customers?
8
Web Mining Tools
 Data Miner (Web Content Mining Tool)
 Google Analytics (Web Usage Mining Tool)
 Majestic (Web structure mining tool)
 Scrappy (Web content mining tool)
 Oracle data Mining (Web Usage Mining Tool)
 Bixo (Web structure mining tool)
 Weka (Web Usage Mining tool )
9
TYPES OF WEB MINING
 Web mining can be generally divided into three categories,
based on the data to be mined as seen in Figure :
Figure 1: Types of Web Mining
10
Web Content Mining
 Web Content Mining the process of collecting useful data from
websites.
 This content includes news, comments, company information,
product catalogs, etc.
 It is extract information or knowledge from collected sources.
 This content may consists text, image, video, sound or
structured records such as lists and tables.
11
Web Structure Mining
 It is the process of extracting structural information from the
web.
Hyperlinks: is a structural component that connects the web
page to a different location.
Document Structure: organization of content from the web
page in tree-structure format based on HTML and XML tags
with in the page.
12
Web Usage Mining
 It is the application of data mining techniques to discover
patterns using the Web to better understand and meet the needs
of the user.
 It is classified in to three based on the kind of data usage.
 Web Server Data: the web server including IP address, page reference
and access time collects user logs.
 Application Server Data: ability to track various kinds of business events.
 Application Level Data: defining new kinds of events and logging them
by generating histories of the events.
13
Architecture of Web Mining
14
Figure 2: Architecture of Web Mining
APPLICATIONS OF WEB MINING
 A web mining has a lot of application in different sectors or areas.
Figure 3: Application of Web mining
15
Cont…
E-Learning:
 Web mining can be used for improving and enhancing the
process of E-learning environments.
 Applications of web mining to e-learning are usually web usage
based.
 Machine learning techniques and web usage mining enhance
web based learning environments
16
Cont…
Electronic commerce:
 A major challenge e-commerce is to understand visitors or
customers needs and to value orientations as such as possible.
 It can improve capacity of service for consumer and
competitive advantages.
17
Cont…
Security and Crime Investigation:
 Web mining techniques are also used for protection of user
system or logging information against such cybercrimes as
hacking,
internet fraud,
fraudulent websites,
illegal online gambling,
virus spreading,
child pornography distribution and
cyber terrorism. 18
Cont…
Electronic Business:
Web mining techniques can support a web enabled
electronic business to improve on
•Marketing,
•Customer support and
•Sales operations.
19
Advantages of Web Mining
 Increases of profits of companies or organizations by sealing products.
 Protect user system or logging information from cybercrimes.
 Improves capacity of service for consumer and competitive advantages.
 improving and enhancing the process of E-learning environments.
 It opens door for Business Intelligence or Knowledge economy.
 It supports for Decision Making and prediction.
 Mining and Discovering hidden knowledge.
 Used for data analysis.
20
Disadvantages
 URL’s can be tracked to access the data,
 Multiplicity of events and URL’s,
 Large amount of data remain unused
 Since data are updatable it is not good to say they are untrusted
21
WEB MINING CHALENGES
Web mining is faced with various technical and non-technical
challenges.
The non-technical restrictions can be included the
lack of management support,
inadequate fund and
lack of required resources such as professional human
resources.
22
Cont…
The technical issues are
Incorrect and Inaccurate Data
Data may be inaccurate.
Data may be incomplete and unavailable.
The lack of tools
Available tools only support one of the web mining
types such as classification or clustering.
23
CONCLUSION
 As web usage and information source in the World Wide Web
are growing continuously it is a good opportunity having web
miner to extract hidden knowledge's from the web.
 As a weakness not all but some researchers are replaced Web
mining by Text mining. It is strongly wrong since web mining is
concentrated with too much multimedia information's but text
mining is only for textual data.
24
RECOMMENDATION
For the future Web mining tools should become supportable for all
clustering, classification and association techniques.
Since privacy is a big challenge for and harms the process of web
mining it is good for the future things or data's should be released
publicly and to increase the societies habit of knowledge sharing by
serving training and collaborative opportunities.
25
26

More Related Content

PPTX
web mining
PPTX
Web mining
PPTX
Web content mining
ODP
Web content mining
PPTX
Web Mining & Text Mining
PPTX
Data mining
PPT
4.3 multimedia datamining
PPTX
Text mining
web mining
Web mining
Web content mining
Web content mining
Web Mining & Text Mining
Data mining
4.3 multimedia datamining
Text mining

What's hot (20)

PPTX
Text MIning
PPTX
Data mining presentation.ppt
PDF
Data science presentation
ODP
Web Content Mining
PPTX
Data mining
PPTX
Web usage mining
PPTX
Web Mining Presentation Final
PPTX
Introduction to Data Mining
PPTX
PPTX
OLAP & DATA WAREHOUSE
PPTX
Data visualization
PPT
OLAP
PPTX
Data mining , Knowledge Discovery Process, Classification
PPT
Multimedia Mining
PPT
Data mining slides
 
PPTX
Web mining (structure mining)
PPTX
Data mining tasks
PDF
An introduction to Machine Learning
 
PPT
01 Data Mining: Concepts and Techniques, 2nd ed.
PPTX
Kdd process
Text MIning
Data mining presentation.ppt
Data science presentation
Web Content Mining
Data mining
Web usage mining
Web Mining Presentation Final
Introduction to Data Mining
OLAP & DATA WAREHOUSE
Data visualization
OLAP
Data mining , Knowledge Discovery Process, Classification
Multimedia Mining
Data mining slides
 
Web mining (structure mining)
Data mining tasks
An introduction to Machine Learning
 
01 Data Mining: Concepts and Techniques, 2nd ed.
Kdd process
Ad

Similar to Web mining (20)

PPTX
Web mining
PPTX
Web mining
PPTX
Web
PDF
Web mining and social media mining
PPTX
WEB MININGG.pptx go to thw lab where we found ppt
PPTX
Web Mining
PPTX
Web mining
PPTX
Web Mining by dhirba mahara on web mining
PDF
RESEARCH ISSUES IN WEB MINING
 
PDF
RESEARCH ISSUES IN WEB MINING
 
PDF
RESEARCH ISSUES IN WEB MINING
 
PDF
RESEARCH ISSUES IN WEB MINING
 
PDF
RESEARCH ISSUES IN WEB MINING
 
PDF
RESEARCH ISSUES IN WEB MINING
 
PDF
RESEARCH ISSUES IN WEB MINING
 
PPTX
Web Mining
PPTX
Web mining
PPTX
WEB MINING.
DOCX
Minning www
PPTX
Web mining
Web mining
Web mining
Web
Web mining and social media mining
WEB MININGG.pptx go to thw lab where we found ppt
Web Mining
Web mining
Web Mining by dhirba mahara on web mining
RESEARCH ISSUES IN WEB MINING
 
RESEARCH ISSUES IN WEB MINING
 
RESEARCH ISSUES IN WEB MINING
 
RESEARCH ISSUES IN WEB MINING
 
RESEARCH ISSUES IN WEB MINING
 
RESEARCH ISSUES IN WEB MINING
 
RESEARCH ISSUES IN WEB MINING
 
Web Mining
Web mining
WEB MINING.
Minning www
Web mining
Ad

Recently uploaded (20)

PDF
Fluorescence-microscope_Botany_detailed content
PPTX
Database Infoormation System (DBIS).pptx
PPTX
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
PPTX
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
PPTX
05. PRACTICAL GUIDE TO MICROSOFT EXCEL.pptx
PPTX
STUDY DESIGN details- Lt Col Maksud (21).pptx
PPTX
Major-Components-ofNKJNNKNKNKNKronment.pptx
PPTX
Introduction-to-Cloud-ComputingFinal.pptx
PPTX
oil_refinery_comprehensive_20250804084928 (1).pptx
PPTX
advance b rammar.pptxfdgdfgdfsgdfgsdgfdfgdfgsdfgdfgdfg
PPTX
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...
PDF
.pdf is not working space design for the following data for the following dat...
PPT
Chapter 2 METAL FORMINGhhhhhhhjjjjmmmmmmmmm
PPTX
Business Ppt On Nestle.pptx huunnnhhgfvu
PDF
“Getting Started with Data Analytics Using R – Concepts, Tools & Case Studies”
PDF
BF and FI - Blockchain, fintech and Financial Innovation Lesson 2.pdf
PDF
Galatica Smart Energy Infrastructure Startup Pitch Deck
PPTX
Global journeys: estimating international migration
PDF
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
Fluorescence-microscope_Botany_detailed content
Database Infoormation System (DBIS).pptx
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
05. PRACTICAL GUIDE TO MICROSOFT EXCEL.pptx
STUDY DESIGN details- Lt Col Maksud (21).pptx
Major-Components-ofNKJNNKNKNKNKronment.pptx
Introduction-to-Cloud-ComputingFinal.pptx
oil_refinery_comprehensive_20250804084928 (1).pptx
advance b rammar.pptxfdgdfgdfsgdfgsdgfdfgdfgsdfgdfgdfg
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...
.pdf is not working space design for the following data for the following dat...
Chapter 2 METAL FORMINGhhhhhhhjjjjmmmmmmmmm
Business Ppt On Nestle.pptx huunnnhhgfvu
“Getting Started with Data Analytics Using R – Concepts, Tools & Case Studies”
BF and FI - Blockchain, fintech and Financial Innovation Lesson 2.pdf
Galatica Smart Energy Infrastructure Startup Pitch Deck
Global journeys: estimating international migration
168300704-gasification-ppt.pdfhghhhsjsjhsuxush

Web mining

  • 1. 1
  • 2. Presentation Outlines Introduction Similarity & Difference between Data Mining and Web Mining Reasons for Web Mining Types of Web Mining Architecture of Web Mining Application of Web Mining Challenges of Web Mining Conclusion and Recommendations 2
  • 3. INTROUDUCTION Data Mining is the set of methodologies used in analyzing data from various dimensions and perspectives, finding previously unknown hidden patterns, classifying and grouping the data and summarizing the identified relationships. 3
  • 4. INTROUDUCTION Web mining can be broadly defined as the discovery and analysis of useful information from the World Wide Web. The data is collected from the server, client and database in Web mining. Web mining is a subset of data mining. 4
  • 5. Difference between Datamining and Web mining  In DM data is stored in data warehouse while data is stored in web server database and web logos in WM.  DM uses Structured data while WM uses Structured and Unstructured data. 5
  • 6. Similarity of Datamining and Web mining  Their common goal is to extracting, discovering, finding and mining hidden knowledge.  Their concept is identification of patterns from the data available in the system/web.  Both are useful for decision making and prediction.  Both follows the same process  Both needs input/ source data to complete their process 6
  • 7. REASONS FOR WEB MINING  While dealing with the web data we face with the following problems.  User side Problem: Users browse or use the search service to find a relevant information from the web.  They face problems like: •Low Precision, •getting an irrelevant information and • Low Recall . 7
  • 8. Cont…  Information Providers/Server Problem:  What do customers do,  what do the customers want,  how effectively use the web data to market products  and service to the customers? 8
  • 9. Web Mining Tools  Data Miner (Web Content Mining Tool)  Google Analytics (Web Usage Mining Tool)  Majestic (Web structure mining tool)  Scrappy (Web content mining tool)  Oracle data Mining (Web Usage Mining Tool)  Bixo (Web structure mining tool)  Weka (Web Usage Mining tool ) 9
  • 10. TYPES OF WEB MINING  Web mining can be generally divided into three categories, based on the data to be mined as seen in Figure : Figure 1: Types of Web Mining 10
  • 11. Web Content Mining  Web Content Mining the process of collecting useful data from websites.  This content includes news, comments, company information, product catalogs, etc.  It is extract information or knowledge from collected sources.  This content may consists text, image, video, sound or structured records such as lists and tables. 11
  • 12. Web Structure Mining  It is the process of extracting structural information from the web. Hyperlinks: is a structural component that connects the web page to a different location. Document Structure: organization of content from the web page in tree-structure format based on HTML and XML tags with in the page. 12
  • 13. Web Usage Mining  It is the application of data mining techniques to discover patterns using the Web to better understand and meet the needs of the user.  It is classified in to three based on the kind of data usage.  Web Server Data: the web server including IP address, page reference and access time collects user logs.  Application Server Data: ability to track various kinds of business events.  Application Level Data: defining new kinds of events and logging them by generating histories of the events. 13
  • 14. Architecture of Web Mining 14 Figure 2: Architecture of Web Mining
  • 15. APPLICATIONS OF WEB MINING  A web mining has a lot of application in different sectors or areas. Figure 3: Application of Web mining 15
  • 16. Cont… E-Learning:  Web mining can be used for improving and enhancing the process of E-learning environments.  Applications of web mining to e-learning are usually web usage based.  Machine learning techniques and web usage mining enhance web based learning environments 16
  • 17. Cont… Electronic commerce:  A major challenge e-commerce is to understand visitors or customers needs and to value orientations as such as possible.  It can improve capacity of service for consumer and competitive advantages. 17
  • 18. Cont… Security and Crime Investigation:  Web mining techniques are also used for protection of user system or logging information against such cybercrimes as hacking, internet fraud, fraudulent websites, illegal online gambling, virus spreading, child pornography distribution and cyber terrorism. 18
  • 19. Cont… Electronic Business: Web mining techniques can support a web enabled electronic business to improve on •Marketing, •Customer support and •Sales operations. 19
  • 20. Advantages of Web Mining  Increases of profits of companies or organizations by sealing products.  Protect user system or logging information from cybercrimes.  Improves capacity of service for consumer and competitive advantages.  improving and enhancing the process of E-learning environments.  It opens door for Business Intelligence or Knowledge economy.  It supports for Decision Making and prediction.  Mining and Discovering hidden knowledge.  Used for data analysis. 20
  • 21. Disadvantages  URL’s can be tracked to access the data,  Multiplicity of events and URL’s,  Large amount of data remain unused  Since data are updatable it is not good to say they are untrusted 21
  • 22. WEB MINING CHALENGES Web mining is faced with various technical and non-technical challenges. The non-technical restrictions can be included the lack of management support, inadequate fund and lack of required resources such as professional human resources. 22
  • 23. Cont… The technical issues are Incorrect and Inaccurate Data Data may be inaccurate. Data may be incomplete and unavailable. The lack of tools Available tools only support one of the web mining types such as classification or clustering. 23
  • 24. CONCLUSION  As web usage and information source in the World Wide Web are growing continuously it is a good opportunity having web miner to extract hidden knowledge's from the web.  As a weakness not all but some researchers are replaced Web mining by Text mining. It is strongly wrong since web mining is concentrated with too much multimedia information's but text mining is only for textual data. 24
  • 25. RECOMMENDATION For the future Web mining tools should become supportable for all clustering, classification and association techniques. Since privacy is a big challenge for and harms the process of web mining it is good for the future things or data's should be released publicly and to increase the societies habit of knowledge sharing by serving training and collaborative opportunities. 25
  • 26. 26