SlideShare a Scribd company logo
P1WU
UNIT – III: CLASSIFICATION
Topic 2: UNSUPERVIZED ALGORITHMS -
CLUSTERING
AALIM MUHAMMED SALEGH COLLEGE OF ENGINEERING
DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING
SEMESTER – VIII
PROFESSIONAL ELECTIVE – IV
CS8080- INFORMATION RETRIEVAL TECHNIQUES
UNIT III
1.A Characterization of Text
Classification
2. Unsupervised
Algorithms: Clustering
3. Naïve Text Classification
4. Supervised Algorithms
5. Decision Tree
6. k-NN Classifier
7. SVM Classifier
8. Feature Selection or
Dimensionality Reduction
9. Evaluation metrics
10. Accuracy and Error
11. Organizing the classes
12. Indexing and Searching
13. Inverted Indexes
14. Sequential Searching
15. Multi-dimensional
Indexing
AALIM MUHAMMED SALEGH COLLEGE OF ENGINEERING
DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING
SEMESTER – VIII
PROFESSIONAL ELECTIVE – IV
CS8080- INFORMATION RETRIEVAL TECHNIQUES
INTRODUCTION TO UNSUPERVIZED ALGORITHMS
• Below is the list of some popular unsupervised learning algorithms:
• K-means clustering
• KNN (k-nearest neighbors)
• Hierarchal clustering
• Anomaly detection
• Neural Networks
• Principle Component Analysis
• Independent Component Analysis
• Apriori algorithm
• Singular value decomposition
AALIM MUHAMMED SALEGH COLLEGE OF ENGINEERING
DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING
SEMESTER – VIII
PROFESSIONAL ELECTIVE – IV
CS8080- INFORMATION RETRIEVAL TECHNIQUES
INTRODUCTION TO UNSUPERVIZED ALGORITHMS
AALIM MUHAMMED SALEGH COLLEGE OF ENGINEERING
DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING
SEMESTER – VIII
PROFESSIONAL ELECTIVE – IV
CS8080- INFORMATION RETRIEVAL TECHNIQUES
WHAT ARE CLUSTERING?
• Clustering or cluster analysis is a
machine learning technique, which
groups the unlabelled dataset.
• It can be defined as "A way of
grouping the data points into
different clusters, consisting of
similar data points. The objects with
the possible similarities remain in a
group that has less or no similarities
with another group."
AALIM MUHAMMED SALEGH COLLEGE OF ENGINEERING
DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING
SEMESTER – VIII
PROFESSIONAL ELECTIVE – IV
CS8080- INFORMATION RETRIEVAL TECHNIQUES
WHAT ARE CLUSTERING?
• It does it by
• finding some similar patterns in the unlabelled dataset
such as shape, size, color, behavior, etc., and divides them
as per the presence and absence of those similar patterns.
• It is an unsupervised learning method,
• hence no supervision is provided to the algorithm, and it
deals with the unlabeled dataset.
AALIM MUHAMMED SALEGH COLLEGE OF ENGINEERING
DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING
SEMESTER – VIII
PROFESSIONAL ELECTIVE – IV
CS8080- INFORMATION RETRIEVAL TECHNIQUES
Difference between Supervised and Unsupervised Learning
AALIM MUHAMMED SALEGH COLLEGE OF ENGINEERING
DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING
SEMESTER – VIII
PROFESSIONAL ELECTIVE – IV
CS8080- INFORMATION RETRIEVAL TECHNIQUES
Supervised Learning Unsupervised Learning
Supervised learning algorithms aretrained using labeled data. Unsupervised learning algorithmsare trained using unlabeled data.
Supervised learning model takesdirect feedback to check if it is
predicting correct output or not.
Unsupervised learning model doesnot take any feedback.
Supervised learning model predictsthe output. Unsupervised learning model findsthe hidden patterns in data.
Supervised learning needs supervision to train the model. Unsupervised learning does not needany supervision to train the model.
Supervised learning can becategorized
in Classification and Regression problems.
Unsupervised Learning can beclassified in Clustering and
Associations problems.
Supervised learning can be used for those cases where we
know theinput as well as corresponding outputs.
Unsupervised learning can be used for those cases where we have
onlyinput data and no corresponding output data.
Supervised learning model produces an accurate result. Unsupervised learning model may give less accurate result as compared
to supervised learning.
It includes various algorithms such It includes various algorithms such
Advantages of Unsupervised Learning
• Unsupervised learning is used for more complex tasks
as compared to supervised learning because,
• in unsupervised learning, we don't have labeled input data.
• Unsupervised learning is preferable as
• it is easy to get unlabeled data in comparison to labeled
data.
AALIM MUHAMMED SALEGH COLLEGE OF ENGINEERING
DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING
SEMESTER – VIII
PROFESSIONAL ELECTIVE – IV
CS8080- INFORMATION RETRIEVAL TECHNIQUES
Disadvantages of Unsupervised Learning
• Unsupervised learning is
• intrinsically more difficult than supervised learning as it does not have
corresponding output.
• The result of the unsupervised learning algorithm might be
• less accurate as input data is not labeled, and algorithms do not know the
exact output in advance.
AALIM MUHAMMED SALEGH COLLEGE OF ENGINEERING
DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING
SEMESTER – VIII
PROFESSIONAL ELECTIVE – IV
CS8080- INFORMATION RETRIEVAL TECHNIQUES
Any Questions?
AALIM MUHAMMED SALEGH COLLEGE OF ENGINEERING
DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING
SEMESTER – VIII
PROFESSIONAL ELECTIVE – IV
CS8080- INFORMATION RETRIEVAL TECHNIQUES

More Related Content

PDF
CS8080_IRT_UNIT - III T4 SUPERVISED ALGORITHMS.pdf
PDF
CS8080_IRT_UNIT - III T7 SVM CLASSIFIER.pdf
PDF
CS8080_IRT_UNIT - III T3 NAIVE TEXT CLASSIFICATION.pdf
PDF
Lecture 1: What is Machine Learning?
PPTX
Automated Machine Learning
PDF
CS8080 information retrieval techniques unit iii ppt in pdf
PPTX
Intro/Overview on Machine Learning Presentation
PDF
CS8080_IRT_UNIT - III T6 K-NN CLASSIFIER.pdf
CS8080_IRT_UNIT - III T4 SUPERVISED ALGORITHMS.pdf
CS8080_IRT_UNIT - III T7 SVM CLASSIFIER.pdf
CS8080_IRT_UNIT - III T3 NAIVE TEXT CLASSIFICATION.pdf
Lecture 1: What is Machine Learning?
Automated Machine Learning
CS8080 information retrieval techniques unit iii ppt in pdf
Intro/Overview on Machine Learning Presentation
CS8080_IRT_UNIT - III T6 K-NN CLASSIFIER.pdf

What's hot (20)

PDF
CS8080 INFORMATION RETRIEVAL TECHNIQUES - IRT - UNIT - I PPT IN PDF
PPTX
Machine learning overview
PPTX
Inductive bias
PDF
AutoML lectures (ACDL 2019)
PPTX
Introduction to machine learning
PPTX
Simple overview of machine learning
PPTX
Machine Learning and Real-World Applications
PPTX
Intro to Machine Learning & AI
DOC
Intro/Overview on Machine Learning Presentation -2
PPTX
Supervised learning and unsupervised learning
PPTX
Types of machine learning
PPTX
Artificial neural networks and its applications
PDF
Supervised and Unsupervised Machine Learning
PPTX
Machine learning
PPTX
Machine learning
PDF
Sensor Cloud
PDF
Deep learning
PPTX
Supervised and unsupervised learning
PPTX
Machine Learning
PDF
Reinforcement Learning In AI Powerpoint Presentation Slide Templates Complete...
CS8080 INFORMATION RETRIEVAL TECHNIQUES - IRT - UNIT - I PPT IN PDF
Machine learning overview
Inductive bias
AutoML lectures (ACDL 2019)
Introduction to machine learning
Simple overview of machine learning
Machine Learning and Real-World Applications
Intro to Machine Learning & AI
Intro/Overview on Machine Learning Presentation -2
Supervised learning and unsupervised learning
Types of machine learning
Artificial neural networks and its applications
Supervised and Unsupervised Machine Learning
Machine learning
Machine learning
Sensor Cloud
Deep learning
Supervised and unsupervised learning
Machine Learning
Reinforcement Learning In AI Powerpoint Presentation Slide Templates Complete...
Ad

Similar to CS8080_IRT_UNIT - III T2 UNSUPERVISED ALGORITHMS -CLUSTERING.pdf (20)

PPTX
supervised and unsupervised machine learning, types of machine learning
PDF
Module 7: Unsupervised Learning
PPTX
Unsupervised Learning and k means (1).pptx
PDF
Mlmlmlmlmlmlmlmlmlmlmlmlmlmlmlml.lmlmlmlmlm
PPTX
Supervised learning and Unsupervised learning
PPTX
Supervised vs unsupervised machine learning algorithms
PPTX
Unit 2 unsupervised learning.pptx
PPTX
Supervised and Unsupervised Machine learning
PPTX
SUPERVISED LEARNING AND UNSUPERVISED LEARNING
PPTX
MachineLearning_intro_Types_of_learning.pptx
PDF
Unit1_Types of MACHINE LEARNING 2020pattern.pdf
PPTX
Lecturer3 by RamaKrishna SRU waranagal telanga
PDF
IRJET-Comparison between Supervised Learning and Unsupervised Learning
PPTX
cluster.pptx
PDF
Unsupervised Learning | IABAC Certification
PDF
Supervised learning and unsupervised learning new 2024.pdf
PDF
Supervised learning and unsupervised learning new 2024.pdf
PPTX
Supervised learning and unsupervised learning new 2024.pptx
PPTX
Machine learning
PDF
Machine Learning - Deep Learning
supervised and unsupervised machine learning, types of machine learning
Module 7: Unsupervised Learning
Unsupervised Learning and k means (1).pptx
Mlmlmlmlmlmlmlmlmlmlmlmlmlmlmlml.lmlmlmlmlm
Supervised learning and Unsupervised learning
Supervised vs unsupervised machine learning algorithms
Unit 2 unsupervised learning.pptx
Supervised and Unsupervised Machine learning
SUPERVISED LEARNING AND UNSUPERVISED LEARNING
MachineLearning_intro_Types_of_learning.pptx
Unit1_Types of MACHINE LEARNING 2020pattern.pdf
Lecturer3 by RamaKrishna SRU waranagal telanga
IRJET-Comparison between Supervised Learning and Unsupervised Learning
cluster.pptx
Unsupervised Learning | IABAC Certification
Supervised learning and unsupervised learning new 2024.pdf
Supervised learning and unsupervised learning new 2024.pdf
Supervised learning and unsupervised learning new 2024.pptx
Machine learning
Machine Learning - Deep Learning
Ad

More from AALIM MUHAMMED SALEGH COLLEGE OF ENGINEERING (20)

PPTX
JAVA PROGRAM CONSTRUCTS OR LANGUAGE BASICS.pptx
PPTX
CS3391 OOP UT-I T4 JAVA BUZZWORDS.pptx
PPTX
CS3391 OOP UT-I T1 OVERVIEW OF OOP
PPTX
CS3391 OOP UT-I T3 FEATURES OF OBJECT ORIENTED PROGRAMMING
PPTX
CS3391 OOP UT-I T2 OBJECT ORIENTED PROGRAMMING PARADIGM.pptx
PDF
CS3391 -OOP -UNIT – V NOTES FINAL.pdf
PDF
CS3391 -OOP -UNIT – IV NOTES FINAL.pdf
PDF
CS3391 -OOP -UNIT – III NOTES FINAL.pdf
PDF
CS3391 -OOP -UNIT – II NOTES FINAL.pdf
PDF
CS3391 -OOP -UNIT – I NOTES FINAL.pdf
PDF
PDF
CS8080_IRT_UNIT - III T14 SEQUENTIAL SEARCHING.pdf
PDF
CS8080_IRT_UNIT - III T15 MULTI-DIMENSIONAL INDEXING.pdf
PDF
CS8080_IRT_UNIT - III T13 INVERTED INDEXES.pdf
PDF
CS8080 IRT UNIT - III SLIDES IN PDF.pdf
PDF
CS8080_IRT_UNIT - III T11 ORGANIZING THE CLASSES.pdf
PDF
CS8080_IRT_UNIT - III T12 INDEXING AND SEARCHING.pdf
PDF
CS8080_IRT_UNIT - III T11 ORGANIZING THE CLASSES.pdf
JAVA PROGRAM CONSTRUCTS OR LANGUAGE BASICS.pptx
CS3391 OOP UT-I T4 JAVA BUZZWORDS.pptx
CS3391 OOP UT-I T1 OVERVIEW OF OOP
CS3391 OOP UT-I T3 FEATURES OF OBJECT ORIENTED PROGRAMMING
CS3391 OOP UT-I T2 OBJECT ORIENTED PROGRAMMING PARADIGM.pptx
CS3391 -OOP -UNIT – V NOTES FINAL.pdf
CS3391 -OOP -UNIT – IV NOTES FINAL.pdf
CS3391 -OOP -UNIT – III NOTES FINAL.pdf
CS3391 -OOP -UNIT – II NOTES FINAL.pdf
CS3391 -OOP -UNIT – I NOTES FINAL.pdf
CS8080_IRT_UNIT - III T14 SEQUENTIAL SEARCHING.pdf
CS8080_IRT_UNIT - III T15 MULTI-DIMENSIONAL INDEXING.pdf
CS8080_IRT_UNIT - III T13 INVERTED INDEXES.pdf
CS8080 IRT UNIT - III SLIDES IN PDF.pdf
CS8080_IRT_UNIT - III T11 ORGANIZING THE CLASSES.pdf
CS8080_IRT_UNIT - III T12 INDEXING AND SEARCHING.pdf
CS8080_IRT_UNIT - III T11 ORGANIZING THE CLASSES.pdf

Recently uploaded (20)

DOCX
573137875-Attendance-Management-System-original
PDF
PRIZ Academy - 9 Windows Thinking Where to Invest Today to Win Tomorrow.pdf
PDF
keyrequirementskkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkk
PDF
Embodied AI: Ushering in the Next Era of Intelligent Systems
PPTX
Lecture Notes Electrical Wiring System Components
PPTX
bas. eng. economics group 4 presentation 1.pptx
PPTX
Internet of Things (IOT) - A guide to understanding
PPTX
web development for engineering and engineering
PDF
Evaluating the Democratization of the Turkish Armed Forces from a Normative P...
PPT
CRASH COURSE IN ALTERNATIVE PLUMBING CLASS
PPTX
Recipes for Real Time Voice AI WebRTC, SLMs and Open Source Software.pptx
PPTX
additive manufacturing of ss316l using mig welding
PDF
R24 SURVEYING LAB MANUAL for civil enggi
PDF
PPT on Performance Review to get promotions
PPTX
UNIT 4 Total Quality Management .pptx
PPTX
KTU 2019 -S7-MCN 401 MODULE 2-VINAY.pptx
PPTX
IOT PPTs Week 10 Lecture Material.pptx of NPTEL Smart Cities contd
PDF
Digital Logic Computer Design lecture notes
PDF
TFEC-4-2020-Design-Guide-for-Timber-Roof-Trusses.pdf
PDF
composite construction of structures.pdf
573137875-Attendance-Management-System-original
PRIZ Academy - 9 Windows Thinking Where to Invest Today to Win Tomorrow.pdf
keyrequirementskkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkk
Embodied AI: Ushering in the Next Era of Intelligent Systems
Lecture Notes Electrical Wiring System Components
bas. eng. economics group 4 presentation 1.pptx
Internet of Things (IOT) - A guide to understanding
web development for engineering and engineering
Evaluating the Democratization of the Turkish Armed Forces from a Normative P...
CRASH COURSE IN ALTERNATIVE PLUMBING CLASS
Recipes for Real Time Voice AI WebRTC, SLMs and Open Source Software.pptx
additive manufacturing of ss316l using mig welding
R24 SURVEYING LAB MANUAL for civil enggi
PPT on Performance Review to get promotions
UNIT 4 Total Quality Management .pptx
KTU 2019 -S7-MCN 401 MODULE 2-VINAY.pptx
IOT PPTs Week 10 Lecture Material.pptx of NPTEL Smart Cities contd
Digital Logic Computer Design lecture notes
TFEC-4-2020-Design-Guide-for-Timber-Roof-Trusses.pdf
composite construction of structures.pdf

CS8080_IRT_UNIT - III T2 UNSUPERVISED ALGORITHMS -CLUSTERING.pdf

  • 1. P1WU UNIT – III: CLASSIFICATION Topic 2: UNSUPERVIZED ALGORITHMS - CLUSTERING AALIM MUHAMMED SALEGH COLLEGE OF ENGINEERING DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING SEMESTER – VIII PROFESSIONAL ELECTIVE – IV CS8080- INFORMATION RETRIEVAL TECHNIQUES
  • 2. UNIT III 1.A Characterization of Text Classification 2. Unsupervised Algorithms: Clustering 3. Naïve Text Classification 4. Supervised Algorithms 5. Decision Tree 6. k-NN Classifier 7. SVM Classifier 8. Feature Selection or Dimensionality Reduction 9. Evaluation metrics 10. Accuracy and Error 11. Organizing the classes 12. Indexing and Searching 13. Inverted Indexes 14. Sequential Searching 15. Multi-dimensional Indexing AALIM MUHAMMED SALEGH COLLEGE OF ENGINEERING DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING SEMESTER – VIII PROFESSIONAL ELECTIVE – IV CS8080- INFORMATION RETRIEVAL TECHNIQUES
  • 3. INTRODUCTION TO UNSUPERVIZED ALGORITHMS • Below is the list of some popular unsupervised learning algorithms: • K-means clustering • KNN (k-nearest neighbors) • Hierarchal clustering • Anomaly detection • Neural Networks • Principle Component Analysis • Independent Component Analysis • Apriori algorithm • Singular value decomposition AALIM MUHAMMED SALEGH COLLEGE OF ENGINEERING DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING SEMESTER – VIII PROFESSIONAL ELECTIVE – IV CS8080- INFORMATION RETRIEVAL TECHNIQUES
  • 4. INTRODUCTION TO UNSUPERVIZED ALGORITHMS AALIM MUHAMMED SALEGH COLLEGE OF ENGINEERING DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING SEMESTER – VIII PROFESSIONAL ELECTIVE – IV CS8080- INFORMATION RETRIEVAL TECHNIQUES
  • 5. WHAT ARE CLUSTERING? • Clustering or cluster analysis is a machine learning technique, which groups the unlabelled dataset. • It can be defined as "A way of grouping the data points into different clusters, consisting of similar data points. The objects with the possible similarities remain in a group that has less or no similarities with another group." AALIM MUHAMMED SALEGH COLLEGE OF ENGINEERING DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING SEMESTER – VIII PROFESSIONAL ELECTIVE – IV CS8080- INFORMATION RETRIEVAL TECHNIQUES
  • 6. WHAT ARE CLUSTERING? • It does it by • finding some similar patterns in the unlabelled dataset such as shape, size, color, behavior, etc., and divides them as per the presence and absence of those similar patterns. • It is an unsupervised learning method, • hence no supervision is provided to the algorithm, and it deals with the unlabeled dataset. AALIM MUHAMMED SALEGH COLLEGE OF ENGINEERING DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING SEMESTER – VIII PROFESSIONAL ELECTIVE – IV CS8080- INFORMATION RETRIEVAL TECHNIQUES
  • 7. Difference between Supervised and Unsupervised Learning AALIM MUHAMMED SALEGH COLLEGE OF ENGINEERING DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING SEMESTER – VIII PROFESSIONAL ELECTIVE – IV CS8080- INFORMATION RETRIEVAL TECHNIQUES Supervised Learning Unsupervised Learning Supervised learning algorithms aretrained using labeled data. Unsupervised learning algorithmsare trained using unlabeled data. Supervised learning model takesdirect feedback to check if it is predicting correct output or not. Unsupervised learning model doesnot take any feedback. Supervised learning model predictsthe output. Unsupervised learning model findsthe hidden patterns in data. Supervised learning needs supervision to train the model. Unsupervised learning does not needany supervision to train the model. Supervised learning can becategorized in Classification and Regression problems. Unsupervised Learning can beclassified in Clustering and Associations problems. Supervised learning can be used for those cases where we know theinput as well as corresponding outputs. Unsupervised learning can be used for those cases where we have onlyinput data and no corresponding output data. Supervised learning model produces an accurate result. Unsupervised learning model may give less accurate result as compared to supervised learning. It includes various algorithms such It includes various algorithms such
  • 8. Advantages of Unsupervised Learning • Unsupervised learning is used for more complex tasks as compared to supervised learning because, • in unsupervised learning, we don't have labeled input data. • Unsupervised learning is preferable as • it is easy to get unlabeled data in comparison to labeled data. AALIM MUHAMMED SALEGH COLLEGE OF ENGINEERING DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING SEMESTER – VIII PROFESSIONAL ELECTIVE – IV CS8080- INFORMATION RETRIEVAL TECHNIQUES
  • 9. Disadvantages of Unsupervised Learning • Unsupervised learning is • intrinsically more difficult than supervised learning as it does not have corresponding output. • The result of the unsupervised learning algorithm might be • less accurate as input data is not labeled, and algorithms do not know the exact output in advance. AALIM MUHAMMED SALEGH COLLEGE OF ENGINEERING DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING SEMESTER – VIII PROFESSIONAL ELECTIVE – IV CS8080- INFORMATION RETRIEVAL TECHNIQUES
  • 10. Any Questions? AALIM MUHAMMED SALEGH COLLEGE OF ENGINEERING DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING SEMESTER – VIII PROFESSIONAL ELECTIVE – IV CS8080- INFORMATION RETRIEVAL TECHNIQUES