SlideShare a Scribd company logo
2
Most read
3
Most read
4
Most read
Semi Supervised Learning
Devansh
Context
Supervised learning is amazing. It has been used to solve a whole bunch of
problems.
It is also very intuitive when it comes to the idea. With lots of data and a directed
learning mechanism, it makes sense that our agents should be able to learn
specific things.
Labeling/annotating data can get extremely expensive. For eg. medical images
are typically very expensive to annotate (check out the video on SinGAN-Seg for
more details).
There is a lot of unlabeled data that is lying around, which can be used for cheap.
Enter Semi-Supervised Learning (Prequel)
Don’t confuse this with Self-Supervised Learning (I make that mistake a lot.)
How can we use the simpler concept and higher performance of Supervised
Learning with the cheap, large scale data availability of using Unsupervised
Learning?
That’s a good question. How would you try to use both processes to leverage the
strengths of both types (pause the video and think for a second)
An Image to help you think
SSL: Overview
We take a (relatively) small amount of labelled
data. Use that to learn.
Then take the unlabeled data use your model
to figure out the (pseudo) labels. Use that to
train our models further.
(Optional)
Rinse and Repeat. Do some other things like
add noise if you have the resources/want to
get creative
Transduction vs Induction
Often you will see learning presented as
either transductive or inductive learning.
Transductive learning: I give you specific
examples and ask you to predict specific
examples.
Inductive learning: I give you examples, and
you figure out a pattern. Use that pattern to
predict samples.
Sound similar? Think back to the video on
discriminative vs generative classification.
Assumptions used in Semi Supervised Learning
Take a second to pause. Think about the kinds of assumptions you make when
implementing SSL. This is a good habit when dealing with ML/Data.
Understanding your implicit assumptions gives you a way to figure out how to
improve.
SSL algorithms have 3 assumptions. To use SSL we need to make atleast one:
● Continuity assumption
● Cluster assumption
● Manifold assumption
Assumptions- Continuity
Wikipedia: Points that are close to each other are more likely to share a label. This
is also generally assumed in supervised learning and yields a preference for
geometrically simple decision boundaries. In the case of semi-supervised learning,
the smoothness assumption additionally yields a preference for decision
boundaries in low-density regions, so few points are close to each other but in
different classes.
Assumptions: Cluster
Wikipedia: The data tend to form discrete clusters, and points in the same cluster
are more likely to share a label (although data that shares a label may spread
across multiple clusters).
Gives rise to feature learning with clustering algorithms.
Assumptions: Manifold
The data lie approximately on a manifold of much lower dimension than the input
space. Learning the manifold using both the labeled and unlabeled data can avoid
the curse of dimensionality. Then learning can proceed using distances and
densities defined on the manifold.
SSL-> Most human way to learn?
Semi Supervised Learning is often compared to how we humans learn.
Think back to your favorite skills. How did you learn to do them? The SSL formula:
1) Have the learner train through specific guided examples with lots of detail.
2) As the learner becomes more proficient, we start using mixing in samples that
aren’t fully worked out. Learner must work through them on their own.
3) If we really want to check proficiency of the learner, we throw in
incorrect/broken examples, and have the student correct the mistakes.
Let’s see how this squares with our learning process. Let’s take 3 skills I learnt:
Math, French, and Brazilian JiuJitsu.
Reach out to me
Check out my other articles on Medium. : https://machine-learning-made-
simple.medium.com/
My YouTube: https://rb.gy/88iwdd
Reach out to me on LinkedIn: https://www.linkedin.com/in/devansh-devansh-516004168/
My Instagram: https://rb.gy/gmvuy9
My Twitter: https://twitter.com/Machine01776819
My Substack: https://devanshacc.substack.com/
Live conversations at twitch here: https://rb.gy/zlhk9y
Get a free stock on Robinhood: https://join.robinhood.com/fnud75

More Related Content

PPTX
Deep learning presentation
PPTX
What Is Deep Learning? | Introduction to Deep Learning | Deep Learning Tutori...
PPTX
Semi-Supervised Learning
PDF
Deep Learning - Convolutional Neural Networks
PDF
Self-supervised Learning Lecture Note
PDF
Feature Engineering
PPT
Machine learning
PDF
What is Machine Learning | Introduction to Machine Learning | Machine Learnin...
Deep learning presentation
What Is Deep Learning? | Introduction to Deep Learning | Deep Learning Tutori...
Semi-Supervised Learning
Deep Learning - Convolutional Neural Networks
Self-supervised Learning Lecture Note
Feature Engineering
Machine learning
What is Machine Learning | Introduction to Machine Learning | Machine Learnin...

What's hot (20)

PDF
Training Neural Networks
PPTX
Deep neural networks
PDF
Case study-the next gen pos
PPTX
States, state graphs and transition testing
PPTX
Image classification using cnn
PPTX
Optimization in Deep Learning
PPTX
Deep belief network.pptx
PPTX
Machine learning ppt.
PDF
Loan approval prediction based on machine learning approach
PPTX
Machine learning clustering
PPTX
Transfer learning-presentation
PPTX
lazy learners and other classication methods
PPTX
Deep learning.pptx
PPT
Perception
PPTX
PROJECT FINAL PPT
PPT
Distributed Deadlock Detection.ppt
PPTX
Data Mining: Graph mining and social network analysis
PPTX
Deep learning
PPTX
Introduction to Deep learning
Training Neural Networks
Deep neural networks
Case study-the next gen pos
States, state graphs and transition testing
Image classification using cnn
Optimization in Deep Learning
Deep belief network.pptx
Machine learning ppt.
Loan approval prediction based on machine learning approach
Machine learning clustering
Transfer learning-presentation
lazy learners and other classication methods
Deep learning.pptx
Perception
PROJECT FINAL PPT
Distributed Deadlock Detection.ppt
Data Mining: Graph mining and social network analysis
Deep learning
Introduction to Deep learning
Ad

Similar to Semi supervised learning machine learning made simple (20)

PPTX
Semi-supervised Learning Survey - 20 years of evaluation
PPTX
Deep Semi-supervised Learning methods
PDF
On Semi-Supervised Learning and Beyond
PDF
Introduction to self-Supervised learning - kuliah machine learning STEI ITB
PDF
M4L18 Unsupervised and Semi-Supervised Learning - Slides v2.pdf
PDF
Reducing Labeling Costs in Sentiment Analysis via Semi-Supervised Learning
PDF
Unsupervised Learning (D2L6 2017 UPC Deep Learning for Computer Vision)
PDF
Deep Learning for Computer Vision: Unsupervised Learning (UPC 2016)
PDF
Unsupervised Deep Learning (D2L1 Insight@DCU Machine Learning Workshop 2017)
PPTX
Predicting More from Less: Synergies of Learning
PDF
Federated Semi-Supervised Learning with Inter-Client Consistency & Disjoint L...
PDF
Semi-supervised Machine Learning
PPTX
Deep Learning via Semi-Supervised Embedding (第 7 回 Deep Learning 勉強会資料; 大澤)
PDF
(Machine)Learning with limited labels(Machine)Learning with limited labels(Ma...
PPT
learning.ppt
PPTX
Conceptual-introduction-to-Machine-Learning-Supervised-Unsupervised-and-Semi-...
PDF
Lect 8 learning types (M.L.).pdf
PPTX
100-Concepts-of-AI By Anupama Kate .pptx
PDF
An Introduction to Machine Learning
PDF
Self-supervised Visual Learning 2020 - Xavier Giro-i-Nieto - UPC Barcelona
Semi-supervised Learning Survey - 20 years of evaluation
Deep Semi-supervised Learning methods
On Semi-Supervised Learning and Beyond
Introduction to self-Supervised learning - kuliah machine learning STEI ITB
M4L18 Unsupervised and Semi-Supervised Learning - Slides v2.pdf
Reducing Labeling Costs in Sentiment Analysis via Semi-Supervised Learning
Unsupervised Learning (D2L6 2017 UPC Deep Learning for Computer Vision)
Deep Learning for Computer Vision: Unsupervised Learning (UPC 2016)
Unsupervised Deep Learning (D2L1 Insight@DCU Machine Learning Workshop 2017)
Predicting More from Less: Synergies of Learning
Federated Semi-Supervised Learning with Inter-Client Consistency & Disjoint L...
Semi-supervised Machine Learning
Deep Learning via Semi-Supervised Embedding (第 7 回 Deep Learning 勉強会資料; 大澤)
(Machine)Learning with limited labels(Machine)Learning with limited labels(Ma...
learning.ppt
Conceptual-introduction-to-Machine-Learning-Supervised-Unsupervised-and-Semi-...
Lect 8 learning types (M.L.).pdf
100-Concepts-of-AI By Anupama Kate .pptx
An Introduction to Machine Learning
Self-supervised Visual Learning 2020 - Xavier Giro-i-Nieto - UPC Barcelona
Ad

More from Devansh16 (14)

PDF
Spine net learning scale permuted backbone for recognition and localization
PPTX
Sigmoid function machine learning made simple
PDF
Accounting for variance in machine learning benchmarks
PDF
When deep learners change their mind learning dynamics for active learning
PDF
Paper Annotated: SinGAN-Seg: Synthetic Training Data Generation for Medical I...
PDF
A simple framework for contrastive learning of visual representations
PPTX
Recurrence relationships
PDF
Paper Explained: Deep learning framework for measuring the digital strategy o...
PDF
Paper Explained: One Pixel Attack for Fooling Deep Neural Networks
PDF
Paper Explained: Understanding the wiring evolution in differentiable neural ...
PPTX
Machine Learning Made Simple: Differential evolution
PDF
Paper Explained: RandAugment: Practical automated data augmentation with a re...
PDF
Noisy student images
PDF
Deep learning ensembles loss landscape
Spine net learning scale permuted backbone for recognition and localization
Sigmoid function machine learning made simple
Accounting for variance in machine learning benchmarks
When deep learners change their mind learning dynamics for active learning
Paper Annotated: SinGAN-Seg: Synthetic Training Data Generation for Medical I...
A simple framework for contrastive learning of visual representations
Recurrence relationships
Paper Explained: Deep learning framework for measuring the digital strategy o...
Paper Explained: One Pixel Attack for Fooling Deep Neural Networks
Paper Explained: Understanding the wiring evolution in differentiable neural ...
Machine Learning Made Simple: Differential evolution
Paper Explained: RandAugment: Practical automated data augmentation with a re...
Noisy student images
Deep learning ensembles loss landscape

Recently uploaded (20)

PDF
Global Data and Analytics Market Outlook Report
PDF
Introduction to the R Programming Language
PPTX
importance of Data-Visualization-in-Data-Science. for mba studnts
PDF
Introduction to Data Science and Data Analysis
PPTX
Topic 5 Presentation 5 Lesson 5 Corporate Fin
PPTX
Database Infoormation System (DBIS).pptx
PDF
OneRead_20250728_1808.pdfhdhddhshahwhwwjjaaja
PPTX
New ISO 27001_2022 standard and the changes
PDF
Votre score augmente si vous choisissez une catégorie et que vous rédigez une...
PDF
Optimise Shopper Experiences with a Strong Data Estate.pdf
PPTX
SAP 2 completion done . PRESENTATION.pptx
PPTX
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
PPTX
CYBER SECURITY the Next Warefare Tactics
PDF
How to run a consulting project- client discovery
PPTX
Copy of 16 Timeline & Flowchart Templates – HubSpot.pptx
PPTX
sac 451hinhgsgshssjsjsjheegdggeegegdggddgeg.pptx
PPTX
A Complete Guide to Streamlining Business Processes
PDF
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
PPTX
IMPACT OF LANDSLIDE.....................
PPTX
Managing Community Partner Relationships
Global Data and Analytics Market Outlook Report
Introduction to the R Programming Language
importance of Data-Visualization-in-Data-Science. for mba studnts
Introduction to Data Science and Data Analysis
Topic 5 Presentation 5 Lesson 5 Corporate Fin
Database Infoormation System (DBIS).pptx
OneRead_20250728_1808.pdfhdhddhshahwhwwjjaaja
New ISO 27001_2022 standard and the changes
Votre score augmente si vous choisissez une catégorie et que vous rédigez une...
Optimise Shopper Experiences with a Strong Data Estate.pdf
SAP 2 completion done . PRESENTATION.pptx
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
CYBER SECURITY the Next Warefare Tactics
How to run a consulting project- client discovery
Copy of 16 Timeline & Flowchart Templates – HubSpot.pptx
sac 451hinhgsgshssjsjsjheegdggeegegdggddgeg.pptx
A Complete Guide to Streamlining Business Processes
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
IMPACT OF LANDSLIDE.....................
Managing Community Partner Relationships

Semi supervised learning machine learning made simple

  • 2. Context Supervised learning is amazing. It has been used to solve a whole bunch of problems. It is also very intuitive when it comes to the idea. With lots of data and a directed learning mechanism, it makes sense that our agents should be able to learn specific things. Labeling/annotating data can get extremely expensive. For eg. medical images are typically very expensive to annotate (check out the video on SinGAN-Seg for more details). There is a lot of unlabeled data that is lying around, which can be used for cheap.
  • 3. Enter Semi-Supervised Learning (Prequel) Don’t confuse this with Self-Supervised Learning (I make that mistake a lot.) How can we use the simpler concept and higher performance of Supervised Learning with the cheap, large scale data availability of using Unsupervised Learning? That’s a good question. How would you try to use both processes to leverage the strengths of both types (pause the video and think for a second)
  • 4. An Image to help you think
  • 5. SSL: Overview We take a (relatively) small amount of labelled data. Use that to learn. Then take the unlabeled data use your model to figure out the (pseudo) labels. Use that to train our models further. (Optional) Rinse and Repeat. Do some other things like add noise if you have the resources/want to get creative
  • 6. Transduction vs Induction Often you will see learning presented as either transductive or inductive learning. Transductive learning: I give you specific examples and ask you to predict specific examples. Inductive learning: I give you examples, and you figure out a pattern. Use that pattern to predict samples. Sound similar? Think back to the video on discriminative vs generative classification.
  • 7. Assumptions used in Semi Supervised Learning Take a second to pause. Think about the kinds of assumptions you make when implementing SSL. This is a good habit when dealing with ML/Data. Understanding your implicit assumptions gives you a way to figure out how to improve. SSL algorithms have 3 assumptions. To use SSL we need to make atleast one: ● Continuity assumption ● Cluster assumption ● Manifold assumption
  • 8. Assumptions- Continuity Wikipedia: Points that are close to each other are more likely to share a label. This is also generally assumed in supervised learning and yields a preference for geometrically simple decision boundaries. In the case of semi-supervised learning, the smoothness assumption additionally yields a preference for decision boundaries in low-density regions, so few points are close to each other but in different classes.
  • 9. Assumptions: Cluster Wikipedia: The data tend to form discrete clusters, and points in the same cluster are more likely to share a label (although data that shares a label may spread across multiple clusters). Gives rise to feature learning with clustering algorithms.
  • 10. Assumptions: Manifold The data lie approximately on a manifold of much lower dimension than the input space. Learning the manifold using both the labeled and unlabeled data can avoid the curse of dimensionality. Then learning can proceed using distances and densities defined on the manifold.
  • 11. SSL-> Most human way to learn? Semi Supervised Learning is often compared to how we humans learn. Think back to your favorite skills. How did you learn to do them? The SSL formula: 1) Have the learner train through specific guided examples with lots of detail. 2) As the learner becomes more proficient, we start using mixing in samples that aren’t fully worked out. Learner must work through them on their own. 3) If we really want to check proficiency of the learner, we throw in incorrect/broken examples, and have the student correct the mistakes. Let’s see how this squares with our learning process. Let’s take 3 skills I learnt: Math, French, and Brazilian JiuJitsu.
  • 12. Reach out to me Check out my other articles on Medium. : https://machine-learning-made- simple.medium.com/ My YouTube: https://rb.gy/88iwdd Reach out to me on LinkedIn: https://www.linkedin.com/in/devansh-devansh-516004168/ My Instagram: https://rb.gy/gmvuy9 My Twitter: https://twitter.com/Machine01776819 My Substack: https://devanshacc.substack.com/ Live conversations at twitch here: https://rb.gy/zlhk9y Get a free stock on Robinhood: https://join.robinhood.com/fnud75