SlideShare a Scribd company logo
Intro to Semantic Segmentation Using Deep Learning
================================================================
Semantic segmentation is the task of classifying each and every pixel in an image
into a class as shown in the image below. Here you can see that all persons are red,
the road is purple, the vehicles are blue, street signs are yellow etc.
Semantic segmentation is different from instance segmentation which is that
different objects of the same class will have different labels as in person1, person2
and hence different colours. The picture below very crisply illustrates the difference
between instance and semantic segmentation. If you are interested in learning more
about classification and object detection, please check out my blog here.
One important question can be why do we need this granularity of understanding
pixel by pixel location?
Some examples that come to mind are:
i) Self Driving Cars — May need to know exactly where another car is on the road or
the location of a human crossing the road
ii) Robotic systems — Robots that say join two parts together will perform better if
they know the exact locations of the two parts
iii) Damage Detection - It may be important in this case to know the exact extent of
damage
Deep Learning Model Architectures for Semantic Segmentation
Lets now talk about 3 model architectures that do semantic segmentation.
1. Fully Convolutional Network (FCN)
FCN is a popular algorithm for doing semantic segmentation. This model uses
various blocks of convolution and max pool layers to first decompress an image to
1/32th of its original size. It then makes a class prediction at this level of granularity.
Finally it uses up sampling and deconvolution layers to resize the image to its
original dimensions.
These models typically don't have any fully connected layers. The goal of down
sampling steps is to capture semantic/contextual information while the goal of up
sampling is to recover spatial information. Also there are no limitations on image
size. The final image is the same size as the original image. To fully recover the fine
grained spatial information lost in down sampling, skip connections are used. A skip
connection is a connection that bypasses at least one layer. Here it is used to pass
information from the down sampling step to the up sampling step. Merging features
from various resolution levels helps combining context information with spatial
information
Contacts Us:-
Address: - 110 Fontainbleau Drive, Toronto
Telephone: - 647-550-0256
Email: - deeplearning33@gmail.com

More Related Content

PDF
International Refereed Journal of Engineering and Science (IRJES)
PPTX
Mujungi Davis
PPT
Cristopher M. Bishop's tutorial on graphical models
PDF
Mayer.r aziz.t.a rauber.a:visualising class distribution on som_ie colour and...
PPTX
Sunbelt 2013 Presentation
PPTX
An Efficient Arabic Text Spotting from Natural Scenes Images
PDF
Automatic digital terrain modelling
PDF
Free-scale Magnification for Single-Pixel-Width Alphabetic Typeface Characters
International Refereed Journal of Engineering and Science (IRJES)
Mujungi Davis
Cristopher M. Bishop's tutorial on graphical models
Mayer.r aziz.t.a rauber.a:visualising class distribution on som_ie colour and...
Sunbelt 2013 Presentation
An Efficient Arabic Text Spotting from Natural Scenes Images
Automatic digital terrain modelling
Free-scale Magnification for Single-Pixel-Width Alphabetic Typeface Characters

What's hot (12)

PDF
NEW ONTOLOGY RETRIEVAL IMAGE METHOD IN 5K COREL IMAGES
DOC
SchuurmansLecture.doc
PPTX
SCENE TEXT RECOGNITION IN MOBILE APPLICATION BY CHARACTER DESCRIPTOR AND STRU...
DOCX
Multiview alignment hashing for
PPT
Rafi Zachut's slides on class specific segmentation
PDF
Nips 2016 tutorial generative adversarial networks review
PDF
Unimodal Multi-Feature Fusion and one-dimensional Hidden Markov Models for Lo...
PDF
IEEE PROJECT TOPICS &ABSTRACTS on image processing
PPTX
Static Spatial Graph Features
PPT
A Review of Relational Machine Learning(SRL) for Knowledge Graphs
DOCX
Bt9301 computer graphics
PDF
A MULTI-STREAM HMM APPROACH TO OFFLINE HANDWRITTEN ARABIC WORD RECOGNITION
NEW ONTOLOGY RETRIEVAL IMAGE METHOD IN 5K COREL IMAGES
SchuurmansLecture.doc
SCENE TEXT RECOGNITION IN MOBILE APPLICATION BY CHARACTER DESCRIPTOR AND STRU...
Multiview alignment hashing for
Rafi Zachut's slides on class specific segmentation
Nips 2016 tutorial generative adversarial networks review
Unimodal Multi-Feature Fusion and one-dimensional Hidden Markov Models for Lo...
IEEE PROJECT TOPICS &ABSTRACTS on image processing
Static Spatial Graph Features
A Review of Relational Machine Learning(SRL) for Knowledge Graphs
Bt9301 computer graphics
A MULTI-STREAM HMM APPROACH TO OFFLINE HANDWRITTEN ARABIC WORD RECOGNITION
Ad

Similar to Intro to Semantic Segmentation Using Deep Learning (20)

PPTX
Deep Computer Vision - 1.pptx
PPT
What is Panoptic Segmentation and advantages.ppt
PDF
Feature Extraction for Image Classification and Analysis with Ant Colony Opti...
PPTX
Traffic Automation System
PPTX
Introduction to Segmentation in Computer vision
PPTX
AaSeminar_Template.pptx
PDF
C1104011322
PPTX
Introduction to Convolutional Neural Networks
PDF
MULTI-LEVEL FEATURE FUSION BASED TRANSFER LEARNING FOR PERSON RE-IDENTIFICATION
PPTX
image segmentation, artificial intelegience and ml
PDF
(DL輪読)Matching Networks for One Shot Learning
PDF
Report
PDF
A NEW VISUAL CRYPTOGRAPHY TECHNIQUE FOR COLOR IMAGES
PDF
Chapter 3 Deep Learning architectures.pdf
DOCX
Deep Neural Network DNN.docx
PPT
Segmentation
PDF
Vehicle Tracking Using Kalman Filter and Features
PDF
Image Classification and Annotation Using Deep Learning
PDF
PERFORMANCE EVALUATION OF ONTOLOGY AND FUZZYBASE CBIR
PDF
Performance Evaluation Of Ontology And Fuzzybase Cbir
Deep Computer Vision - 1.pptx
What is Panoptic Segmentation and advantages.ppt
Feature Extraction for Image Classification and Analysis with Ant Colony Opti...
Traffic Automation System
Introduction to Segmentation in Computer vision
AaSeminar_Template.pptx
C1104011322
Introduction to Convolutional Neural Networks
MULTI-LEVEL FEATURE FUSION BASED TRANSFER LEARNING FOR PERSON RE-IDENTIFICATION
image segmentation, artificial intelegience and ml
(DL輪読)Matching Networks for One Shot Learning
Report
A NEW VISUAL CRYPTOGRAPHY TECHNIQUE FOR COLOR IMAGES
Chapter 3 Deep Learning architectures.pdf
Deep Neural Network DNN.docx
Segmentation
Vehicle Tracking Using Kalman Filter and Features
Image Classification and Annotation Using Deep Learning
PERFORMANCE EVALUATION OF ONTOLOGY AND FUZZYBASE CBIR
Performance Evaluation Of Ontology And Fuzzybase Cbir
Ad

Recently uploaded (20)

PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PDF
Approach and Philosophy of On baking technology
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PDF
CIFDAQ's Market Insight: SEC Turns Pro Crypto
PPTX
Cloud computing and distributed systems.
DOCX
The AUB Centre for AI in Media Proposal.docx
PPTX
A Presentation on Artificial Intelligence
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PDF
cuic standard and advanced reporting.pdf
PDF
NewMind AI Monthly Chronicles - July 2025
PDF
Network Security Unit 5.pdf for BCA BBA.
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PDF
Empathic Computing: Creating Shared Understanding
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PDF
KodekX | Application Modernization Development
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
Building Integrated photovoltaic BIPV_UPV.pdf
Approach and Philosophy of On baking technology
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
CIFDAQ's Market Insight: SEC Turns Pro Crypto
Cloud computing and distributed systems.
The AUB Centre for AI in Media Proposal.docx
A Presentation on Artificial Intelligence
Per capita expenditure prediction using model stacking based on satellite ima...
cuic standard and advanced reporting.pdf
NewMind AI Monthly Chronicles - July 2025
Network Security Unit 5.pdf for BCA BBA.
Agricultural_Statistics_at_a_Glance_2022_0.pdf
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
Empathic Computing: Creating Shared Understanding
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
KodekX | Application Modernization Development

Intro to Semantic Segmentation Using Deep Learning

  • 1. Intro to Semantic Segmentation Using Deep Learning ================================================================ Semantic segmentation is the task of classifying each and every pixel in an image into a class as shown in the image below. Here you can see that all persons are red, the road is purple, the vehicles are blue, street signs are yellow etc. Semantic segmentation is different from instance segmentation which is that different objects of the same class will have different labels as in person1, person2 and hence different colours. The picture below very crisply illustrates the difference between instance and semantic segmentation. If you are interested in learning more about classification and object detection, please check out my blog here.
  • 2. One important question can be why do we need this granularity of understanding pixel by pixel location? Some examples that come to mind are: i) Self Driving Cars — May need to know exactly where another car is on the road or the location of a human crossing the road ii) Robotic systems — Robots that say join two parts together will perform better if they know the exact locations of the two parts iii) Damage Detection - It may be important in this case to know the exact extent of damage Deep Learning Model Architectures for Semantic Segmentation Lets now talk about 3 model architectures that do semantic segmentation. 1. Fully Convolutional Network (FCN) FCN is a popular algorithm for doing semantic segmentation. This model uses various blocks of convolution and max pool layers to first decompress an image to 1/32th of its original size. It then makes a class prediction at this level of granularity. Finally it uses up sampling and deconvolution layers to resize the image to its original dimensions. These models typically don't have any fully connected layers. The goal of down sampling steps is to capture semantic/contextual information while the goal of up sampling is to recover spatial information. Also there are no limitations on image size. The final image is the same size as the original image. To fully recover the fine grained spatial information lost in down sampling, skip connections are used. A skip
  • 3. connection is a connection that bypasses at least one layer. Here it is used to pass information from the down sampling step to the up sampling step. Merging features from various resolution levels helps combining context information with spatial information Contacts Us:- Address: - 110 Fontainbleau Drive, Toronto Telephone: - 647-550-0256 Email: - deeplearning33@gmail.com