SlideShare a Scribd company logo
4
Most read
6
Most read
12
Most read
Introduction to Semantic
Segmentation
Sébastien Taylor
V.P. of Research & Development
Au-Zone Technologies
• Introduction to segmentation
• Practical examples and applications
• Various types of segmentation
• Accuracy metrics
• Computational requirements
• Resources
Outline
2
© 2024 Au-Zone Technologies
• Image Segmentation is a process that subdivides
an image into its constituent parts or objects.
• Key task in computer vision and image
processing
• It can be formulated as a pixel classification
problem with three different approaches
(semantic, instance and panoptic)
Introduction to Segmentation
3
© 2024 Au-Zone Technologies
Image Segmentation vs. Object Detection
4
© 2024 Au-Zone Technologies
• Autonomous vehicles
• Smart agriculture
• Drones and aerial imaging
• Medical image diagnosis
• Image editing
• Dataset augmentation
Practical Examples
5
© 2024 Au-Zone Technologies
• Semantic segmentation: produces a contextual description of the “stuff” in the image. Classes are isolated but not
objects within the same class. We don’t have access to a single object.
• Instance segmentation: produces a better description that can list objects as individual instances of “things” but
lower generalization on the environment and background “stuff”.
• Panoptic segmentation: Combines semantic and instance segmentation. We have access to the environmental
context but also to the individual objects. So, we see both “stuff” and “things”.
Instance, Semantic and Panoptic Segmentation
6
Image Semantic Segmentation Instance Segmentation Panoptic Segmentation
© 2024 Au-Zone Technologies
Image Segmentation Using Deep Learning
7
• Deconstruction: Feature extraction (backbone, encoder)
• Reconstruction: Upsampler (decoder)
Deconstruction Reconstruction
© 2024 Au-Zone Technologies
Deep Learning Segmentation Architecture
8
© 2023 Au-Zone Technologies
Encoder Decoder
• 1-hot encoding, just like classification
• Score applied to each pixel
• Class with highest score sets the pixel
Semantic Segmentation Output
9
© 2024 Au-Zone Technologies
Instance Segmentation – Naïve
10
Detections
• Additional model output for
computing bounding boxes
• Same as SSD, YOLO, etc…
• Boxes are post-processed to re-
colour masks in order to
distinguish instances.
• Overlapping instances will be
poorly segmented because of box
limitations.
© 2024 Au-Zone Technologies
• Additional model output computes
per-instance mask predictions.
• Learns to separate objects in each
mask which are then fused with
semantic mask.
• Handles overlapping instances.
Instance Segmentation – Proto Masks
11
© 2024 Au-Zone Technologies
• Extension of detection models. Inherently instance based.
• Instead of predicting boxes for objects, the model predicts masks.
Instance Segmentation – Box Masks
12
© 2024 Au-Zone Technologies
Fusing semantic and instance segmentation to detect “things” and “stuff”
Panoptic Segmentation
13
© 2024 Au-Zone Technologies
• Label masks
• Object polygons
• Very high annotation effort
• “Segment Anything Model” has been
a game changer for annotation effort
Dataset Types
14
© 2024 Au-Zone Technologies
• Similar IoU concept as detection
• Panoptic Quality “PQ” is a new
metric and applied, in part, to all
segmentation challenges
• PQ metrics for “things” and “stuffs”
categories
• COCO metrics “Panoptic Evaluation”
Accuracy Metrics
15
© 2024 Au-Zone Technologies
• Same backbone as detection
• Segmentation head incurs ~20% overhead
• Post-processing demands
• Instance and panoptic incur additional overhead
Computational Requirements
16
© 2024 Au-Zone Technologies
• Semantic segmentation is a technique that enables us to isolate different
objects in an image along their contours.
• Improves on detection models for objects with more complex shapes.
• It can be considered an image classification task at a pixel level.
Conclusions
17
© 2024 Au-Zone Technologies
• Semantic segmentation classifies all pixels in an image by their class.
• Instance segmentation refines the semantic masks to separate each
object instance.
• Panoptic segmentation fuses semantic and instance segmentation into a
single unified model with knowledge of “things” and “stuff”.
Conclusions
18
© 2024 Au-Zone Technologies
• Datasets
• https://cocodataset.org/
• https://www.cityscapes-dataset.com/
• https://ai.facebook.com/datasets/segment-anything/
• Models
• https://towardsdatascience.com/u-net-explained-understanding-its-image-segmentation-
architecture-56e4842e313a
• https://learnopencv.com/yolov5-instance-segmentation/
• https://segment-anything.com/
Resources
19
© 2024 Au-Zone Technologies
20
Thank you! Questions?

More Related Content

PDF
“Introduction to Semantic Segmentation,” a Presentation from Au-Zone Technolo...
PPTX
SLICING AIDED HYPER INFERENCE AND FINE-TUNING FOR SMALL.pptx
PPTX
6. PRESENTATION REAL TIME OBJECT DETECTION.pptx
PPTX
presentation on Faster Yolo
PDF
Object Detetcion using SSD-MobileNet
PPTX
YOLO_review.pptxThis is a test document that is used to satisfy the requireme...
PDF
Introducción al Análisis y Diseño Orientado a Objetos
PDF
Object Detection An Overview
“Introduction to Semantic Segmentation,” a Presentation from Au-Zone Technolo...
SLICING AIDED HYPER INFERENCE AND FINE-TUNING FOR SMALL.pptx
6. PRESENTATION REAL TIME OBJECT DETECTION.pptx
presentation on Faster Yolo
Object Detetcion using SSD-MobileNet
YOLO_review.pptxThis is a test document that is used to satisfy the requireme...
Introducción al Análisis y Diseño Orientado a Objetos
Object Detection An Overview

Similar to “An Introduction to Semantic Segmentation,” a Presentation from Au-Zone Technologies (20)

PPTX
Object Tracking and Identification with OpenCV Using Histograms.pptx
PPTX
Object recognition
PPTX
Introduction to Object recognition
PDF
ObjectDetectionUsingMachineLearningandNeuralNetworks.pdf
PDF
“Modern Machine Vision from Basics to Advanced Deep Learning,” a Presentation...
PDF
IRJET - Direct Me-Nevigation for Blind People
PPT
OOSE Unit 3 PPT.ppt
PPTX
ppt - of a project will help you on your college projects
PPTX
Presentation1.pptx
PDF
Object tracking final
PDF
Object tracking presentation
PDF
Detection and Tracking of Objects: A Detailed Study
PDF
Visual Object Category Recognition
PDF
The deep learning technology on coco framework full report
PDF
ppt_ooad.pdf
PDF
Deep Learning for X ray Image to Text Generation
PDF
“Tools and Strategies for Quickly Building Effective Image Datasets,” a Prese...
PDF
Supervised Learning
 
PPTX
A-13 Iomp-1.pptx
PPTX
Computer vision-nit-silchar-hackathon
Object Tracking and Identification with OpenCV Using Histograms.pptx
Object recognition
Introduction to Object recognition
ObjectDetectionUsingMachineLearningandNeuralNetworks.pdf
“Modern Machine Vision from Basics to Advanced Deep Learning,” a Presentation...
IRJET - Direct Me-Nevigation for Blind People
OOSE Unit 3 PPT.ppt
ppt - of a project will help you on your college projects
Presentation1.pptx
Object tracking final
Object tracking presentation
Detection and Tracking of Objects: A Detailed Study
Visual Object Category Recognition
The deep learning technology on coco framework full report
ppt_ooad.pdf
Deep Learning for X ray Image to Text Generation
“Tools and Strategies for Quickly Building Effective Image Datasets,” a Prese...
Supervised Learning
 
A-13 Iomp-1.pptx
Computer vision-nit-silchar-hackathon
Ad

More from Edge AI and Vision Alliance (20)

PDF
“An Introduction to the MIPI CSI-2 Image Sensor Standard and Its Latest Advan...
PDF
“Visual Search: Fine-grained Recognition with Embedding Models for the Edge,”...
PDF
“Optimizing Real-time SLAM Performance for Autonomous Robots with GPU Acceler...
PDF
“LLMs and VLMs for Regulatory Compliance, Quality Control and Safety Applicat...
PDF
“Simplifying Portable Computer Vision with OpenVX 2.0,” a Presentation from AMD
PDF
“Quantization Techniques for Efficient Deployment of Large Language Models: A...
PDF
“Introduction to Data Types for AI: Trade-offs and Trends,” a Presentation fr...
PDF
“Introduction to Radar and Its Use for Machine Perception,” a Presentation fr...
PDF
“NPU IP Hardware Shaped Through Software and Use-case Analysis,” a Presentati...
PDF
“Voice Interfaces on a Budget: Building Real-time Speech Recognition on Low-c...
PDF
“Computer Vision at Sea: Automated Fish Tracking for Sustainable Fishing,” a ...
PDF
“Squinting Vision Pipelines: Detecting and Correcting Errors in Vision Models...
PDF
“ONNX and Python to C++: State-of-the-art Graph Compilation,” a Presentation ...
PDF
“Beyond the Demo: Turning Computer Vision Prototypes into Scalable, Cost-effe...
PDF
“Running Accelerated CNNs on Low-power Microcontrollers Using Arm Ethos-U55, ...
PDF
“Scaling i.MX Applications Processors’ Native Edge AI with Discrete AI Accele...
PDF
“A Re-imagination of Embedded Vision System Design,” a Presentation from Imag...
PDF
“MPU+: A Transformative Solution for Next-Gen AI at the Edge,” a Presentation...
PDF
“Evolving Inference Processor Software Stacks to Support LLMs,” a Presentatio...
PDF
“Efficiently Registering Depth and RGB Images,” a Presentation from eInfochips
“An Introduction to the MIPI CSI-2 Image Sensor Standard and Its Latest Advan...
“Visual Search: Fine-grained Recognition with Embedding Models for the Edge,”...
“Optimizing Real-time SLAM Performance for Autonomous Robots with GPU Acceler...
“LLMs and VLMs for Regulatory Compliance, Quality Control and Safety Applicat...
“Simplifying Portable Computer Vision with OpenVX 2.0,” a Presentation from AMD
“Quantization Techniques for Efficient Deployment of Large Language Models: A...
“Introduction to Data Types for AI: Trade-offs and Trends,” a Presentation fr...
“Introduction to Radar and Its Use for Machine Perception,” a Presentation fr...
“NPU IP Hardware Shaped Through Software and Use-case Analysis,” a Presentati...
“Voice Interfaces on a Budget: Building Real-time Speech Recognition on Low-c...
“Computer Vision at Sea: Automated Fish Tracking for Sustainable Fishing,” a ...
“Squinting Vision Pipelines: Detecting and Correcting Errors in Vision Models...
“ONNX and Python to C++: State-of-the-art Graph Compilation,” a Presentation ...
“Beyond the Demo: Turning Computer Vision Prototypes into Scalable, Cost-effe...
“Running Accelerated CNNs on Low-power Microcontrollers Using Arm Ethos-U55, ...
“Scaling i.MX Applications Processors’ Native Edge AI with Discrete AI Accele...
“A Re-imagination of Embedded Vision System Design,” a Presentation from Imag...
“MPU+: A Transformative Solution for Next-Gen AI at the Edge,” a Presentation...
“Evolving Inference Processor Software Stacks to Support LLMs,” a Presentatio...
“Efficiently Registering Depth and RGB Images,” a Presentation from eInfochips
Ad

Recently uploaded (20)

PDF
MIND Revenue Release Quarter 2 2025 Press Release
PPTX
MYSQL Presentation for SQL database connectivity
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PDF
cuic standard and advanced reporting.pdf
PPTX
A Presentation on Artificial Intelligence
PPTX
Big Data Technologies - Introduction.pptx
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PPTX
Spectroscopy.pptx food analysis technology
PDF
Encapsulation theory and applications.pdf
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
Empathic Computing: Creating Shared Understanding
PDF
Assigned Numbers - 2025 - Bluetooth® Document
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PDF
NewMind AI Weekly Chronicles - August'25-Week II
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PPTX
SOPHOS-XG Firewall Administrator PPT.pptx
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
MIND Revenue Release Quarter 2 2025 Press Release
MYSQL Presentation for SQL database connectivity
Per capita expenditure prediction using model stacking based on satellite ima...
cuic standard and advanced reporting.pdf
A Presentation on Artificial Intelligence
Big Data Technologies - Introduction.pptx
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
Dropbox Q2 2025 Financial Results & Investor Presentation
Spectroscopy.pptx food analysis technology
Encapsulation theory and applications.pdf
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Empathic Computing: Creating Shared Understanding
Assigned Numbers - 2025 - Bluetooth® Document
Reach Out and Touch Someone: Haptics and Empathic Computing
NewMind AI Weekly Chronicles - August'25-Week II
Advanced methodologies resolving dimensionality complications for autism neur...
Digital-Transformation-Roadmap-for-Companies.pptx
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
SOPHOS-XG Firewall Administrator PPT.pptx
Agricultural_Statistics_at_a_Glance_2022_0.pdf

“An Introduction to Semantic Segmentation,” a Presentation from Au-Zone Technologies

  • 1. Introduction to Semantic Segmentation Sébastien Taylor V.P. of Research & Development Au-Zone Technologies
  • 2. • Introduction to segmentation • Practical examples and applications • Various types of segmentation • Accuracy metrics • Computational requirements • Resources Outline 2 © 2024 Au-Zone Technologies
  • 3. • Image Segmentation is a process that subdivides an image into its constituent parts or objects. • Key task in computer vision and image processing • It can be formulated as a pixel classification problem with three different approaches (semantic, instance and panoptic) Introduction to Segmentation 3 © 2024 Au-Zone Technologies
  • 4. Image Segmentation vs. Object Detection 4 © 2024 Au-Zone Technologies
  • 5. • Autonomous vehicles • Smart agriculture • Drones and aerial imaging • Medical image diagnosis • Image editing • Dataset augmentation Practical Examples 5 © 2024 Au-Zone Technologies
  • 6. • Semantic segmentation: produces a contextual description of the “stuff” in the image. Classes are isolated but not objects within the same class. We don’t have access to a single object. • Instance segmentation: produces a better description that can list objects as individual instances of “things” but lower generalization on the environment and background “stuff”. • Panoptic segmentation: Combines semantic and instance segmentation. We have access to the environmental context but also to the individual objects. So, we see both “stuff” and “things”. Instance, Semantic and Panoptic Segmentation 6 Image Semantic Segmentation Instance Segmentation Panoptic Segmentation © 2024 Au-Zone Technologies
  • 7. Image Segmentation Using Deep Learning 7 • Deconstruction: Feature extraction (backbone, encoder) • Reconstruction: Upsampler (decoder) Deconstruction Reconstruction © 2024 Au-Zone Technologies
  • 8. Deep Learning Segmentation Architecture 8 © 2023 Au-Zone Technologies Encoder Decoder
  • 9. • 1-hot encoding, just like classification • Score applied to each pixel • Class with highest score sets the pixel Semantic Segmentation Output 9 © 2024 Au-Zone Technologies
  • 10. Instance Segmentation – Naïve 10 Detections • Additional model output for computing bounding boxes • Same as SSD, YOLO, etc… • Boxes are post-processed to re- colour masks in order to distinguish instances. • Overlapping instances will be poorly segmented because of box limitations. © 2024 Au-Zone Technologies
  • 11. • Additional model output computes per-instance mask predictions. • Learns to separate objects in each mask which are then fused with semantic mask. • Handles overlapping instances. Instance Segmentation – Proto Masks 11 © 2024 Au-Zone Technologies
  • 12. • Extension of detection models. Inherently instance based. • Instead of predicting boxes for objects, the model predicts masks. Instance Segmentation – Box Masks 12 © 2024 Au-Zone Technologies
  • 13. Fusing semantic and instance segmentation to detect “things” and “stuff” Panoptic Segmentation 13 © 2024 Au-Zone Technologies
  • 14. • Label masks • Object polygons • Very high annotation effort • “Segment Anything Model” has been a game changer for annotation effort Dataset Types 14 © 2024 Au-Zone Technologies
  • 15. • Similar IoU concept as detection • Panoptic Quality “PQ” is a new metric and applied, in part, to all segmentation challenges • PQ metrics for “things” and “stuffs” categories • COCO metrics “Panoptic Evaluation” Accuracy Metrics 15 © 2024 Au-Zone Technologies
  • 16. • Same backbone as detection • Segmentation head incurs ~20% overhead • Post-processing demands • Instance and panoptic incur additional overhead Computational Requirements 16 © 2024 Au-Zone Technologies
  • 17. • Semantic segmentation is a technique that enables us to isolate different objects in an image along their contours. • Improves on detection models for objects with more complex shapes. • It can be considered an image classification task at a pixel level. Conclusions 17 © 2024 Au-Zone Technologies
  • 18. • Semantic segmentation classifies all pixels in an image by their class. • Instance segmentation refines the semantic masks to separate each object instance. • Panoptic segmentation fuses semantic and instance segmentation into a single unified model with knowledge of “things” and “stuff”. Conclusions 18 © 2024 Au-Zone Technologies
  • 19. • Datasets • https://cocodataset.org/ • https://www.cityscapes-dataset.com/ • https://ai.facebook.com/datasets/segment-anything/ • Models • https://towardsdatascience.com/u-net-explained-understanding-its-image-segmentation- architecture-56e4842e313a • https://learnopencv.com/yolov5-instance-segmentation/ • https://segment-anything.com/ Resources 19 © 2024 Au-Zone Technologies