SlideShare a Scribd company logo
Unsupervised Correction of Training Labels
project taken under program
Presented by:
Charmi Chokshi
Final year BTech student of
Information and Communication Technology
Ahmedabad University
Duration: May 31, 2018 till July End
Guided by:
Mr. Pankaj Bodani (Scientist-SE)
Space Application Centre-ISRO
Outline
● Objectives
● Introduction
● Methodology
● Unsupervised Sementic Segmentation
Algorithms
● Discussion
● Region Growing Algorithm
● Conclusion
● Future work
● References
2
Objective
● To work on the preprocessing (Data Cleaning) step of the image
segmentation problem using Deep Learning
● To create accurate Training Labels as the input to Neural Network
● To investigate/compare the use of different unsupervised image
segmentation techniques for boundary correction
3
Introduction
4
What is Semantic Segmentation?
● In computer vision, semantic segmentation is the process of partitioning a
digital image into multiple meaningful segments
● Semantic segmentation is typically used to locate objects and boundaries in
images.
● More precisely, it is the process of assigning a label to every pixel in an image
such that pixels with the same label share certain characteristics
5
Application of Image Segmentation
● Driverless car
● Medical imaging
● Object detection
○ Face detection
○ Pedestrian detection
○ Brake light detection
○ Locate objects in satellite images (roads, forests, crops, water bodies, etc.)
● Recognition Tasks
● Traffic control systems
6
What is Unsupervised Sementic Segmentation?
● Unsupervised = Learning without the help of teacher!
● No Labeled Training Data available for the model to learn from
● The goal of this unsupervised machine learning technique is to find
similarities in the data point and group similar data points together which
will give us insight into underlying patterns of different groups
● “Clustering” is the process of grouping similar entities together
Supervised Segmentation
Input: Raw image, Labelled Image
Output: Segmented Image
Unsupervised Segmentation
Input: Raw image
Output: Segmented Image
7
Methodology
8
Methodology
Methodology consist of following 2 stages:
1. Unsupervised Sementic Segmentation using Clustering algorithms
2. Post-processing using Seed based Region Growing algorithm
Input Data
● Satellite: IRS Resourcesat–2
● Sensor: LISS-IV
● Spatial Resolution: 5.8 m
● No of Bands: 3
● Bit depth: 16 – 10 bit quantization (1024 different tones can be
assigned to a pixel)
● Size: More than 3000 X 3000
● Cities: Vadodara, Jabalpur, Sagar, Satana, Ujjain, Varanasi, Rampur 9
Input Images
Vadodara Varanasi Ujjain
10
Raw and Manually Labelled Images
11
Raw and Manually Labelled Images
12
Methodology (step-1 of 2)
13
DBSCAN
● DBSCAN: Density-based spatial clustering of
applications with noise
● Its clusters are defined as areas of higher density than
the remainder of the data set
● It clusters water bodies accurately in our dataset
● DBSCAN(eps=3.5, min_samples=5, metric=’euclidean’, n_jobs=1)
● eps: The maximum distance between two samples for them to be
considered as in the same neighborhood
● min_samples: The number of samples in a neighborhood for a
point to be considered as a core point
14
K-means
● k-means clustering aims to partition n observations into k clusters in which
each observation belongs to the cluster with the nearest mean, serving as a
prototype of the cluster
● KMeans(n_clusters=3, n_init=10, n_jobs=1)
● n_init: Number of time the k-means algorithm will be run with different centroid seeds
15
K-means Results
16
K=3 K=10
Mean Shift
17
Intuitive Description
18
Intuitive Description
neighboring
pixels are
determined
19
Intuitive Description
new color
mean value
is calculated
20
Intuitive Description
move that
window to
the area of
maximum
pixel
density
21
Intuitive Description
new
center for
the next
iteration
22
Intuitive Description
iterate until
the spatial
and the
color mean
stops
changing
23
Mean Shift
● Mean Shift: Finding modes in a set of data samples, manifesting an underlying
probability density function (PDF) in RN
● It is a procedure for locating the maxima of a density function given discrete
data sampled from that function
● Thus, it is using a non-parametric density gradient estimation
24
Non-Parametric Density Estimation
25
Non-Parametric Density Estimation
26
Non-Parametric Density Estimation
● for each channel, separate colour histograms will be generated
27
Mean Shift Results
Down Sampled by:
0 1 2 3
28
N-Cut (Graph-Cut)
● Given an image or image sequence, set up
a undirected weighted graph G = (V; E)
● The nodes are pixels
● The weight on the edge connecting two
nodes is the measure of the similarity
between the two nodes in terms of colour,
texture etc.
● The objective of normalized partitioning is
to optimize the cut value
29
Watershed
● The watershed transformation treats the
image it operates upon like a topographic
map
● Intuitively, a drop of water falling on a
topographic relief flows towards the
"nearest" minimum
● The "nearest" minimum is that minimum
which lies at the end of the path of steepest
descent
30
Discussion
● DBSCAN: clustered only water bodies
● K-mean: loss of data
● Mean shift: overall good result
● N-cut: poorest result (time complexity is
too high)
● Watershed: not good for this dataset
Reference [1]
31
Methodology (step-2 of 2)
32
Region Growing Algorithm
33
Region Growing
● Group pixels or sub-regions into
larger regions when homogeneity
criterion is satisfied (assumed
criterion is a range of pixel value in all
3 bands)
● Region grows around the seed point
based on similar properties (grey level,
texture, color) (9 seed points have
been assumed)
● It is better in noisy image where edges
are hard to identify
34
Region Growing Video
35
Output of RG on Mean Shift & Raw Image
As of now, step 2 has directly been performed on raw image. But after tweaking the parameters, output
of mean shift can be used
36
Results
Manually
Labelled
Input
Output
Of RG
Vadodara City Jabalpur CitySagar City 37
Results
Manually Labelled Input Output
Sagar City
Jabalpur City
38
Results
Manually Labelled Input Output
Jabalpur City
Jabalpur City
39
Conclusion
● As of now, RGB Threshold of 14 and B&W
pixel voting of 50% applied on RAW
image without preprocessing gives best
result (based on subjective assessment)
● I will now tune parameters for
unsupervised segmentation and try to
arrive at results which are better than
using RAW image
40
Future Work
● Twiking of Hyperparameters such as RGB Threshold value and Voting value
of B&W pixels for better accuracy
● Implementation of Progressive Thresholding technique to improve voting
Labelled
Input Output
Region
Growing
Voting Value
(Fix 50%)
RGB
Threshold
(10,15,20)
Feedback output image as
new labelled image
41
References
[1] Xia, Xide, and Brian Kulis. "W-Net: A Deep Model
for Fully Unsupervised Image Segmentation." arXiv
preprint arXiv:1711.08506 (2017).
[2] Shi, Jianbo, and Jitendra Malik. "Normalized cuts
and image segmentation." IEEE Transactions on
pattern analysis and machine intelligence 22.8
(2000): 888-905.
[3] Zhou, Yong-mei, Sheng-yi Jiang, and Mei-lin Yin.
"A region-based image segmentation method with
mean-shift clustering algorithm." Fuzzy Systems and
Knowledge Discovery, 2008. FSKD'08. Fifth
International Conference on. Vol. 2. IEEE, 2008.
42
Thank you!

More Related Content

PDF
Remotely Sensed Image (RSI) Analysis for feature extraction using Color map I...
DOCX
Ijdmta v1i1
PPTX
Cahall Final Intern Presentation
PDF
C1104011322
PDF
A binarization technique for extraction of devanagari text from camera based ...
PDF
Dk34681688
PDF
G04544346
PDF
A New Method for Indoor-outdoor Image Classification Using Color Correlated T...
Remotely Sensed Image (RSI) Analysis for feature extraction using Color map I...
Ijdmta v1i1
Cahall Final Intern Presentation
C1104011322
A binarization technique for extraction of devanagari text from camera based ...
Dk34681688
G04544346
A New Method for Indoor-outdoor Image Classification Using Color Correlated T...

What's hot (20)

PDF
Sign Language Recognition Using Image Processing For Mute People
PDF
PDE BASED FEATURES FOR TEXTURE ANALYSIS USING WAVELET TRANSFORM
PDF
Cj36511514
PDF
A Novel PSNR-B Approach for Evaluating the Quality of De-blocked Images
PDF
Reduced-reference Video Quality Metric Using Spatial Information in Salient R...
PDF
Jc3515691575
PDF
Research Inventy : International Journal of Engineering and Science
PDF
Av4301248253
PDF
improving differently illuminant images with fuzzy membership based saturatio...
PDF
Automatic dominant region segmentation for natural images
PDF
Wavelet-Based Color Histogram on Content-Based Image Retrieval
PDF
Parking detection system using background subtraction and HSV color segmentation
PPT
Image inpainting
PDF
IRJET- Crowd Density Estimation using Image Processing
PDF
Dissertation synopsis for imagedenoising(noise reduction )using non local me...
PDF
H017416670
PDF
A Review on Image Inpainting to Restore Image
PDF
Sample Paper Techscribe
PDF
Fuzzy Region Merging Using Fuzzy Similarity Measurement on Image Segmentation
PDF
An improved hdr image processing using fast global tone mapping
Sign Language Recognition Using Image Processing For Mute People
PDE BASED FEATURES FOR TEXTURE ANALYSIS USING WAVELET TRANSFORM
Cj36511514
A Novel PSNR-B Approach for Evaluating the Quality of De-blocked Images
Reduced-reference Video Quality Metric Using Spatial Information in Salient R...
Jc3515691575
Research Inventy : International Journal of Engineering and Science
Av4301248253
improving differently illuminant images with fuzzy membership based saturatio...
Automatic dominant region segmentation for natural images
Wavelet-Based Color Histogram on Content-Based Image Retrieval
Parking detection system using background subtraction and HSV color segmentation
Image inpainting
IRJET- Crowd Density Estimation using Image Processing
Dissertation synopsis for imagedenoising(noise reduction )using non local me...
H017416670
A Review on Image Inpainting to Restore Image
Sample Paper Techscribe
Fuzzy Region Merging Using Fuzzy Similarity Measurement on Image Segmentation
An improved hdr image processing using fast global tone mapping
Ad

Similar to Charmi chokshi ppt (20)

PDF
Image segmentation
PDF
A novel predicate for active region merging in automatic image segmentation
PDF
A novel predicate for active region merging in automatic image segmentation
PPTX
Image segmentation
PDF
IRJET- Image Segmentation Techniques: A Review
PPT
ImSeg04 (2).ppt
PPT
ImSeg04.ppt
PPT
Im seg04
PDF
imagesegmentationppt-120409061123-phpapp01 (2).pdf
PDF
imagesegmentationppt-120409061123-phpapp01 (2).pdf
PPTX
image segmentation by Rajesh
PPTX
IMAGE SEGMENTATION TECHNIQUES
PDF
Lecture 8_Image Segmentation_3_dip__.pdf
PDF
Image segmentation by modified map ml estimations
PDF
Image segmentation by modified map ml
PDF
IMAGE SEGMENTATION BY MODIFIED MAP-ML ESTIMATIONS
PDF
Different Image Segmentation Techniques for Dental Image Extraction
PDF
Image Segmentation Techniques for Remote Sensing Satellite Images.pdf
PDF
Bx4301429434
Image segmentation
A novel predicate for active region merging in automatic image segmentation
A novel predicate for active region merging in automatic image segmentation
Image segmentation
IRJET- Image Segmentation Techniques: A Review
ImSeg04 (2).ppt
ImSeg04.ppt
Im seg04
imagesegmentationppt-120409061123-phpapp01 (2).pdf
imagesegmentationppt-120409061123-phpapp01 (2).pdf
image segmentation by Rajesh
IMAGE SEGMENTATION TECHNIQUES
Lecture 8_Image Segmentation_3_dip__.pdf
Image segmentation by modified map ml estimations
Image segmentation by modified map ml
IMAGE SEGMENTATION BY MODIFIED MAP-ML ESTIMATIONS
Different Image Segmentation Techniques for Dental Image Extraction
Image Segmentation Techniques for Remote Sensing Satellite Images.pdf
Bx4301429434
Ad

More from Charmi Chokshi (14)

PPTX
Deep learning with tensorflow
PDF
From ELIZA to Alexa and Beyond
PDF
Report smart dustbin
PDF
Os prj ppt
PDF
magazine
PDF
PPTX
Trade relations US & India; the changing faces
PPTX
Public vs private sector banks in india
PPTX
Market analysis of pencil
PPTX
Indian agriculture sector, Issues Related to Minimum Support Price, Subsidies...
PPTX
Face recognition system using Hidden Markov Model
PPTX
Educational Resource Management System
PPTX
Dictionary implementation using TRIE
PPTX
8 bit mips-processor
Deep learning with tensorflow
From ELIZA to Alexa and Beyond
Report smart dustbin
Os prj ppt
magazine
Trade relations US & India; the changing faces
Public vs private sector banks in india
Market analysis of pencil
Indian agriculture sector, Issues Related to Minimum Support Price, Subsidies...
Face recognition system using Hidden Markov Model
Educational Resource Management System
Dictionary implementation using TRIE
8 bit mips-processor

Recently uploaded (20)

PPTX
Sustainable Sites - Green Building Construction
PPTX
web development for engineering and engineering
PDF
R24 SURVEYING LAB MANUAL for civil enggi
PDF
Enhancing Cyber Defense Against Zero-Day Attacks using Ensemble Neural Networks
PDF
Mohammad Mahdi Farshadian CV - Prospective PhD Student 2026
PDF
Evaluating the Democratization of the Turkish Armed Forces from a Normative P...
PPT
Project quality management in manufacturing
PDF
Well-logging-methods_new................
PDF
Model Code of Practice - Construction Work - 21102022 .pdf
PPTX
CH1 Production IntroductoryConcepts.pptx
PDF
TFEC-4-2020-Design-Guide-for-Timber-Roof-Trusses.pdf
PDF
keyrequirementskkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkk
PDF
PRIZ Academy - 9 Windows Thinking Where to Invest Today to Win Tomorrow.pdf
PPTX
CARTOGRAPHY AND GEOINFORMATION VISUALIZATION chapter1 NPTE (2).pptx
PPTX
Welding lecture in detail for understanding
PPTX
UNIT 4 Total Quality Management .pptx
PPTX
Lecture Notes Electrical Wiring System Components
PDF
Automation-in-Manufacturing-Chapter-Introduction.pdf
PPTX
M Tech Sem 1 Civil Engineering Environmental Sciences.pptx
PPTX
MCN 401 KTU-2019-PPE KITS-MODULE 2.pptx
Sustainable Sites - Green Building Construction
web development for engineering and engineering
R24 SURVEYING LAB MANUAL for civil enggi
Enhancing Cyber Defense Against Zero-Day Attacks using Ensemble Neural Networks
Mohammad Mahdi Farshadian CV - Prospective PhD Student 2026
Evaluating the Democratization of the Turkish Armed Forces from a Normative P...
Project quality management in manufacturing
Well-logging-methods_new................
Model Code of Practice - Construction Work - 21102022 .pdf
CH1 Production IntroductoryConcepts.pptx
TFEC-4-2020-Design-Guide-for-Timber-Roof-Trusses.pdf
keyrequirementskkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkk
PRIZ Academy - 9 Windows Thinking Where to Invest Today to Win Tomorrow.pdf
CARTOGRAPHY AND GEOINFORMATION VISUALIZATION chapter1 NPTE (2).pptx
Welding lecture in detail for understanding
UNIT 4 Total Quality Management .pptx
Lecture Notes Electrical Wiring System Components
Automation-in-Manufacturing-Chapter-Introduction.pdf
M Tech Sem 1 Civil Engineering Environmental Sciences.pptx
MCN 401 KTU-2019-PPE KITS-MODULE 2.pptx

Charmi chokshi ppt

  • 1. Unsupervised Correction of Training Labels project taken under program Presented by: Charmi Chokshi Final year BTech student of Information and Communication Technology Ahmedabad University Duration: May 31, 2018 till July End Guided by: Mr. Pankaj Bodani (Scientist-SE) Space Application Centre-ISRO
  • 2. Outline ● Objectives ● Introduction ● Methodology ● Unsupervised Sementic Segmentation Algorithms ● Discussion ● Region Growing Algorithm ● Conclusion ● Future work ● References 2
  • 3. Objective ● To work on the preprocessing (Data Cleaning) step of the image segmentation problem using Deep Learning ● To create accurate Training Labels as the input to Neural Network ● To investigate/compare the use of different unsupervised image segmentation techniques for boundary correction 3
  • 5. What is Semantic Segmentation? ● In computer vision, semantic segmentation is the process of partitioning a digital image into multiple meaningful segments ● Semantic segmentation is typically used to locate objects and boundaries in images. ● More precisely, it is the process of assigning a label to every pixel in an image such that pixels with the same label share certain characteristics 5
  • 6. Application of Image Segmentation ● Driverless car ● Medical imaging ● Object detection ○ Face detection ○ Pedestrian detection ○ Brake light detection ○ Locate objects in satellite images (roads, forests, crops, water bodies, etc.) ● Recognition Tasks ● Traffic control systems 6
  • 7. What is Unsupervised Sementic Segmentation? ● Unsupervised = Learning without the help of teacher! ● No Labeled Training Data available for the model to learn from ● The goal of this unsupervised machine learning technique is to find similarities in the data point and group similar data points together which will give us insight into underlying patterns of different groups ● “Clustering” is the process of grouping similar entities together Supervised Segmentation Input: Raw image, Labelled Image Output: Segmented Image Unsupervised Segmentation Input: Raw image Output: Segmented Image 7
  • 9. Methodology Methodology consist of following 2 stages: 1. Unsupervised Sementic Segmentation using Clustering algorithms 2. Post-processing using Seed based Region Growing algorithm Input Data ● Satellite: IRS Resourcesat–2 ● Sensor: LISS-IV ● Spatial Resolution: 5.8 m ● No of Bands: 3 ● Bit depth: 16 – 10 bit quantization (1024 different tones can be assigned to a pixel) ● Size: More than 3000 X 3000 ● Cities: Vadodara, Jabalpur, Sagar, Satana, Ujjain, Varanasi, Rampur 9
  • 11. Raw and Manually Labelled Images 11
  • 12. Raw and Manually Labelled Images 12
  • 14. DBSCAN ● DBSCAN: Density-based spatial clustering of applications with noise ● Its clusters are defined as areas of higher density than the remainder of the data set ● It clusters water bodies accurately in our dataset ● DBSCAN(eps=3.5, min_samples=5, metric=’euclidean’, n_jobs=1) ● eps: The maximum distance between two samples for them to be considered as in the same neighborhood ● min_samples: The number of samples in a neighborhood for a point to be considered as a core point 14
  • 15. K-means ● k-means clustering aims to partition n observations into k clusters in which each observation belongs to the cluster with the nearest mean, serving as a prototype of the cluster ● KMeans(n_clusters=3, n_init=10, n_jobs=1) ● n_init: Number of time the k-means algorithm will be run with different centroid seeds 15
  • 20. Intuitive Description new color mean value is calculated 20
  • 21. Intuitive Description move that window to the area of maximum pixel density 21
  • 23. Intuitive Description iterate until the spatial and the color mean stops changing 23
  • 24. Mean Shift ● Mean Shift: Finding modes in a set of data samples, manifesting an underlying probability density function (PDF) in RN ● It is a procedure for locating the maxima of a density function given discrete data sampled from that function ● Thus, it is using a non-parametric density gradient estimation 24
  • 27. Non-Parametric Density Estimation ● for each channel, separate colour histograms will be generated 27
  • 28. Mean Shift Results Down Sampled by: 0 1 2 3 28
  • 29. N-Cut (Graph-Cut) ● Given an image or image sequence, set up a undirected weighted graph G = (V; E) ● The nodes are pixels ● The weight on the edge connecting two nodes is the measure of the similarity between the two nodes in terms of colour, texture etc. ● The objective of normalized partitioning is to optimize the cut value 29
  • 30. Watershed ● The watershed transformation treats the image it operates upon like a topographic map ● Intuitively, a drop of water falling on a topographic relief flows towards the "nearest" minimum ● The "nearest" minimum is that minimum which lies at the end of the path of steepest descent 30
  • 31. Discussion ● DBSCAN: clustered only water bodies ● K-mean: loss of data ● Mean shift: overall good result ● N-cut: poorest result (time complexity is too high) ● Watershed: not good for this dataset Reference [1] 31
  • 34. Region Growing ● Group pixels or sub-regions into larger regions when homogeneity criterion is satisfied (assumed criterion is a range of pixel value in all 3 bands) ● Region grows around the seed point based on similar properties (grey level, texture, color) (9 seed points have been assumed) ● It is better in noisy image where edges are hard to identify 34
  • 36. Output of RG on Mean Shift & Raw Image As of now, step 2 has directly been performed on raw image. But after tweaking the parameters, output of mean shift can be used 36
  • 38. Results Manually Labelled Input Output Sagar City Jabalpur City 38
  • 39. Results Manually Labelled Input Output Jabalpur City Jabalpur City 39
  • 40. Conclusion ● As of now, RGB Threshold of 14 and B&W pixel voting of 50% applied on RAW image without preprocessing gives best result (based on subjective assessment) ● I will now tune parameters for unsupervised segmentation and try to arrive at results which are better than using RAW image 40
  • 41. Future Work ● Twiking of Hyperparameters such as RGB Threshold value and Voting value of B&W pixels for better accuracy ● Implementation of Progressive Thresholding technique to improve voting Labelled Input Output Region Growing Voting Value (Fix 50%) RGB Threshold (10,15,20) Feedback output image as new labelled image 41
  • 42. References [1] Xia, Xide, and Brian Kulis. "W-Net: A Deep Model for Fully Unsupervised Image Segmentation." arXiv preprint arXiv:1711.08506 (2017). [2] Shi, Jianbo, and Jitendra Malik. "Normalized cuts and image segmentation." IEEE Transactions on pattern analysis and machine intelligence 22.8 (2000): 888-905. [3] Zhou, Yong-mei, Sheng-yi Jiang, and Mei-lin Yin. "A region-based image segmentation method with mean-shift clustering algorithm." Fuzzy Systems and Knowledge Discovery, 2008. FSKD'08. Fifth International Conference on. Vol. 2. IEEE, 2008. 42