SlideShare a Scribd company logo
Parallel implementation of geodesic distance
transform with application in superpixel segmentation
Tuan Q. Pham
Canon Information Systems Research Australia (CISRA)
tuan.pham@cisra.canon.com.au

Geodesic distance

Parallel distance transform

 Geodesic distance between two points = sum of
pixel costs along a minimum-cost path
 Geodesic distance transform d(cost image f,
seed points) = geodesic distance from every
pixel to its nearest seed
destination

1

0.8

0.4

minimum path, cost = 1.7
straight path, cost = 11.1

 Segment image into superpixels using GDT where

 The algorithm can be parallelised if multiple
passes are allowed (OK since GDT is iterative)

 Cost image = gradient energy + a small offset
 Seed points = well-separated local gradient
minima by adaptive non-maximum suppression

 Distance transform is propagated across bands in
a next iteration (may require more iterations)

8

0.6

source

 Within a pass, chamfer algorithm is sequential

 Image is divided into bands for parallel processing

10

Superpixel segmentation

6

seed

4

0.2

2

0

Fig. 1. Cost image (left) and its geodesic distance transform (right)

Geodesic distance transform
 Chamfer distance algorithm = multiple iterations of
a forward
propagation

Fig. 4. Band-based image partitioning for parallel GDT implementation
Fig. 6. 1000 geodesic superpixels from a 2MegaPixel image (1936×1288)

OpenMP implementation

Segmentation comparison

 OpenMP = an easy to use Open Multi-Processing
platform that is designed for multicore processors
and is supported by most compilers

and
a backward
propagation

 Geodesic superpixel is faster & follows edges better

 OpenMP parallelises loop by compiler directives

Frame 8

SLIC [1] (4.6 seconds)

Geodesic (0.64 sec)

TurboPixels [2] (207 sec)

Fig. 7. Three state-of-the-art superpixel methods on 2MP image in Fig.6

Fig. 2. One iteration of a forward pass (left) and a backward pass (right)

Geodesic Voronoi tessellation
 Geodesic distance transform (GDT) produces edgefollowing Voronoi tessellation if edge is used as cost

frame 12

frame 8

frame 4

frame 1

Evaluation of parallel GDT
 Best with static scheduling (where bands are
assigned to threads in a round-robin fashion)
Cost image & 4 seed points

Input image

Method

#

Time Platform

Method

Watershed 1008 3.2s C/Matlab

 Number of fwd+bwd propagation iterations
increases slightly under parallel implementation
(10 iterations are often enough for segmentation)

Entropy 1000 6.5s

C

Lattice

FH

Time Platform

Method

#

Time Platform

1024 2.3s

C

Quickshift

992 13.3s

C

Geodesic 1000 0.3s

C

CVT

1000 2.7s

Matlab

C

Turbo

1067 58.1s Matlab

C

1024 1.4s

#

SLIC

990 1.2s

Fig. 8. Segmentation of 1MP image (# denotes number of superpixels returned)

 Sub-second runtime on 5 MP image or smaller
 Speedup of 1.3× on 2-core, 2.6× on 4-core CPU

region without a nearest seed

GDT after 10 fwd+bwd
propagation iterations

fragmentation

2

1.5

Summary

3.5

without OpenMP
static schedule
dynamic schedule

3

speedup factor

Intermediate GDT after
a first backward pass

runtime (seconds)

Intermediate GDT after
a first forward pass

1

0.5

 We proposed a parallel implementation of

2.5
2

geodesic distance transform using OpenMP

1.5
1

static schedule
dynamic schedule

0.5
0
0

1000

2000

3000

4000

0
0

1000

image width (pixels)

Nearest seed label after Nearest seed label after
a first backward pass
a first forward pass

Nearest seed label
after 10 iterations

Fig. 3. Geodesic distance transform (2nd row) and tessellation (3rd row)

Runtime

2000

3000

4000

image width (pixels)

Speedup factor

Fig. 5. Runtime & speedup factor on 2.8GHz quad-core CPU with 12GB RAM

References: 1. Achanta et al., SLIC superpixels compared to state-of-the-art superpixel methods, PAMI 34(11), 2012.

 Our geodesic segmentation method
produces more regular, edge-following
superpixels at orders of magnitude faster
than state-of-the-art segmentation methods.

2. Levinshtein et al., TurboPixels: Fast superpixels using geometric flows, PAMI 31(12), 2009.

Contact details: Tuan Q. Pham (tuan.pham@cisra.canon.com.au), 1 Thomas Holt drive, North Ryde, NSW 2113, Australia
Presented at Int’l Conf. on Digital Image Computing: Techniques and Applications (DICTA) Paper 5, Poster session 2 on Thursday 8th November, 2013. Hobart, Australia

More Related Content

PDF
Realtime 3D Visualization without GPU
PDF
robio-2014-falquez
PPTX
Geometry Batching Using Texture-Arrays
PDF
cnsm2011_slide
PPTX
Rendering of Complex 3D Treemaps (GRAPP 2013)
PDF
Digital Image Processing: Image Enhancement in the Frequency Domain
PPTX
2.5D Clip-Surfaces for Technical Visualization
PDF
Multi-core GPU – Fast parallel SAR image generation
Realtime 3D Visualization without GPU
robio-2014-falquez
Geometry Batching Using Texture-Arrays
cnsm2011_slide
Rendering of Complex 3D Treemaps (GRAPP 2013)
Digital Image Processing: Image Enhancement in the Frequency Domain
2.5D Clip-Surfaces for Technical Visualization
Multi-core GPU – Fast parallel SAR image generation

What's hot (20)

PDF
Gossip-based resource allocation for green computing in large clouds
PDF
PhD defense talk (portfolio of my expertise)
PPT
Stable SSAO in Battlefield 3 with Selective Temporal Filtering
PDF
論文紹介"DynamicFusion: Reconstruction and Tracking of Non-­‐rigid Scenes in Real...
PDF
30th コンピュータビジョン勉強会@関東 DynamicFusion
PDF
Point cloud mesh-investigation_report-lihang
PDF
NVIDIA effects GDC09
PDF
Optimized linear spatial filters implemented in FPGA
PPT
Anatomy of a Texture Fetch
PDF
Siggraph2016 - The Devil is in the Details: idTech 666
PDF
Ultra Fast SOM using CUDA
PDF
Bryan Thompson, Chief Scientist and Founder at SYSTAP, LLC at MLconf NYC
PPSX
Advancements in-tiled-rendering
PPTX
Multimedia lossy compression algorithms
PDF
Modified approximate 8-point multiplier less DCT like transform
PPTX
FlameWorks GTC 2014
PDF
Adaptive lifting based image compression scheme using interactive artificial ...
PPTX
High Performance Pedestrian Detection On TEGRA X1
PDF
Jpeg image compression using discrete cosine transform a survey
PPT
Motion estimation overview
Gossip-based resource allocation for green computing in large clouds
PhD defense talk (portfolio of my expertise)
Stable SSAO in Battlefield 3 with Selective Temporal Filtering
論文紹介"DynamicFusion: Reconstruction and Tracking of Non-­‐rigid Scenes in Real...
30th コンピュータビジョン勉強会@関東 DynamicFusion
Point cloud mesh-investigation_report-lihang
NVIDIA effects GDC09
Optimized linear spatial filters implemented in FPGA
Anatomy of a Texture Fetch
Siggraph2016 - The Devil is in the Details: idTech 666
Ultra Fast SOM using CUDA
Bryan Thompson, Chief Scientist and Founder at SYSTAP, LLC at MLconf NYC
Advancements in-tiled-rendering
Multimedia lossy compression algorithms
Modified approximate 8-point multiplier less DCT like transform
FlameWorks GTC 2014
Adaptive lifting based image compression scheme using interactive artificial ...
High Performance Pedestrian Detection On TEGRA X1
Jpeg image compression using discrete cosine transform a survey
Motion estimation overview
Ad

Viewers also liked (8)

PDF
Paper fingerprinting using alpha-masked image matching
PDF
19 water marking 9
PDF
Parallel implementation of geodesic distance transform with application in su...
PDF
Digital Image Watermarking Basics
PDF
A robust watermarking algorithm based on image normalization and dc coefficients
PPTX
digital-water-marking-created-by-subrat&rubi
PPTX
Digital water marking
PPTX
digital watermarking
Paper fingerprinting using alpha-masked image matching
19 water marking 9
Parallel implementation of geodesic distance transform with application in su...
Digital Image Watermarking Basics
A robust watermarking algorithm based on image normalization and dc coefficients
digital-water-marking-created-by-subrat&rubi
Digital water marking
digital watermarking
Ad

Similar to Parallel implementation of geodesic distance transform with application in superpixel segmentation (20)

PPT
D3 D10 Unleashed New Features And Effects
PPTX
Thesis Final Presentation
PDF
FPGA Implementation of 2-D DCT & DWT Engines for Vision Based Tracking of Dyn...
PPT
The Intersection of Game Engines & GPUs: Current & Future (Graphics Hardware ...
PDF
Median based parallel steering kernel regression for image reconstruction
PDF
MEDIAN BASED PARALLEL STEERING KERNEL REGRESSION FOR IMAGE RECONSTRUCTION
PDF
MEDIAN BASED PARALLEL STEERING KERNEL REGRESSION FOR IMAGE RECONSTRUCTION
PPTX
2022-01-17-Rethinking_Bisenet.pptx
PPTX
crowd counting.pptx
PDF
AI optimizing HPC simulations (presentation from 6th EULAG Workshop)
PDF
Final Project
PDF
High Speed and Area Efficient 2D DWT Processor Based Image Compression
PDF
DESIGN OF DELAY COMPUTATION METHOD FOR CYCLOTOMIC FAST FOURIER TRANSFORM
PPT
Secrets of CryENGINE 3 Graphics Technology
PDF
Hz2514321439
PDF
Hz2514321439
PDF
Hz2514321439
PPT
Discrete cosine transform
PDF
Robust foreground modelling to segment and detect multiple moving objects in ...
PDF
4g lte matlab
D3 D10 Unleashed New Features And Effects
Thesis Final Presentation
FPGA Implementation of 2-D DCT & DWT Engines for Vision Based Tracking of Dyn...
The Intersection of Game Engines & GPUs: Current & Future (Graphics Hardware ...
Median based parallel steering kernel regression for image reconstruction
MEDIAN BASED PARALLEL STEERING KERNEL REGRESSION FOR IMAGE RECONSTRUCTION
MEDIAN BASED PARALLEL STEERING KERNEL REGRESSION FOR IMAGE RECONSTRUCTION
2022-01-17-Rethinking_Bisenet.pptx
crowd counting.pptx
AI optimizing HPC simulations (presentation from 6th EULAG Workshop)
Final Project
High Speed and Area Efficient 2D DWT Processor Based Image Compression
DESIGN OF DELAY COMPUTATION METHOD FOR CYCLOTOMIC FAST FOURIER TRANSFORM
Secrets of CryENGINE 3 Graphics Technology
Hz2514321439
Hz2514321439
Hz2514321439
Discrete cosine transform
Robust foreground modelling to segment and detect multiple moving objects in ...
4g lte matlab

More from Tuan Q. Pham (13)

PDF
Oral presentation on Asymmetric recursive Gaussian filtering for space-varia...
PDF
Asymmetric recursive Gaussian filtering for space-variant artificial bokeh
PDF
Multi-hypothesis projection-based shift estimation for sweeping panorama reco...
PDF
Multi-hypothesis projection-based shift estimation for sweeping panorama reco...
PPT
Non-maximum suppression using fewer than two comparison per pixels
PDF
Paper fingerprinting using alpha-masked image matching
PDF
Bidirectional bias correction for gradient-based shift estimation
PDF
Robust Super-Resolution by minimizing a Gaussian-weighted L2 error norm
PPT
Resolution enhancement of low-quality videos using a high-resolution frame
PPT
Separable bilateral filtering for fast video preprocessing
PDF
Performance of Optimal Registration Estimator
PDF
Influence of Signal-to-Noise Ratio and Point Spread Function on Limits of Sup...
PPT
Normalized averaging using adaptive applicability functions with applications...
Oral presentation on Asymmetric recursive Gaussian filtering for space-varia...
Asymmetric recursive Gaussian filtering for space-variant artificial bokeh
Multi-hypothesis projection-based shift estimation for sweeping panorama reco...
Multi-hypothesis projection-based shift estimation for sweeping panorama reco...
Non-maximum suppression using fewer than two comparison per pixels
Paper fingerprinting using alpha-masked image matching
Bidirectional bias correction for gradient-based shift estimation
Robust Super-Resolution by minimizing a Gaussian-weighted L2 error norm
Resolution enhancement of low-quality videos using a high-resolution frame
Separable bilateral filtering for fast video preprocessing
Performance of Optimal Registration Estimator
Influence of Signal-to-Noise Ratio and Point Spread Function on Limits of Sup...
Normalized averaging using adaptive applicability functions with applications...

Recently uploaded (20)

PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PDF
KodekX | Application Modernization Development
PDF
Encapsulation theory and applications.pdf
PDF
Review of recent advances in non-invasive hemoglobin estimation
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PPTX
MYSQL Presentation for SQL database connectivity
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PDF
Bridging biosciences and deep learning for revolutionary discoveries: a compr...
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
DOCX
The AUB Centre for AI in Media Proposal.docx
PPTX
Cloud computing and distributed systems.
PPTX
A Presentation on Artificial Intelligence
PDF
cuic standard and advanced reporting.pdf
PPTX
Big Data Technologies - Introduction.pptx
20250228 LYD VKU AI Blended-Learning.pptx
Per capita expenditure prediction using model stacking based on satellite ima...
KodekX | Application Modernization Development
Encapsulation theory and applications.pdf
Review of recent advances in non-invasive hemoglobin estimation
Dropbox Q2 2025 Financial Results & Investor Presentation
Understanding_Digital_Forensics_Presentation.pptx
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
MYSQL Presentation for SQL database connectivity
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
Diabetes mellitus diagnosis method based random forest with bat algorithm
Chapter 3 Spatial Domain Image Processing.pdf
Bridging biosciences and deep learning for revolutionary discoveries: a compr...
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
Digital-Transformation-Roadmap-for-Companies.pptx
The AUB Centre for AI in Media Proposal.docx
Cloud computing and distributed systems.
A Presentation on Artificial Intelligence
cuic standard and advanced reporting.pdf
Big Data Technologies - Introduction.pptx

Parallel implementation of geodesic distance transform with application in superpixel segmentation

  • 1. Parallel implementation of geodesic distance transform with application in superpixel segmentation Tuan Q. Pham Canon Information Systems Research Australia (CISRA) tuan.pham@cisra.canon.com.au Geodesic distance Parallel distance transform  Geodesic distance between two points = sum of pixel costs along a minimum-cost path  Geodesic distance transform d(cost image f, seed points) = geodesic distance from every pixel to its nearest seed destination 1 0.8 0.4 minimum path, cost = 1.7 straight path, cost = 11.1  Segment image into superpixels using GDT where  The algorithm can be parallelised if multiple passes are allowed (OK since GDT is iterative)  Cost image = gradient energy + a small offset  Seed points = well-separated local gradient minima by adaptive non-maximum suppression  Distance transform is propagated across bands in a next iteration (may require more iterations) 8 0.6 source  Within a pass, chamfer algorithm is sequential  Image is divided into bands for parallel processing 10 Superpixel segmentation 6 seed 4 0.2 2 0 Fig. 1. Cost image (left) and its geodesic distance transform (right) Geodesic distance transform  Chamfer distance algorithm = multiple iterations of a forward propagation Fig. 4. Band-based image partitioning for parallel GDT implementation Fig. 6. 1000 geodesic superpixels from a 2MegaPixel image (1936×1288) OpenMP implementation Segmentation comparison  OpenMP = an easy to use Open Multi-Processing platform that is designed for multicore processors and is supported by most compilers and a backward propagation  Geodesic superpixel is faster & follows edges better  OpenMP parallelises loop by compiler directives Frame 8 SLIC [1] (4.6 seconds) Geodesic (0.64 sec) TurboPixels [2] (207 sec) Fig. 7. Three state-of-the-art superpixel methods on 2MP image in Fig.6 Fig. 2. One iteration of a forward pass (left) and a backward pass (right) Geodesic Voronoi tessellation  Geodesic distance transform (GDT) produces edgefollowing Voronoi tessellation if edge is used as cost frame 12 frame 8 frame 4 frame 1 Evaluation of parallel GDT  Best with static scheduling (where bands are assigned to threads in a round-robin fashion) Cost image & 4 seed points Input image Method # Time Platform Method Watershed 1008 3.2s C/Matlab  Number of fwd+bwd propagation iterations increases slightly under parallel implementation (10 iterations are often enough for segmentation) Entropy 1000 6.5s C Lattice FH Time Platform Method # Time Platform 1024 2.3s C Quickshift 992 13.3s C Geodesic 1000 0.3s C CVT 1000 2.7s Matlab C Turbo 1067 58.1s Matlab C 1024 1.4s # SLIC 990 1.2s Fig. 8. Segmentation of 1MP image (# denotes number of superpixels returned)  Sub-second runtime on 5 MP image or smaller  Speedup of 1.3× on 2-core, 2.6× on 4-core CPU region without a nearest seed GDT after 10 fwd+bwd propagation iterations fragmentation 2 1.5 Summary 3.5 without OpenMP static schedule dynamic schedule 3 speedup factor Intermediate GDT after a first backward pass runtime (seconds) Intermediate GDT after a first forward pass 1 0.5  We proposed a parallel implementation of 2.5 2 geodesic distance transform using OpenMP 1.5 1 static schedule dynamic schedule 0.5 0 0 1000 2000 3000 4000 0 0 1000 image width (pixels) Nearest seed label after Nearest seed label after a first backward pass a first forward pass Nearest seed label after 10 iterations Fig. 3. Geodesic distance transform (2nd row) and tessellation (3rd row) Runtime 2000 3000 4000 image width (pixels) Speedup factor Fig. 5. Runtime & speedup factor on 2.8GHz quad-core CPU with 12GB RAM References: 1. Achanta et al., SLIC superpixels compared to state-of-the-art superpixel methods, PAMI 34(11), 2012.  Our geodesic segmentation method produces more regular, edge-following superpixels at orders of magnitude faster than state-of-the-art segmentation methods. 2. Levinshtein et al., TurboPixels: Fast superpixels using geometric flows, PAMI 31(12), 2009. Contact details: Tuan Q. Pham (tuan.pham@cisra.canon.com.au), 1 Thomas Holt drive, North Ryde, NSW 2113, Australia Presented at Int’l Conf. on Digital Image Computing: Techniques and Applications (DICTA) Paper 5, Poster session 2 on Thursday 8th November, 2013. Hobart, Australia