Wavelets @ CPU
David Barina
April 15, 2014
David Barina Wavelets @ CPU April 15, 2014 1 / 16
Wavelet
Discrete Wavelet Transform
Lifting
[Figure: lifting scheme with coefficients α, β, γ, δ]
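The four coefficients on this slide match the four lifting steps of the CDF 9/7 wavelet that the Results slide refers to. A minimal 1-D sketch follows, assuming the commonly published constant values, symmetric border extension, an input of at least two samples, and one common scaling convention (low-pass multiplied by ζ, high-pass divided by it). The function names `fdwt97` and `idwt97` are illustrative and do not come from the slides.

```python
# CDF 9/7 lifting constants (commonly published values; assumed here).
ALPHA = -1.586134342
BETA = -0.05298011854
GAMMA = 0.8829110762
DELTA = 0.4435068522
ZETA = 1.149604398  # scaling factor; conventions differ between implementations


def _lift(x, start, coeff):
    """Add coeff * (left + right neighbour) to every second sample, in place.

    Borders use symmetric extension: the missing neighbour is mirrored.
    """
    n = len(x)
    for i in range(start, n, 2):
        left = x[i - 1] if i - 1 >= 0 else x[i + 1]
        right = x[i + 1] if i + 1 < n else x[i - 1]
        x[i] += coeff * (left + right)


def fdwt97(signal):
    """Forward transform; returns interleaved low/high coefficients."""
    x = [float(v) for v in signal]
    _lift(x, 1, ALPHA)  # first predict step (odd samples)
    _lift(x, 0, BETA)   # first update step (even samples)
    _lift(x, 1, GAMMA)  # second predict step
    _lift(x, 0, DELTA)  # second update step
    for i in range(len(x)):
        x[i] = x[i] * ZETA if i % 2 == 0 else x[i] / ZETA
    return x


def idwt97(coeffs):
    """Inverse transform: undo the scaling, then the lifting steps in reverse."""
    x = [float(v) for v in coeffs]
    for i in range(len(x)):
        x[i] = x[i] / ZETA if i % 2 == 0 else x[i] * ZETA
    _lift(x, 0, -DELTA)
    _lift(x, 1, -GAMMA)
    _lift(x, 0, -BETA)
    _lift(x, 1, -ALPHA)
    return x
```

Each step changes samples of one parity using only samples of the other parity, so the inverse simply subtracts the same quantities in reverse order.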
2-D DWT
2-D Separability
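Separability means the 2-D transform can be computed as 1-D transforms along rows followed by 1-D transforms along columns, in either order. The sketch below illustrates this with a deliberately simple unnormalised Haar lifting step rather than CDF 9/7, on an image with even side lengths. All function names are hypothetical.

```python
def haar_1d(x):
    """One level of an unnormalised Haar transform via lifting.

    Returns the low-pass half followed by the high-pass half.
    """
    low, high = [], []
    for i in range(0, len(x) - 1, 2):
        d = x[i + 1] - x[i]  # predict: difference of the pair
        s = x[i] + d / 2     # update: average of the pair
        low.append(s)
        high.append(d)
    return low + high


def transpose(m):
    return [list(col) for col in zip(*m)]


def transform_rows(m):
    """Apply the 1-D transform to every row."""
    return [haar_1d(row) for row in m]


def transform_columns(m):
    """Apply the 1-D transform to every column."""
    return transpose(transform_rows(transpose(m)))


def dwt2(m):
    """Separable 2-D transform: rows first, then columns."""
    return transform_columns(transform_rows(m))
```

Applying `transform_rows` and `transform_columns` in the opposite order gives the same result, which is what makes the row and column passes interchangeable.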
What have I done?
loop fusion
removed prologs/epilogs
influence of CPU cache
SIMD-vectorization
parallelization
Loop Fusion
[Figure labels: read, write, F, F]
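As a generic illustration of loop fusion on lifting steps (not necessarily the exact fusion drawn on this slide), the sketch below compares a predict pass followed by an update pass with a single fused loop that produces both outputs in one sweep over the data. It assumes a simple 5/3-style pair of lifting steps on already split even and odd samples, with the border sample repeated. The function names are hypothetical.

```python
def lifting_two_pass(even, odd):
    """Predict and update as two separate passes over the data."""
    n = len(even)
    # Pass 1: predict every odd sample from its two even neighbours.
    d = [odd[i] - 0.5 * (even[i] + even[min(i + 1, n - 1)]) for i in range(n)]
    # Pass 2: update every even sample from its two detail neighbours.
    s = [even[i] + 0.25 * (d[max(i - 1, 0)] + d[i]) for i in range(n)]
    return s, d


def lifting_fused(even, odd):
    """The same computation fused into one loop.

    The update at position i needs d[i - 1] and d[i], so it can run
    right after d[i] has been computed in the same iteration.
    """
    n = len(even)
    s = [0.0] * n
    d = [0.0] * n
    for i in range(n):
        d[i] = odd[i] - 0.5 * (even[i] + even[min(i + 1, n - 1)])
        prev = d[i - 1] if i > 0 else d[i]
        s[i] = even[i] + 0.25 * (prev + d[i])
    return s, d
```

The fused version reads each input sample once per sweep instead of once per pass, which is the kind of memory-traffic saving that loop fusion aims for.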
Removed Prologs and Epilogs
Influence of CPU Cache
SIMD Vectorization
[Figure labels: 4 × 4, 6 × 2]
Image Processing and Buffers
Parallelization
[Figure labels: prolog, overlay, overlay, segment]
Results
Intel Core2 Quad @ 2.00 GHz
10 Mpx
CDF 9/7, 1 level, in-place
approach        best algorithm   time/px     speed-up
separable       diag.            17.23 ns     1.0×
single-loop     diag. 2 × 2       9.55 ns     1.8×
core            diag. 2 × 2       8.79 ns     2.0×
super-core      vert. 4 × 4       5.33 ns     3.2×
parallel (4)    vert. 4 × 4       1.55 ns    11.1×
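The speed-up column appears to be the separable time per pixel divided by each row's time per pixel. The short check below reproduces it from the times in the table and also estimates the total time for one image, assuming "10 Mpx" means 10 × 10^6 pixels. The variable names are illustrative.

```python
# Time per pixel in nanoseconds, copied from the results table.
times_ns = {
    "separable": 17.23,
    "single-loop": 9.55,
    "core": 8.79,
    "super-core": 5.33,
    "parallel (4)": 1.55,
}

baseline_ns = times_ns["separable"]

# Speed-up relative to the separable approach, rounded as in the table.
speedups = {name: round(baseline_ns / t, 1) for name, t in times_ns.items()}

# Estimated time for one image in milliseconds (assumes 10 Mpx = 10e6 pixels).
PIXELS = 10_000_000
total_ms = {name: t * PIXELS / 1e6 for name, t in times_ns.items()}

for name in times_ns:
    print(f"{name:13s} {speedups[name]:5.1f}x  {total_ms[name]:7.1f} ms")
```

Under that assumption the separable approach takes roughly 172 ms per image and the four-thread version roughly 15.5 ms.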
Future Work
merge several levels
merge forward and inverse cores
other wavelets
combine with EAW
other platforms (ARM, GPU, FPGA)
other transforms
Example (AMD Opteron)
[Plot: time/pixel (1.0 ns to 100.0 ns) versus pixels (1.0k to 100.0M); series: naive vertical, naive diagonal, single-loop vertical, single-loop diagonal]