LightFields.jl: Fast 3D image reconstruction for VR
applications
H´ector Andrade Loarca
Technical University of Berlin, BMS
7th of July, 2018
PyData 2018
Main goal
Present a novel technique to reconstruct the depth map of a scene
from a limited number of views. This can be applied in view synthesis
and rendering for free viewpoint VR.
H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 2 / 24
Main goal
Present a novel technique to reconstruct the depth map of a scene
from a limited number of views. This can be applied in view synthesis
and rendering for free viewpoint VR.
Explain the main building blocks of the technique: Light Field and
Shearlets.
H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 2 / 24
Main goal
Present a novel technique to reconstruct the depth map of a scene
from a limited number of views. This can be applied in view synthesis
and rendering for free viewpoint VR.
Explain the main building blocks of the technique: Light Field and
Shearlets.
Show a free hardware/software implementation using julia, python
and Raspberry Pi.
H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 2 / 24
What is a Light Field?
Light can be interpreted as a field, i.e. assignment of a vector to each
point in the space (M. Faraday, 1846).
H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 3 / 24
What is a Light Field?
Light can be interpreted as a field, i.e. assignment of a vector to each
point in the space (M. Faraday, 1846).
Propagation of light rays in the 3D space is completely described by a
7D continuous function L : R7 −→ R3, L(x, y, z, θ, φ, λ, τ) called the
plenoptic function
H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 3 / 24
What is a Light Field?
Light can be interpreted as a field, i.e. assignment of a vector to each
point in the space (M. Faraday, 1846).
Propagation of light rays in the 3D space is completely described by a
7D continuous function L : R7 −→ R3, L(x, y, z, θ, φ, λ, τ) called the
plenoptic function
H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 3 / 24
What is a Light Field?
Light can be interpreted as a field, i.e. assignment of a vector to each
point in the space (M. Faraday, 1846).
Propagation of light rays in the 3D space is completely described by a
7D continuous function L : R7 −→ R3, L(x, y, z, θ, φ, λ, τ) called the
plenoptic function
L can be simplified to a 4D function L4, called 4D Light Field or
simply Light Field, which quantifies the intensity of static and
monochromatic light rays propagating in half space.
H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 3 / 24
4D Light Field Representation
Figure: Three different representation of 4F LF. Left: L4(u, v, φ, θ). Center:
L4(φ1, θ1, φ2, θ2). Right: L4(u, v, s, t).
H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 4 / 24
4D Light Field Representation
Figure: Three different representation of 4F LF. Left: L4(u, v, φ, θ). Center:
L4(φ1, θ1, φ2, θ2). Right: L4(u, v, s, t).
Figure: Used representation: ”Two plane parametrization”.
H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 4 / 24
From LF to 3D
Stereo Vision: The human brain generates the 3D depth perception
of its sorroundings by triangulating the points of a scene using the
information coming from both eyes.
H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 5 / 24
From LF to 3D
Stereo Vision: The human brain generates the 3D depth perception
of its sorroundings by triangulating the points of a scene using the
information coming from both eyes.
Epipolar Geometry: Generalization of Stereo Vision with more than
two views, assuming the epipolar constraint.
H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 5 / 24
From LF to 3D
Stereo Vision: The human brain generates the 3D depth perception
of its sorroundings by triangulating the points of a scene using the
information coming from both eyes.
Epipolar Geometry: Generalization of Stereo Vision with more than
two views, assuming the epipolar constraint.
Epipolar Constraint: Analysis of object position while assuming the
knowledge of the camera motion.
H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 5 / 24
Epipolar Plane Images (EPIs) on Straight Line Trajectories
H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 6 / 24
Depth map estimation with EPIs
Point-depth formula: D = h∆X
∆u = h ∆X
u1−u2.
Sampling rate (Nyquist criterion): ∆X ≤ Dmin
h ∆u.
H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 7 / 24
Commercial LF (Epipolar) camera
H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 8 / 24
Our approach: Sub-Nyquist reconstruction via inpainting
H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 9 / 24
(General) Image inpainting
Mathematical formulation
Recover an image f ∈ X from known data:
g = PK (f )
where PK is and orthogonal projection onto the known subspace XK X.
H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 10 / 24
How to inpaint?
Frame
A frame for a Hilbert space X is a collection Ψ = {ψi }i∈I ⊂ X satisfying
A||f ||2 ≤ ||{ f , ψi }i∈I|| 2(I) ≤ B||f ||2 ∀f ∈ X
for some 0 < A ≤ B < ∞.
Sparse Regularization/CS approach (Genzel, Kutyniok, 2014):
” If a signal (image) is sparse within a frame Ψ, it can be recovered from
highly underdetermined, non-adaptive linear measurements by
1-regularization, i.e.
min
˜f ∈X
||{ ˜f , ψi }i∈I|| 1(I) s.t. PK (˜f ) = g = PK (f ) ”
H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 11 / 24
Frames for images and optimal sparsity
Gabor frames (Gabor, 1946).
Wavelet frames (Morlet et al., 1984).
Curvelet frames (Cand`es et al., 1999).
Shearlet frames (Kutyniok et al., 2005).
H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 12 / 24
Frames for images and optimal sparsity
Gabor frames (Gabor, 1946).
Wavelet frames (Morlet et al., 1984).
Curvelet frames (Cand`es et al., 1999).
Shearlet frames (Kutyniok et al., 2005).
Best N-term approx. error (Donoho, 2001)
Let {ψλ}λ∈Λ ⊂ L2(R2) a frame. The optimal best N-Term approximation
error for any f ∈ E2(R2) is
σN(f , {ψλ}λ∈Λ) = O(N−1
)
H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 12 / 24
Frames for images and optimal sparsity
Gabor frames (Gabor, 1946).
Wavelet frames (Morlet et al., 1984).
Curvelet frames (Cand`es et al., 1999).
Shearlet frames (Kutyniok et al., 2005).
Best N-term approx. error (Donoho, 2001)
Let {ψλ}λ∈Λ ⊂ L2(R2) a frame. The optimal best N-Term approximation
error for any f ∈ E2(R2) is
σN(f , {ψλ}λ∈Λ) = O(N−1
)
Error of 2D-wavelets
σN(f , {ψλ}Λ) ∼ N−1/2
H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 12 / 24
Shearlet Transform (Kutyniok, Guo, Labate, 2005)
Classical Shearlet Transform
f , ψj,k,m =
R2
f (x)ψj,k,m(x)dx
where
SH(ψ) = {ψj,k,m(x) = 23j/4
ψ(SkAj x − m) : (j, k) ∈ Z2
, m ∈ Z2
}
H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 13 / 24
Modification: Cone-adapted Shearlet transform
SH(φ, ψ, ˜ψ, c) := PRΦ(φ, c1) ∪ PC1 Ψ(ψ, c) ∪ PC2
˜Ψ( ˜ψ, c)
H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 14 / 24
Modification: Cone-adapted Shearlet transform
SH(φ, ψ, ˜ψ, c) := PRΦ(φ, c1) ∪ PC1 Ψ(ψ, c) ∪ PC2
˜Ψ( ˜ψ, c)
Cone shearlets sparsity (Band limited case: Lim, Labate; 2006),
(Compactly supported case: Kutyniok, Lim, 2011)
Best N-term approximation error
σN(f , {ψj,k,m}j,k,m) ∼ N−1
(log(N))3/2
H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 14 / 24
Followed Pipeline
H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 15 / 24
Physical Acquisition Setup
Data set: Sequence of 101 rectified pictures of a scene generated by
Professor Markus Gross’ group in the Disney Research Center at
Z¨urich.
H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 16 / 24
Physical Acquisition Setup
Data set: Sequence of 101 rectified pictures of a scene generated by
Professor Markus Gross’ group in the Disney Research Center at
Z¨urich.
Technical details of physical setup: Canon EOS 4D Mark II DSLR
camera, Canon EF 50 mm f/1.4 USM lens and a Zaber T-LST1500D
motorized linear stage to drive the camera to the shooting positions
with 10 mm of distance between each other.
H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 16 / 24
Physical Acquisition Setup
Data set: Sequence of 101 rectified pictures of a scene generated by
Professor Markus Gross’ group in the Disney Research Center at
Z¨urich.
Technical details of physical setup: Canon EOS 4D Mark II DSLR
camera, Canon EF 50 mm f/1.4 USM lens and a Zaber T-LST1500D
motorized linear stage to drive the camera to the shooting positions
with 10 mm of distance between each other.
H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 16 / 24
Used Data Set: Church
H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 17 / 24
Used Data Set: Church
H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 17 / 24
Point Tracking Results
H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 18 / 24
Point Tracking Results
H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 18 / 24
Example of EPI
H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 19 / 24
Example of EPI
H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 19 / 24
Example of EPI
H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 19 / 24
Results on EPIs inpainting
H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 20 / 24
Results on EPIs inpainting
H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 20 / 24
Results on EPIs inpainting
H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 20 / 24
Results on EPIs inpainting
H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 20 / 24
Results on line detection and depth map estimation
H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 21 / 24
Results on line detection and depth map estimation
H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 21 / 24
Results on line detection and depth map estimation
H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 21 / 24
Results on line detection and depth map estimation
H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 21 / 24
Open Hardware Implementation
Raspberry π + Camera module v2
H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 22 / 24
Future work
H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 23 / 24
Future work
H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 23 / 24
Thanks!
Questions?
H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 24 / 24

More Related Content

PDF
A Physical Approach to Moving Cast Shadow Detection (ICASSP 2009)
PDF
Estimating Human Pose from Occluded Images (ACCV 2009)
PDF
Learning Moving Cast Shadows for Foreground Detection (VS 2008)
PDF
Lossy Light Field Compression
PPT
Build Your Own 3D Scanner: 3D Scanning with Structured Lighting
PPTX
ICRA 2015 interactive presentation
PDF
Implementation of Efficiency CORDIC Algorithmfor Sine & Cosine Generation
A Physical Approach to Moving Cast Shadow Detection (ICASSP 2009)
Estimating Human Pose from Occluded Images (ACCV 2009)
Learning Moving Cast Shadows for Foreground Detection (VS 2008)
Lossy Light Field Compression
Build Your Own 3D Scanner: 3D Scanning with Structured Lighting
ICRA 2015 interactive presentation
Implementation of Efficiency CORDIC Algorithmfor Sine & Cosine Generation

What's hot (19)

PPTX
Semantic Mapping of Road Scenes
PDF
Injecting image priors into Learnable Compressive Subsampling
PDF
Cvpr2010 open source vision software, intro and training part viii point clou...
PDF
Graphical Model Selection for Big Data
PDF
2018 IMSM: MIT Lincoln Laboratories Problem Presentation - Light Reflections...
PDF
Slide11 icc2015
PDF
ICRA Nathan Piasco
PDF
Vector Distance Transform Maps for Autonomous Mobile Robot Navigation
PDF
Ice: lightweight, efficient rendering for remote sensing images
 
PDF
Unsupervised Collaborative Learning of Keyframe Detection and Visual Odometry...
PPT
Camera calibration from a single image based on coupled line cameras and rect...
PDF
2018 IMSM: Semi-analytical BRDF-based Quantification of Light Reflection - MI...
DOCX
Dsp model exam qp
PPTX
Representative Previous Work
PDF
CLIM: Transition Workshop - Statistical Emulation with Dimension Reduction fo...
PDF
A Three-Dimensional Representation method for Noisy Point Clouds based on Gro...
PDF
Study of VBF/VBS in the LHC at 13 TeV and available tools
PDF
Cordic Code
PPTX
Urban 3D Semantic Modelling Using Stereo Vision, ICRA 2013
Semantic Mapping of Road Scenes
Injecting image priors into Learnable Compressive Subsampling
Cvpr2010 open source vision software, intro and training part viii point clou...
Graphical Model Selection for Big Data
2018 IMSM: MIT Lincoln Laboratories Problem Presentation - Light Reflections...
Slide11 icc2015
ICRA Nathan Piasco
Vector Distance Transform Maps for Autonomous Mobile Robot Navigation
Ice: lightweight, efficient rendering for remote sensing images
 
Unsupervised Collaborative Learning of Keyframe Detection and Visual Odometry...
Camera calibration from a single image based on coupled line cameras and rect...
2018 IMSM: Semi-analytical BRDF-based Quantification of Light Reflection - MI...
Dsp model exam qp
Representative Previous Work
CLIM: Transition Workshop - Statistical Emulation with Dimension Reduction fo...
A Three-Dimensional Representation method for Noisy Point Clouds based on Gro...
Study of VBF/VBS in the LHC at 13 TeV and available tools
Cordic Code
Urban 3D Semantic Modelling Using Stereo Vision, ICRA 2013
Ad

Similar to LightFields.jl: Fast 3D image reconstruction for VR applications - Hector Andrade Loarca (20)

PPTX
SIGGRAPH 2012 Computational Plenoptic Imaging Course - 4 Light Fields
PDF
Image Restoration for 3D Computer Vision
PPTX
Thesis presentation
PDF
Shearlet Frames and Optimally Sparse Approximations
PPT
Raskar Keynote at Stereoscopic Display Jan 2011
PPTX
Light field
PDF
Accelarating Optical Quadrature Microscopy Using GPUs
PDF
Image Smoothing for Structure Extraction
PPT
Normalized averaging using adaptive applicability functions with applications...
PPTX
Optical Computing for Fast Light Transport Analysis
PDF
Lecture20
PDF
Super resolution imaging using frequency wavelets and three dimensional
PDF
CVPR 2012 Review Seminar - Multi-View Hair Capture using Orientation Fields
PPTX
Optics group research overview
PDF
Lecture 02 yasutaka furukawa - 3 d reconstruction with priors
PDF
Image processing ieee-projects-ocularsystems.in-
PDF
Fcv learn sudderth
PDF
PDF
Compressive Light Field Photography using Overcomplete Dictionaries and Optim...
PDF
Inside the Wavelength: Seeing Really Small Objects with Light - Professor Sir...
SIGGRAPH 2012 Computational Plenoptic Imaging Course - 4 Light Fields
Image Restoration for 3D Computer Vision
Thesis presentation
Shearlet Frames and Optimally Sparse Approximations
Raskar Keynote at Stereoscopic Display Jan 2011
Light field
Accelarating Optical Quadrature Microscopy Using GPUs
Image Smoothing for Structure Extraction
Normalized averaging using adaptive applicability functions with applications...
Optical Computing for Fast Light Transport Analysis
Lecture20
Super resolution imaging using frequency wavelets and three dimensional
CVPR 2012 Review Seminar - Multi-View Hair Capture using Orientation Fields
Optics group research overview
Lecture 02 yasutaka furukawa - 3 d reconstruction with priors
Image processing ieee-projects-ocularsystems.in-
Fcv learn sudderth
Compressive Light Field Photography using Overcomplete Dictionaries and Optim...
Inside the Wavelength: Seeing Really Small Objects with Light - Professor Sir...
Ad

More from PyData (20)

PDF
Michal Mucha: Build and Deploy an End-to-end Streaming NLP Insight System | P...
PDF
Unit testing data with marbles - Jane Stewart Adams, Leif Walsh
PDF
The TileDB Array Data Storage Manager - Stavros Papadopoulos, Jake Bolewski
PDF
Using Embeddings to Understand the Variance and Evolution of Data Science... ...
PDF
Deploying Data Science for Distribution of The New York Times - Anne Bauer
PPTX
Graph Analytics - From the Whiteboard to Your Toolbox - Sam Lerma
PPTX
Do Your Homework! Writing tests for Data Science and Stochastic Code - David ...
PDF
RESTful Machine Learning with Flask and TensorFlow Serving - Carlo Mazzaferro
PDF
Mining dockless bikeshare and dockless scootershare trip data - Stefanie Brod...
PDF
Avoiding Bad Database Surprises: Simulation and Scalability - Steven Lott
PDF
Words in Space - Rebecca Bilbro
PDF
End-to-End Machine learning pipelines for Python driven organizations - Nick ...
PPTX
Pydata beautiful soup - Monica Puerto
PDF
1D Convolutional Neural Networks for Time Series Modeling - Nathan Janos, Jef...
PPTX
Extending Pandas with Custom Types - Will Ayd
PDF
Measuring Model Fairness - Stephen Hoover
PDF
What's the Science in Data Science? - Skipper Seabold
PDF
Applying Statistical Modeling and Machine Learning to Perform Time-Series For...
PDF
Solving very simple substitution ciphers algorithmically - Stephen Enright-Ward
PDF
The Face of Nanomaterials: Insightful Classification Using Deep Learning - An...
Michal Mucha: Build and Deploy an End-to-end Streaming NLP Insight System | P...
Unit testing data with marbles - Jane Stewart Adams, Leif Walsh
The TileDB Array Data Storage Manager - Stavros Papadopoulos, Jake Bolewski
Using Embeddings to Understand the Variance and Evolution of Data Science... ...
Deploying Data Science for Distribution of The New York Times - Anne Bauer
Graph Analytics - From the Whiteboard to Your Toolbox - Sam Lerma
Do Your Homework! Writing tests for Data Science and Stochastic Code - David ...
RESTful Machine Learning with Flask and TensorFlow Serving - Carlo Mazzaferro
Mining dockless bikeshare and dockless scootershare trip data - Stefanie Brod...
Avoiding Bad Database Surprises: Simulation and Scalability - Steven Lott
Words in Space - Rebecca Bilbro
End-to-End Machine learning pipelines for Python driven organizations - Nick ...
Pydata beautiful soup - Monica Puerto
1D Convolutional Neural Networks for Time Series Modeling - Nathan Janos, Jef...
Extending Pandas with Custom Types - Will Ayd
Measuring Model Fairness - Stephen Hoover
What's the Science in Data Science? - Skipper Seabold
Applying Statistical Modeling and Machine Learning to Perform Time-Series For...
Solving very simple substitution ciphers algorithmically - Stephen Enright-Ward
The Face of Nanomaterials: Insightful Classification Using Deep Learning - An...

Recently uploaded (20)

PDF
Microsoft Solutions Partner Drive Digital Transformation with D365.pdf
PDF
Five Habits of High-Impact Board Members
PDF
Enhancing emotion recognition model for a student engagement use case through...
PDF
Video forgery: An extensive analysis of inter-and intra-frame manipulation al...
PDF
From MVP to Full-Scale Product A Startup’s Software Journey.pdf
PDF
NewMind AI Weekly Chronicles – August ’25 Week III
PPT
Geologic Time for studying geology for geologist
PPTX
Web Crawler for Trend Tracking Gen Z Insights.pptx
PPTX
Benefits of Physical activity for teenagers.pptx
PDF
A review of recent deep learning applications in wood surface defect identifi...
PDF
TrustArc Webinar - Click, Consent, Trust: Winning the Privacy Game
PDF
A contest of sentiment analysis: k-nearest neighbor versus neural network
PDF
Architecture types and enterprise applications.pdf
PPTX
Final SEM Unit 1 for mit wpu at pune .pptx
PPTX
Group 1 Presentation -Planning and Decision Making .pptx
PPT
What is a Computer? Input Devices /output devices
PPTX
Chapter 5: Probability Theory and Statistics
PPTX
MicrosoftCybserSecurityReferenceArchitecture-April-2025.pptx
PDF
Developing a website for English-speaking practice to English as a foreign la...
PDF
DP Operators-handbook-extract for the Mautical Institute
Microsoft Solutions Partner Drive Digital Transformation with D365.pdf
Five Habits of High-Impact Board Members
Enhancing emotion recognition model for a student engagement use case through...
Video forgery: An extensive analysis of inter-and intra-frame manipulation al...
From MVP to Full-Scale Product A Startup’s Software Journey.pdf
NewMind AI Weekly Chronicles – August ’25 Week III
Geologic Time for studying geology for geologist
Web Crawler for Trend Tracking Gen Z Insights.pptx
Benefits of Physical activity for teenagers.pptx
A review of recent deep learning applications in wood surface defect identifi...
TrustArc Webinar - Click, Consent, Trust: Winning the Privacy Game
A contest of sentiment analysis: k-nearest neighbor versus neural network
Architecture types and enterprise applications.pdf
Final SEM Unit 1 for mit wpu at pune .pptx
Group 1 Presentation -Planning and Decision Making .pptx
What is a Computer? Input Devices /output devices
Chapter 5: Probability Theory and Statistics
MicrosoftCybserSecurityReferenceArchitecture-April-2025.pptx
Developing a website for English-speaking practice to English as a foreign la...
DP Operators-handbook-extract for the Mautical Institute

LightFields.jl: Fast 3D image reconstruction for VR applications - Hector Andrade Loarca

  • 1. LightFields.jl: Fast 3D image reconstruction for VR applications H´ector Andrade Loarca Technical University of Berlin, BMS 7th of July, 2018 PyData 2018
  • 2. Main goal Present a novel technique to reconstruct the depth map of a scene from a limited number of views. This can be applied in view synthesis and rendering for free viewpoint VR. H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 2 / 24
  • 3. Main goal Present a novel technique to reconstruct the depth map of a scene from a limited number of views. This can be applied in view synthesis and rendering for free viewpoint VR. Explain the main building blocks of the technique: Light Field and Shearlets. H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 2 / 24
  • 4. Main goal Present a novel technique to reconstruct the depth map of a scene from a limited number of views. This can be applied in view synthesis and rendering for free viewpoint VR. Explain the main building blocks of the technique: Light Field and Shearlets. Show a free hardware/software implementation using julia, python and Raspberry Pi. H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 2 / 24
  • 5. What is a Light Field? Light can be interpreted as a field, i.e. assignment of a vector to each point in the space (M. Faraday, 1846). H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 3 / 24
  • 6. What is a Light Field? Light can be interpreted as a field, i.e. assignment of a vector to each point in the space (M. Faraday, 1846). Propagation of light rays in the 3D space is completely described by a 7D continuous function L : R7 −→ R3, L(x, y, z, θ, φ, λ, τ) called the plenoptic function H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 3 / 24
  • 7. What is a Light Field? Light can be interpreted as a field, i.e. assignment of a vector to each point in the space (M. Faraday, 1846). Propagation of light rays in the 3D space is completely described by a 7D continuous function L : R7 −→ R3, L(x, y, z, θ, φ, λ, τ) called the plenoptic function H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 3 / 24
  • 8. What is a Light Field? Light can be interpreted as a field, i.e. assignment of a vector to each point in the space (M. Faraday, 1846). Propagation of light rays in the 3D space is completely described by a 7D continuous function L : R7 −→ R3, L(x, y, z, θ, φ, λ, τ) called the plenoptic function L can be simplified to a 4D function L4, called 4D Light Field or simply Light Field, which quantifies the intensity of static and monochromatic light rays propagating in half space. H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 3 / 24
  • 9. 4D Light Field Representation Figure: Three different representation of 4F LF. Left: L4(u, v, φ, θ). Center: L4(φ1, θ1, φ2, θ2). Right: L4(u, v, s, t). H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 4 / 24
  • 10. 4D Light Field Representation Figure: Three different representation of 4F LF. Left: L4(u, v, φ, θ). Center: L4(φ1, θ1, φ2, θ2). Right: L4(u, v, s, t). Figure: Used representation: ”Two plane parametrization”. H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 4 / 24
  • 11. From LF to 3D Stereo Vision: The human brain generates the 3D depth perception of its sorroundings by triangulating the points of a scene using the information coming from both eyes. H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 5 / 24
  • 12. From LF to 3D Stereo Vision: The human brain generates the 3D depth perception of its sorroundings by triangulating the points of a scene using the information coming from both eyes. Epipolar Geometry: Generalization of Stereo Vision with more than two views, assuming the epipolar constraint. H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 5 / 24
  • 13. From LF to 3D Stereo Vision: The human brain generates the 3D depth perception of its sorroundings by triangulating the points of a scene using the information coming from both eyes. Epipolar Geometry: Generalization of Stereo Vision with more than two views, assuming the epipolar constraint. Epipolar Constraint: Analysis of object position while assuming the knowledge of the camera motion. H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 5 / 24
  • 14. Epipolar Plane Images (EPIs) on Straight Line Trajectories H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 6 / 24
  • 15. Depth map estimation with EPIs Point-depth formula: D = h∆X ∆u = h ∆X u1−u2. Sampling rate (Nyquist criterion): ∆X ≤ Dmin h ∆u. H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 7 / 24
  • 16. Commercial LF (Epipolar) camera H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 8 / 24
  • 17. Our approach: Sub-Nyquist reconstruction via inpainting H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 9 / 24
  • 18. (General) Image inpainting Mathematical formulation Recover an image f ∈ X from known data: g = PK (f ) where PK is and orthogonal projection onto the known subspace XK X. H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 10 / 24
  • 19. How to inpaint? Frame A frame for a Hilbert space X is a collection Ψ = {ψi }i∈I ⊂ X satisfying A||f ||2 ≤ ||{ f , ψi }i∈I|| 2(I) ≤ B||f ||2 ∀f ∈ X for some 0 < A ≤ B < ∞. Sparse Regularization/CS approach (Genzel, Kutyniok, 2014): ” If a signal (image) is sparse within a frame Ψ, it can be recovered from highly underdetermined, non-adaptive linear measurements by 1-regularization, i.e. min ˜f ∈X ||{ ˜f , ψi }i∈I|| 1(I) s.t. PK (˜f ) = g = PK (f ) ” H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 11 / 24
  • 20. Frames for images and optimal sparsity Gabor frames (Gabor, 1946). Wavelet frames (Morlet et al., 1984). Curvelet frames (Cand`es et al., 1999). Shearlet frames (Kutyniok et al., 2005). H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 12 / 24
  • 21. Frames for images and optimal sparsity Gabor frames (Gabor, 1946). Wavelet frames (Morlet et al., 1984). Curvelet frames (Cand`es et al., 1999). Shearlet frames (Kutyniok et al., 2005). Best N-term approx. error (Donoho, 2001) Let {ψλ}λ∈Λ ⊂ L2(R2) a frame. The optimal best N-Term approximation error for any f ∈ E2(R2) is σN(f , {ψλ}λ∈Λ) = O(N−1 ) H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 12 / 24
  • 22. Frames for images and optimal sparsity Gabor frames (Gabor, 1946). Wavelet frames (Morlet et al., 1984). Curvelet frames (Cand`es et al., 1999). Shearlet frames (Kutyniok et al., 2005). Best N-term approx. error (Donoho, 2001) Let {ψλ}λ∈Λ ⊂ L2(R2) a frame. The optimal best N-Term approximation error for any f ∈ E2(R2) is σN(f , {ψλ}λ∈Λ) = O(N−1 ) Error of 2D-wavelets σN(f , {ψλ}Λ) ∼ N−1/2 H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 12 / 24
  • 23. Shearlet Transform (Kutyniok, Guo, Labate, 2005) Classical Shearlet Transform f , ψj,k,m = R2 f (x)ψj,k,m(x)dx where SH(ψ) = {ψj,k,m(x) = 23j/4 ψ(SkAj x − m) : (j, k) ∈ Z2 , m ∈ Z2 } H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 13 / 24
  • 24. Modification: Cone-adapted Shearlet transform SH(φ, ψ, ˜ψ, c) := PRΦ(φ, c1) ∪ PC1 Ψ(ψ, c) ∪ PC2 ˜Ψ( ˜ψ, c) H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 14 / 24
  • 25. Modification: Cone-adapted Shearlet transform SH(φ, ψ, ˜ψ, c) := PRΦ(φ, c1) ∪ PC1 Ψ(ψ, c) ∪ PC2 ˜Ψ( ˜ψ, c) Cone shearlets sparsity (Band limited case: Lim, Labate; 2006), (Compactly supported case: Kutyniok, Lim, 2011) Best N-term approximation error σN(f , {ψj,k,m}j,k,m) ∼ N−1 (log(N))3/2 H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 14 / 24
  • 26. Followed Pipeline H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 15 / 24
  • 27. Physical Acquisition Setup Data set: Sequence of 101 rectified pictures of a scene generated by Professor Markus Gross’ group in the Disney Research Center at Z¨urich. H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 16 / 24
  • 28. Physical Acquisition Setup Data set: Sequence of 101 rectified pictures of a scene generated by Professor Markus Gross’ group in the Disney Research Center at Z¨urich. Technical details of physical setup: Canon EOS 4D Mark II DSLR camera, Canon EF 50 mm f/1.4 USM lens and a Zaber T-LST1500D motorized linear stage to drive the camera to the shooting positions with 10 mm of distance between each other. H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 16 / 24
  • 29. Physical Acquisition Setup Data set: Sequence of 101 rectified pictures of a scene generated by Professor Markus Gross’ group in the Disney Research Center at Z¨urich. Technical details of physical setup: Canon EOS 4D Mark II DSLR camera, Canon EF 50 mm f/1.4 USM lens and a Zaber T-LST1500D motorized linear stage to drive the camera to the shooting positions with 10 mm of distance between each other. H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 16 / 24
  • 30. Used Data Set: Church H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 17 / 24
  • 31. Used Data Set: Church H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 17 / 24
  • 32. Point Tracking Results H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 18 / 24
  • 33. Point Tracking Results H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 18 / 24
  • 34. Example of EPI H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 19 / 24
  • 35. Example of EPI H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 19 / 24
  • 36. Example of EPI H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 19 / 24
  • 37. Results on EPIs inpainting H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 20 / 24
  • 38. Results on EPIs inpainting H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 20 / 24
  • 39. Results on EPIs inpainting H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 20 / 24
  • 40. Results on EPIs inpainting H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 20 / 24
  • 41. Results on line detection and depth map estimation H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 21 / 24
  • 42. Results on line detection and depth map estimation H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 21 / 24
  • 43. Results on line detection and depth map estimation H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 21 / 24
  • 44. Results on line detection and depth map estimation H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 21 / 24
  • 45. Open Hardware Implementation Raspberry π + Camera module v2 H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 22 / 24
  • 46. Future work H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 23 / 24
  • 47. Future work H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 23 / 24
  • 48. Thanks! Questions? H´ector Andrade Loarca (TUB) LightFields.jl PyData Berlin 2018 24 / 24