SlideShare a Scribd company logo
N. H. Shimada
Differentiable Ray Sampling 

for Neural 3D Representation

Preferred Networks 2019 Research Internship
Single-view 3D reconstruction
・Grasping ・Autonomous driving
[Yan+ ICRA 2018] [Mapillary blog]
Single-view 3D reconstruction
● 3D supervision
○ A large number of 3D datas are needed.
[Kato+ CVPR 2019]
Input
(image)
Output
(3D geometry)
prediction model
Single-view 3D reconstruction
● 2D supervision
○ End-to-end training: only 2D images.
○ Differentiable renderer is needed.
[Kato+ CVPR 2019]
Input
(image)
prediction model
Rendering
3D geometry Output
(image)
Single-view 3D reconstruction
● 3D Geometry representation
1. [Kato+ CVPR 2017]
2. [Tulsiani+ CVPR 2018]
3. [Sitzmann+ arXiv 2019]
Mesh1
Voxel2 Neural 3D
(SRN3
)
Neural 3D
(Ours)
initial shape ✕ ◯ ◯ ◯
memory
vs
resolution
◯ ✕ ◯ ◯
the number
of train views
◯ ◯ (✕) ◯
Accuracy
(IoU)
0.71 0.73 - ???
DRC (Tulsiani+ CVPR 2017)
Encoder
Decoder
Input
(image)
323
voxel
(occupancy)
Rendered
image
DRC (Tulsiani+ CVPR 2017)
● Differentiable rendering
DRC (Tulsiani+ CVPR 2017)
Input
(RGB) Input
(RGB)
Ground truth Prediction
Prediction
Ours
Voxel grid representation as function :
(xi
, yi
, zi
) → (Occupancy)
323
discrete input
Memory increases cubically with higher resolution
DRC (Tulsiani+ CVPR 2017) Our idea
x
y
z
Occupancy
Neural 3D representation :

(x, y, z) → (Occupancy)
Continuous input
Constant memory with high resolution
Ours
● Differentiable ray sampling
d
 Translation probability
Pixel value
in mask images
0 1
Ours
Encoder
Decoder
Input
(image)
Rendered
image
parameters
x
y
z
3D Networks
Results
● 1 instance Ground
truth
Prediction Diff
IoU
(DRC)
0.53
(0.43)
Voxelized 3D (sliced image)
{prediction, gt, diff}
0.81
(0.73)
Car
Chair
Results
● Multi-instance (Qualitative)
Ground
truth
Prediction Diff
Input
RGB
Car Chair
Results
● Multi-instance (Quantitative)
Accuracy
(IoU)
Voxel
(DRC1
)
Neural 3D
(Ours)
Car 0.73 0.72
Chair 0.43 0.44
Results
● Multi-instance (Loss plots)
Car Chair
SRN (Sitzmann+ NIPS 2019)
Encoder
Decoder
Input
(image)
Rendered
image
parameters
x
y
z
3D Networks
pixel generator
SDF (?)
di
d1
d2
d0
The part of rendering is also a networks.
→ 50 images per 1 object for training

More Related Content

PDF
[第2回3D勉強会 研究紹介] Neural 3D Mesh Renderer (CVPR 2018)
PPTX
[DL輪読会]EfficientDet: Scalable and Efficient Object Detection
PPTX
[DL輪読会]Deep High-Resolution Representation Learning for Human Pose Estimation
PDF
ドロネー三角形分割
PPTX
関東コンピュータビジョン勉強会
PDF
Semantic segmentation
PPTX
SegFormer: Simple and Efficient Design for Semantic Segmentation with Transfo...
PDF
LiDAR点群とSfM点群との位置合わせ
[第2回3D勉強会 研究紹介] Neural 3D Mesh Renderer (CVPR 2018)
[DL輪読会]EfficientDet: Scalable and Efficient Object Detection
[DL輪読会]Deep High-Resolution Representation Learning for Human Pose Estimation
ドロネー三角形分割
関東コンピュータビジョン勉強会
Semantic segmentation
SegFormer: Simple and Efficient Design for Semantic Segmentation with Transfo...
LiDAR点群とSfM点群との位置合わせ

What's hot (20)

PDF
第3回WBAレクチャー:BRAに基づく海馬体の確率的生成モデルの構築
PDF
中級グラフィックス入門~シャドウマッピング総まとめ~
PDF
4 データ間の距離と類似度
PPTX
【DL輪読会】DiffRF: Rendering-guided 3D Radiance Field Diffusion [N. Muller+ CVPR2...
PDF
[DL輪読会]High-Quality Self-Supervised Deep Image Denoising
PDF
Introduction to YOLO detection model
PPTX
【DL輪読会】"Instant Neural Graphics Primitives with a Multiresolution Hash Encoding"
PDF
20190131 lidar-camera fusion semantic segmentation survey
PDF
SSII2022 [TS1] Transformerの最前線〜 畳込みニューラルネットワークの先へ 〜
PDF
三次元表現まとめ(深層学習を中心に)
PDF
object detection with lidar-camera fusion: survey
PDF
【論文調査】XAI技術の効能を ユーザ実験で評価する研究
PPTX
多目的遺伝的アルゴリズム
PDF
夏のトップカンファレンス論文読み会 / Realtime Multi-Person 2D Pose Estimation using Part Affin...
PDF
【メタサーベイ】Neural Fields
PDF
人間の視覚的注意を予測するモデル - 動的ベイジアンネットワークに基づく 最新のアプローチ -
PPTX
SfM Learner系単眼深度推定手法について
PPTX
Semi supervised, weakly-supervised, unsupervised, and active learning
PDF
マルチコアを用いた画像処理
PDF
【DL輪読会】GAN-Supervised Dense Visual Alignment (CVPR 2022)
第3回WBAレクチャー:BRAに基づく海馬体の確率的生成モデルの構築
中級グラフィックス入門~シャドウマッピング総まとめ~
4 データ間の距離と類似度
【DL輪読会】DiffRF: Rendering-guided 3D Radiance Field Diffusion [N. Muller+ CVPR2...
[DL輪読会]High-Quality Self-Supervised Deep Image Denoising
Introduction to YOLO detection model
【DL輪読会】"Instant Neural Graphics Primitives with a Multiresolution Hash Encoding"
20190131 lidar-camera fusion semantic segmentation survey
SSII2022 [TS1] Transformerの最前線〜 畳込みニューラルネットワークの先へ 〜
三次元表現まとめ(深層学習を中心に)
object detection with lidar-camera fusion: survey
【論文調査】XAI技術の効能を ユーザ実験で評価する研究
多目的遺伝的アルゴリズム
夏のトップカンファレンス論文読み会 / Realtime Multi-Person 2D Pose Estimation using Part Affin...
【メタサーベイ】Neural Fields
人間の視覚的注意を予測するモデル - 動的ベイジアンネットワークに基づく 最新のアプローチ -
SfM Learner系単眼深度推定手法について
Semi supervised, weakly-supervised, unsupervised, and active learning
マルチコアを用いた画像処理
【DL輪読会】GAN-Supervised Dense Visual Alignment (CVPR 2022)
Ad

Similar to Differentiable Ray Sampling for Neural 3D Representation (9)

PPTX
Scene Representation Networks: Continuous 3D-Structure-Aware Neural Scene Rep...
PPTX
Scene Representation Networks(NIPS 2019)_OJung
PDF
Introduction to 3D Computer Vision and Differentiable Rendering
PDF
Deep single view 3 d object reconstruction with visual hull
PPTX
Summary of survey papers on deep learning method to 3D data
PDF
Burnaev and Notchenko. Skoltech. Bridging gap between 2D and 3D with Deep Lea...
PPTX
[NS][Lab_Seminar_240611]Graph R-CNN.pptx
PDF
Learning to Perceive the 3D World
PPTX
Emily Denton - Unsupervised Learning of Disentangled Representations from Vid...
Scene Representation Networks: Continuous 3D-Structure-Aware Neural Scene Rep...
Scene Representation Networks(NIPS 2019)_OJung
Introduction to 3D Computer Vision and Differentiable Rendering
Deep single view 3 d object reconstruction with visual hull
Summary of survey papers on deep learning method to 3D data
Burnaev and Notchenko. Skoltech. Bridging gap between 2D and 3D with Deep Lea...
[NS][Lab_Seminar_240611]Graph R-CNN.pptx
Learning to Perceive the 3D World
Emily Denton - Unsupervised Learning of Disentangled Representations from Vid...
Ad

More from Preferred Networks (20)

PDF
PodSecurityPolicy からGatekeeper に移行しました / Kubernetes Meetup Tokyo #57
PDF
Optunaを使ったHuman-in-the-loop最適化の紹介 - 2023/04/27 W&B 東京ミートアップ #3
PDF
Kubernetes + containerd で cgroup v2 に移行したら "failed to create fsnotify watcher...
PDF
深層学習の新しい応用と、 それを支える計算機の進化 - Preferred Networks CEO 西川徹 (SEMICON Japan 2022 Ke...
PDF
Kubernetes ControllerをScale-Outさせる方法 / Kubernetes Meetup Tokyo #55
PDF
Kaggle Happywhaleコンペ優勝解法でのOptuna使用事例 - 2022/12/10 Optuna Meetup #2
PDF
最新リリース:Optuna V3の全て - 2022/12/10 Optuna Meetup #2
PDF
Optuna Dashboardの紹介と設計解説 - 2022/12/10 Optuna Meetup #2
PDF
スタートアップが提案する2030年の材料開発 - 2022/11/11 QPARC講演
PPTX
Deep Learningのための専用プロセッサ「MN-Core」の開発と活用(2022/10/19東大大学院「 融合情報学特別講義Ⅲ」)
PPTX
PFNにおける研究開発(2022/10/19 東大大学院「融合情報学特別講義Ⅲ」)
PDF
自然言語処理を 役立てるのはなぜ難しいのか(2022/10/25東大大学院「自然言語処理応用」)
PDF
Kubernetes にこれから入るかもしれない注目機能!(2022年11月版) / TechFeed Experts Night #7 〜 コンテナ技術を語る
PDF
Matlantis™のニューラルネットワークポテンシャルPFPの適用範囲拡張
PDF
PFNのオンプレ計算機クラスタの取り組み_第55回情報科学若手の会
PDF
続・PFN のオンプレML基盤の取り組み / オンプレML基盤 on Kubernetes 〜PFN、ヤフー〜 #2
PDF
Kubernetes Service Account As Multi-Cloud Identity / Cloud Native Security Co...
PDF
KubeCon + CloudNativeCon Europe 2022 Recap / Kubernetes Meetup Tokyo #51 / #k...
PDF
KubeCon + CloudNativeCon Europe 2022 Recap - Batch/HPCの潮流とScheduler拡張事例 / Kub...
PDF
独断と偏見で選んだ Kubernetes 1.24 の注目機能と今後! / Kubernetes Meetup Tokyo 50
PodSecurityPolicy からGatekeeper に移行しました / Kubernetes Meetup Tokyo #57
Optunaを使ったHuman-in-the-loop最適化の紹介 - 2023/04/27 W&B 東京ミートアップ #3
Kubernetes + containerd で cgroup v2 に移行したら "failed to create fsnotify watcher...
深層学習の新しい応用と、 それを支える計算機の進化 - Preferred Networks CEO 西川徹 (SEMICON Japan 2022 Ke...
Kubernetes ControllerをScale-Outさせる方法 / Kubernetes Meetup Tokyo #55
Kaggle Happywhaleコンペ優勝解法でのOptuna使用事例 - 2022/12/10 Optuna Meetup #2
最新リリース:Optuna V3の全て - 2022/12/10 Optuna Meetup #2
Optuna Dashboardの紹介と設計解説 - 2022/12/10 Optuna Meetup #2
スタートアップが提案する2030年の材料開発 - 2022/11/11 QPARC講演
Deep Learningのための専用プロセッサ「MN-Core」の開発と活用(2022/10/19東大大学院「 融合情報学特別講義Ⅲ」)
PFNにおける研究開発(2022/10/19 東大大学院「融合情報学特別講義Ⅲ」)
自然言語処理を 役立てるのはなぜ難しいのか(2022/10/25東大大学院「自然言語処理応用」)
Kubernetes にこれから入るかもしれない注目機能!(2022年11月版) / TechFeed Experts Night #7 〜 コンテナ技術を語る
Matlantis™のニューラルネットワークポテンシャルPFPの適用範囲拡張
PFNのオンプレ計算機クラスタの取り組み_第55回情報科学若手の会
続・PFN のオンプレML基盤の取り組み / オンプレML基盤 on Kubernetes 〜PFN、ヤフー〜 #2
Kubernetes Service Account As Multi-Cloud Identity / Cloud Native Security Co...
KubeCon + CloudNativeCon Europe 2022 Recap / Kubernetes Meetup Tokyo #51 / #k...
KubeCon + CloudNativeCon Europe 2022 Recap - Batch/HPCの潮流とScheduler拡張事例 / Kub...
独断と偏見で選んだ Kubernetes 1.24 の注目機能と今後! / Kubernetes Meetup Tokyo 50

Recently uploaded (20)

PDF
Review of recent advances in non-invasive hemoglobin estimation
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PDF
Unlocking AI with Model Context Protocol (MCP)
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
Spectral efficient network and resource selection model in 5G networks
PDF
Approach and Philosophy of On baking technology
PPTX
Programs and apps: productivity, graphics, security and other tools
DOCX
The AUB Centre for AI in Media Proposal.docx
PPTX
MYSQL Presentation for SQL database connectivity
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PDF
Electronic commerce courselecture one. Pdf
PPTX
Cloud computing and distributed systems.
PDF
Encapsulation theory and applications.pdf
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PDF
Machine learning based COVID-19 study performance prediction
PDF
KodekX | Application Modernization Development
Review of recent advances in non-invasive hemoglobin estimation
Chapter 3 Spatial Domain Image Processing.pdf
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
Advanced methodologies resolving dimensionality complications for autism neur...
Understanding_Digital_Forensics_Presentation.pptx
Unlocking AI with Model Context Protocol (MCP)
Digital-Transformation-Roadmap-for-Companies.pptx
Spectral efficient network and resource selection model in 5G networks
Approach and Philosophy of On baking technology
Programs and apps: productivity, graphics, security and other tools
The AUB Centre for AI in Media Proposal.docx
MYSQL Presentation for SQL database connectivity
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
Electronic commerce courselecture one. Pdf
Cloud computing and distributed systems.
Encapsulation theory and applications.pdf
Building Integrated photovoltaic BIPV_UPV.pdf
Dropbox Q2 2025 Financial Results & Investor Presentation
Machine learning based COVID-19 study performance prediction
KodekX | Application Modernization Development

Differentiable Ray Sampling for Neural 3D Representation