SlideShare a Scribd company logo
Deep Marching Tetrahedra: a Hybrid
Representation for High-Resolution 3D Shape
Synthesis
NeurIPS 2021
Tianchang Shen, Jun Gao, Kangxue Yin, Ming-Yu Liu, Sanja Fidler
High-quality 3D Content Creation
Meshy [2024]
LRM [Adobe,
2023]
Shap-E [OpenAI, 2023]
Automatic generation of 3D assets has attracted major industrial attention given its application in
● AR/VR ● Game industry ● Film industry
Representation for 3D Content Creation
The core of a 3D generation system is the choice of 3D
representation:
● Unlike 2D image generation, the “optimal” representation for
3D generation is still debatable.
Representation for 3D Content Creation
The core of a 3D generation system is the choice of 3D representation:
● Unlike 2D image generation, the “optimal” representation for 3D
generation is still debatable.
DMTet proposes a new 3D representation and design their model based
on it:
● enables them to achieve surprisingly good results at that time (2021).
Advantage of DMTet representation
DMTet proposes a differentiable shape representation that
marries implicit representations and meshes:
Advantage of DMTet representation
DMTet proposes a differentiable shape representation that
marries implicit representations and meshes:
● Takes the best of both worlds.
Unlike most mesh
representations,
not restricted to
fixed topologies.
Unlike implicit
representations,
directly leverages
surface losses.
DMTet: Overview
● Deformable tetrahedral grid
● Selective tetrahedra subdivision
● Differentiable marching tetrahedra layer and surface subdivision
● Learning a generative model with DMTet
Deformable tetrahedral grid
● The base 3D representation is the deformable tetrahedral grid (from DefTet):
○ Fully tetrahedralizes (or, tightly covers) the 3D unit cube.
○ Basic unit is tetrahedron. Each tetrahedron consists of 4 vertices and 4 faces.
○ Vertex records SDF values, interpolate for any other 3D points in the tetrahedron.
Learning Deformable Tetrahedral Meshes for 3D Reconstruction, Gao et al., NeurIPS 2020
tetrahed
ron
2D
illustration
Tetrahedra subdivision
● The representation is multi-scale:
○ When SDF sign changes between vertices, can subdivide the
tetrahedra and its neighbors;
○ Subdivision: adding the midpoints as vertices and use average SDF
value.
Each surface tet.(blue) is
divided into 8 tet.(red) by
adding midpoints.
2D
illustration
Tetrahedra to surface
● Marching Tetrahedra algorithm:
○ Converts SDF grids into explicit triangular mesh.
○ Only 3 unique surface-generation configurations with rotation symmetry.
○ Mesh vertex calculated as linear interpolation of tetrahedra vertices
using SDF for weighting.
○ Gradient directly backpropagated to SDF values and vertex positions.
Tetrahedra to surface
● Marching Tetrahedra algorithm:
○ Converts SDF grids into explicit triangular mesh.
○ Only 3 unique surface-generation configurations with rotation symmetry.
○ Mesh vertex calculated as linear interpolation of tetrahedra vertices using SDF for
weighting.
○ Gradient directly backpropagated to SDF values and vertex positions.
● The mesh can be further subdivided following simplified Neural Subdivision
Neural subdivision, Liu et al., TOG 2020
Generative modeling based on tedrahedron
● Problem setting: Point-conditioned or voxel-conditioned shape
generation.
● Input Encoder is PVCNN: extract feature volume from point
clouds.
● SDF prediction via MLP:
● MLP also output features for each vertices for surface refinement
Learning surface refinement with GCN
● Build a graph using all surface tetrahedra;
● Learn a GCN to dynamically deform the vertices and update the SDF;
● Subdivide surface tetrahedra and prune unused tetrahedra;
● Further refinement and subdivision of the extracted mesh.
Mesh discriminator
● Learn an additional discriminator for better local details:
○ 3D CNN Discriminator from DECOR-GAN.
○ Input is local NxNxN SDF grid, which can be differentiably calculated
from mesh.
Decor-gan: 3d shape detailization by conditional refinement, Chen et al., CVPR 2021
Learning objectives
● Surface alignment:
○ Chamfer Distance (with normal) between prediction and groundtruth
Learning objectives
● Surface alignment:
○ Chamfer Distance (with normal) between prediction and groundtruth
● Adversarial loss follows LSGAN
Least squares generative adversarial networks, Mao et al., ICCV 2017
Learning objectives
● Surface alignment:
○ Chamfer Distance (with normal) between prediction and groundtruth
● Adversarial loss follows LSGAN
● Direct SDF loss:
○ Make sure whole space receives supervision signal and avoid
flipping of SDF signs
Least squares generative adversarial networks, Mao et al., ICCV 2017
Learning objectives
● Surface alignment:
○ Chamfer Distance (with normal) between prediction and groundtruth
● Adversarial loss follows LSGAN
● Direct SDF loss:
○ Make sure whole space receives supervision signal and avoid
flipping of SDF signs
● Deformation regularization:
○ Make sure the deformation stays small.
Final loss is a weighted average of all these.
Least squares generative adversarial networks, Mao et al., ICCV 2017
Learning objectives
● Surface alignment:
○ Chamfer Distance (with normal) between prediction and groundtruth
● Adversarial loss follows LSGAN
● Direct SDF loss:
○ Make sure whole space receives supervision signal and avoid
flipping of SDF signs
● Deformation regularization:
○ Make sure the deformation stays small.
● Final loss is a weighted average of all these.
Least squares generative adversarial networks, Mao et al., ICCV 2017
Results – Point cloud conditioned generation on ShapeNet
Results – Voxel conditioned generation on animals
Why is DMTet better than previous approaches?
Why is DMTet better than previous approaches?
● No topology constraints;
● Direct surface supervision and regularization;
● Less artifacts given the deformable grids;
● Better efficiency and faithfulness of MT over MC under low
sampling budget.
Flexible Isosurface Extraction for
Gradient-Based Mesh
Optimization
Tianchang Shen, Jacob Munkberg, Jon Hasselgren, Kangxue Yin, Zian Wang,
Wenzheng Chen, Zan Gojcic, Sanja Fidler, Nicholas Sharp, Jun Gao
Slides credit: Most of slides are adapted from the video “[SIGGRAPH'23] Flexible Isosurface Extraction for Gradient-Based Mesh
Optimization”
Generating Mesh via Gradient-Based Optimization
Allow local flexible
adjustments
Existing Isosurface Extraction Methods
Marching Cube
Marching Cubes Increase grid resolution
Signed Distance
Field
Mesh vertices are
constrained on lattice
edges
Small triangles in low
curvature areas
Requires more SDF
samples
Deep Marching Tetrahedra (DMTet)
Marching
Tetrahedra
Marching
Tetrahedra [1]
DMTet
[1] Image credit: Jun Gao on Towards Generative Modeling of 3D Objects Learned from Images | Toronto AIR Seminar
No independent
repositioning of
vertices
Why DMTet is
deformable?
Dual Contouring
Dual Contouring
Produce non-manifold
results
Vertex outside the
cube
Previous Methods Fail to Satisfy Both Properties
FlexiCubes
Improve fitting with additional flexibility + Well-defined gradient
differentiation
FlexiCubes
Improve fitting with additional flexibility + Well-defined gradient
differentiation
Recall prior dual methods diverge during optimization.
Dual Marching Cubes (Centroid)
No QEF for vertex
positioning
Lack of freedom
Quadrilateral faces
FlexiCubes
FlexiCubes introduces 3 types of parameters into Dual Marching
Cubes:
- Interpolation weights to position dual vertices in space.
- Splitting weights to control how to split quadrilaterals into
triangles.
- Deformation vectors for spatial alignment.
(DMTet)
Interpolation weights
𝛼 per-cell adjusting interpolation along each edge
𝛽 per-cell adjusting vertex position within each dual face
𝛼, 𝛽 ℝ+
∈
Splitting weights
Raw output from Dual Marching Cubes are typically non-planar quadrilateral faces
- Quad-split weights 𝛾 controlling how quads get split to tris.
During optimization, 𝛾
interpolates the surface
between two possible splits.
At inference, we
split along the
diagonal with
Ablating Parameters
Extensions
Tetrahedral mesh extraction Hierarchically adaptive meshing
Applications
Photogrammetry Through Differentiable Rendering
Nvdiffrec jointly optimizes shape, materials, and lighting from images.
FlexiCubes improves geometric fidelity and mesh quality.
3D Mesh Generation
FlexiCubes can serve as a plug-and-play differentiable mesh extraction module
in a 3D generative model
GET3D w/ FlexiCubes generates meshes with better details and tessellation.
Differentiable Physics Simulation with Tetrahedral Mesh
Combine Flexicubes with a differentiable physics simulation framework
(gradSim) and a differentiable rendering pipeline (nvdiffrast)
Jointly recover 3D shapes and physical parameters from multi-view videos
Mesh Simplification of Animated Objects
FlexiCubes allows us to differentiably skin and deform the mesh, and
simultaneously optimize with respect to the entire animated sequence.
End-to-end optimization w/ FlexiCubes avoids artifacts from mesh
stretching.
Adding Mesh Regularizations
FlexiCubes representation is flexible enough to support optimizing
regularizations defined on meshes

More Related Content

PDF
TransNeRF
PDF
Introduction to 3D Computer Vision and Differentiable Rendering
PPTX
DDGK: Learning Graph Representations for Deep Divergence Graph Kernels
PPTX
FastV2C-HandNet - ICICC 2020
PDF
HardNet: Convolutional Network for Local Image Description
PDF
3Dshape Analysis Matching Ajmmmmmmmmmmmmm
PPTX
Semantic Segmentation on Satellite Imagery
PPTX
Grid 2.pptx
TransNeRF
Introduction to 3D Computer Vision and Differentiable Rendering
DDGK: Learning Graph Representations for Deep Divergence Graph Kernels
FastV2C-HandNet - ICICC 2020
HardNet: Convolutional Network for Local Image Description
3Dshape Analysis Matching Ajmmmmmmmmmmmmm
Semantic Segmentation on Satellite Imagery
Grid 2.pptx

Similar to Deep Marching Tetrahedra: a Hybrid Representation for High-Resolution 3D Shape Synthesis (20)

PDF
ShaderX³: Geometry Manipulation - Morphing between two different objects
PDF
DAOR - Bridging the Gap between Community and Node Representations: Graph Emb...
PDF
3-d interpretation from stereo images for autonomous driving
PDF
Stereo Matching by Deep Learning
PDF
10.1007@978 3-319-29504-657
PDF
3-d interpretation from single 2-d image for autonomous driving II
PDF
AR/SLAM for end-users
PDF
Deep learning for 3 d point clouds presentation
PDF
(Paper Review)3D shape reconstruction from sketches via multi view convolutio...
PPTX
[NS][Lab_Seminar_250421]SignGraph: A Sign Sequence is Worth Graphs of Nodes.pptx
PDF
3 d video streaming for virtual exploration of planet surface
PPTX
[NS][Lab_Seminar_250120]HandDAGT: A Denoising Adaptive Graph Transformer for ...
PDF
Best Techniques of Point cloud to 3D.pdf
PDF
Deep 3D Visual Analysis - Javier Ruiz-Hidalgo - UPC Barcelona 2017
PDF
Point-GNN: Graph Neural Network for 3D Object Detection in a Point Cloud
PPTX
3D Image visualization
PDF
Learning to Perceive the 3D World
PPTX
Beginning direct3d gameprogramming01_thehistoryofdirect3dgraphics_20160407_ji...
PPTX
[NS][Lab_Seminar_240819]Re_PolyWorld.pptx
PDF
Domain adaptation for Image Segmentation
ShaderX³: Geometry Manipulation - Morphing between two different objects
DAOR - Bridging the Gap between Community and Node Representations: Graph Emb...
3-d interpretation from stereo images for autonomous driving
Stereo Matching by Deep Learning
10.1007@978 3-319-29504-657
3-d interpretation from single 2-d image for autonomous driving II
AR/SLAM for end-users
Deep learning for 3 d point clouds presentation
(Paper Review)3D shape reconstruction from sketches via multi view convolutio...
[NS][Lab_Seminar_250421]SignGraph: A Sign Sequence is Worth Graphs of Nodes.pptx
3 d video streaming for virtual exploration of planet surface
[NS][Lab_Seminar_250120]HandDAGT: A Denoising Adaptive Graph Transformer for ...
Best Techniques of Point cloud to 3D.pdf
Deep 3D Visual Analysis - Javier Ruiz-Hidalgo - UPC Barcelona 2017
Point-GNN: Graph Neural Network for 3D Object Detection in a Point Cloud
3D Image visualization
Learning to Perceive the 3D World
Beginning direct3d gameprogramming01_thehistoryofdirect3dgraphics_20160407_ji...
[NS][Lab_Seminar_240819]Re_PolyWorld.pptx
Domain adaptation for Image Segmentation
Ad

Recently uploaded (20)

PDF
Mitigating Risks through Effective Management for Enhancing Organizational Pe...
DOCX
ASol_English-Language-Literature-Set-1-27-02-2023-converted.docx
PPT
Introduction, IoT Design Methodology, Case Study on IoT System for Weather Mo...
PDF
SM_6th-Sem__Cse_Internet-of-Things.pdf IOT
PDF
composite construction of structures.pdf
PPTX
CARTOGRAPHY AND GEOINFORMATION VISUALIZATION chapter1 NPTE (2).pptx
PPTX
Lecture Notes Electrical Wiring System Components
PPTX
Engineering Ethics, Safety and Environment [Autosaved] (1).pptx
PPTX
UNIT 4 Total Quality Management .pptx
PDF
Mohammad Mahdi Farshadian CV - Prospective PhD Student 2026
PDF
Embodied AI: Ushering in the Next Era of Intelligent Systems
PDF
Operating System & Kernel Study Guide-1 - converted.pdf
PDF
Model Code of Practice - Construction Work - 21102022 .pdf
PPTX
MET 305 2019 SCHEME MODULE 2 COMPLETE.pptx
PPTX
web development for engineering and engineering
PPTX
Sustainable Sites - Green Building Construction
PPT
Project quality management in manufacturing
PPTX
Internet of Things (IOT) - A guide to understanding
PPTX
Safety Seminar civil to be ensured for safe working.
PPT
introduction to datamining and warehousing
Mitigating Risks through Effective Management for Enhancing Organizational Pe...
ASol_English-Language-Literature-Set-1-27-02-2023-converted.docx
Introduction, IoT Design Methodology, Case Study on IoT System for Weather Mo...
SM_6th-Sem__Cse_Internet-of-Things.pdf IOT
composite construction of structures.pdf
CARTOGRAPHY AND GEOINFORMATION VISUALIZATION chapter1 NPTE (2).pptx
Lecture Notes Electrical Wiring System Components
Engineering Ethics, Safety and Environment [Autosaved] (1).pptx
UNIT 4 Total Quality Management .pptx
Mohammad Mahdi Farshadian CV - Prospective PhD Student 2026
Embodied AI: Ushering in the Next Era of Intelligent Systems
Operating System & Kernel Study Guide-1 - converted.pdf
Model Code of Practice - Construction Work - 21102022 .pdf
MET 305 2019 SCHEME MODULE 2 COMPLETE.pptx
web development for engineering and engineering
Sustainable Sites - Green Building Construction
Project quality management in manufacturing
Internet of Things (IOT) - A guide to understanding
Safety Seminar civil to be ensured for safe working.
introduction to datamining and warehousing
Ad

Deep Marching Tetrahedra: a Hybrid Representation for High-Resolution 3D Shape Synthesis

  • 1. Deep Marching Tetrahedra: a Hybrid Representation for High-Resolution 3D Shape Synthesis NeurIPS 2021 Tianchang Shen, Jun Gao, Kangxue Yin, Ming-Yu Liu, Sanja Fidler
  • 2. High-quality 3D Content Creation Meshy [2024] LRM [Adobe, 2023] Shap-E [OpenAI, 2023] Automatic generation of 3D assets has attracted major industrial attention given its application in ● AR/VR ● Game industry ● Film industry
  • 3. Representation for 3D Content Creation The core of a 3D generation system is the choice of 3D representation: ● Unlike 2D image generation, the “optimal” representation for 3D generation is still debatable.
  • 4. Representation for 3D Content Creation The core of a 3D generation system is the choice of 3D representation: ● Unlike 2D image generation, the “optimal” representation for 3D generation is still debatable. DMTet proposes a new 3D representation and design their model based on it: ● enables them to achieve surprisingly good results at that time (2021).
  • 5. Advantage of DMTet representation DMTet proposes a differentiable shape representation that marries implicit representations and meshes:
  • 6. Advantage of DMTet representation DMTet proposes a differentiable shape representation that marries implicit representations and meshes: ● Takes the best of both worlds. Unlike most mesh representations, not restricted to fixed topologies. Unlike implicit representations, directly leverages surface losses.
  • 7. DMTet: Overview ● Deformable tetrahedral grid ● Selective tetrahedra subdivision ● Differentiable marching tetrahedra layer and surface subdivision ● Learning a generative model with DMTet
  • 8. Deformable tetrahedral grid ● The base 3D representation is the deformable tetrahedral grid (from DefTet): ○ Fully tetrahedralizes (or, tightly covers) the 3D unit cube. ○ Basic unit is tetrahedron. Each tetrahedron consists of 4 vertices and 4 faces. ○ Vertex records SDF values, interpolate for any other 3D points in the tetrahedron. Learning Deformable Tetrahedral Meshes for 3D Reconstruction, Gao et al., NeurIPS 2020 tetrahed ron 2D illustration
  • 9. Tetrahedra subdivision ● The representation is multi-scale: ○ When SDF sign changes between vertices, can subdivide the tetrahedra and its neighbors; ○ Subdivision: adding the midpoints as vertices and use average SDF value. Each surface tet.(blue) is divided into 8 tet.(red) by adding midpoints. 2D illustration
  • 10. Tetrahedra to surface ● Marching Tetrahedra algorithm: ○ Converts SDF grids into explicit triangular mesh. ○ Only 3 unique surface-generation configurations with rotation symmetry. ○ Mesh vertex calculated as linear interpolation of tetrahedra vertices using SDF for weighting. ○ Gradient directly backpropagated to SDF values and vertex positions.
  • 11. Tetrahedra to surface ● Marching Tetrahedra algorithm: ○ Converts SDF grids into explicit triangular mesh. ○ Only 3 unique surface-generation configurations with rotation symmetry. ○ Mesh vertex calculated as linear interpolation of tetrahedra vertices using SDF for weighting. ○ Gradient directly backpropagated to SDF values and vertex positions. ● The mesh can be further subdivided following simplified Neural Subdivision Neural subdivision, Liu et al., TOG 2020
  • 12. Generative modeling based on tedrahedron ● Problem setting: Point-conditioned or voxel-conditioned shape generation. ● Input Encoder is PVCNN: extract feature volume from point clouds. ● SDF prediction via MLP: ● MLP also output features for each vertices for surface refinement
  • 13. Learning surface refinement with GCN ● Build a graph using all surface tetrahedra; ● Learn a GCN to dynamically deform the vertices and update the SDF; ● Subdivide surface tetrahedra and prune unused tetrahedra; ● Further refinement and subdivision of the extracted mesh.
  • 14. Mesh discriminator ● Learn an additional discriminator for better local details: ○ 3D CNN Discriminator from DECOR-GAN. ○ Input is local NxNxN SDF grid, which can be differentiably calculated from mesh. Decor-gan: 3d shape detailization by conditional refinement, Chen et al., CVPR 2021
  • 15. Learning objectives ● Surface alignment: ○ Chamfer Distance (with normal) between prediction and groundtruth
  • 16. Learning objectives ● Surface alignment: ○ Chamfer Distance (with normal) between prediction and groundtruth ● Adversarial loss follows LSGAN Least squares generative adversarial networks, Mao et al., ICCV 2017
  • 17. Learning objectives ● Surface alignment: ○ Chamfer Distance (with normal) between prediction and groundtruth ● Adversarial loss follows LSGAN ● Direct SDF loss: ○ Make sure whole space receives supervision signal and avoid flipping of SDF signs Least squares generative adversarial networks, Mao et al., ICCV 2017
  • 18. Learning objectives ● Surface alignment: ○ Chamfer Distance (with normal) between prediction and groundtruth ● Adversarial loss follows LSGAN ● Direct SDF loss: ○ Make sure whole space receives supervision signal and avoid flipping of SDF signs ● Deformation regularization: ○ Make sure the deformation stays small. Final loss is a weighted average of all these. Least squares generative adversarial networks, Mao et al., ICCV 2017
  • 19. Learning objectives ● Surface alignment: ○ Chamfer Distance (with normal) between prediction and groundtruth ● Adversarial loss follows LSGAN ● Direct SDF loss: ○ Make sure whole space receives supervision signal and avoid flipping of SDF signs ● Deformation regularization: ○ Make sure the deformation stays small. ● Final loss is a weighted average of all these. Least squares generative adversarial networks, Mao et al., ICCV 2017
  • 20. Results – Point cloud conditioned generation on ShapeNet
  • 21. Results – Voxel conditioned generation on animals
  • 22. Why is DMTet better than previous approaches?
  • 23. Why is DMTet better than previous approaches? ● No topology constraints; ● Direct surface supervision and regularization; ● Less artifacts given the deformable grids; ● Better efficiency and faithfulness of MT over MC under low sampling budget.
  • 24. Flexible Isosurface Extraction for Gradient-Based Mesh Optimization Tianchang Shen, Jacob Munkberg, Jon Hasselgren, Kangxue Yin, Zian Wang, Wenzheng Chen, Zan Gojcic, Sanja Fidler, Nicholas Sharp, Jun Gao Slides credit: Most of slides are adapted from the video “[SIGGRAPH'23] Flexible Isosurface Extraction for Gradient-Based Mesh Optimization”
  • 25. Generating Mesh via Gradient-Based Optimization Allow local flexible adjustments
  • 27. Marching Cube Marching Cubes Increase grid resolution Signed Distance Field Mesh vertices are constrained on lattice edges Small triangles in low curvature areas Requires more SDF samples
  • 28. Deep Marching Tetrahedra (DMTet) Marching Tetrahedra Marching Tetrahedra [1] DMTet [1] Image credit: Jun Gao on Towards Generative Modeling of 3D Objects Learned from Images | Toronto AIR Seminar No independent repositioning of vertices Why DMTet is deformable?
  • 29. Dual Contouring Dual Contouring Produce non-manifold results Vertex outside the cube
  • 30. Previous Methods Fail to Satisfy Both Properties
  • 31. FlexiCubes Improve fitting with additional flexibility + Well-defined gradient differentiation
  • 32. FlexiCubes Improve fitting with additional flexibility + Well-defined gradient differentiation Recall prior dual methods diverge during optimization.
  • 33. Dual Marching Cubes (Centroid) No QEF for vertex positioning Lack of freedom Quadrilateral faces
  • 34. FlexiCubes FlexiCubes introduces 3 types of parameters into Dual Marching Cubes: - Interpolation weights to position dual vertices in space. - Splitting weights to control how to split quadrilaterals into triangles. - Deformation vectors for spatial alignment. (DMTet)
  • 35. Interpolation weights 𝛼 per-cell adjusting interpolation along each edge 𝛽 per-cell adjusting vertex position within each dual face 𝛼, 𝛽 ℝ+ ∈
  • 36. Splitting weights Raw output from Dual Marching Cubes are typically non-planar quadrilateral faces - Quad-split weights 𝛾 controlling how quads get split to tris. During optimization, 𝛾 interpolates the surface between two possible splits. At inference, we split along the diagonal with
  • 38. Extensions Tetrahedral mesh extraction Hierarchically adaptive meshing
  • 40. Photogrammetry Through Differentiable Rendering Nvdiffrec jointly optimizes shape, materials, and lighting from images. FlexiCubes improves geometric fidelity and mesh quality.
  • 41. 3D Mesh Generation FlexiCubes can serve as a plug-and-play differentiable mesh extraction module in a 3D generative model GET3D w/ FlexiCubes generates meshes with better details and tessellation.
  • 42. Differentiable Physics Simulation with Tetrahedral Mesh Combine Flexicubes with a differentiable physics simulation framework (gradSim) and a differentiable rendering pipeline (nvdiffrast) Jointly recover 3D shapes and physical parameters from multi-view videos
  • 43. Mesh Simplification of Animated Objects FlexiCubes allows us to differentiably skin and deform the mesh, and simultaneously optimize with respect to the entire animated sequence. End-to-end optimization w/ FlexiCubes avoids artifacts from mesh stretching.
  • 44. Adding Mesh Regularizations FlexiCubes representation is flexible enough to support optimizing regularizations defined on meshes

Editor's Notes

  • #25: isosurface representation
  • #43: Rather than fitting a single mesh in a reference pose,