SlideShare a Scribd company logo
CharNeRF: 3D Character
Generation from 2D Concept Art
(IEEE AIxVR 2024)
Eddy Chu1
, Yiyang Chen1
, Chedy Raissi2
, Anand Bhojan1
1
National University of Singapore1
, Riot Games2
● Problem Overview
● Method
● Data Collection
● Results
● Ablation Test
● Limitations & Future Extensions
Content
2
Problem Overview
3
3D modeling is essential in VR and AR as it enables creative and realistic
expression of ideas, but it’s a costly process!
4
Current 3D Modeling in AR/VR/Game
5
Concept Art
Image Source: https://www.behance.net/gallery/5594425/3D-Models-from-Concept-Art
Image Source: https://zechatactics.com/blog/2021/03/how-to-build-a-zecha/
Image Source: https://substance3d.adobe.com/plugins/mixamo-in-blender/
3D Models
Shading & Texturing
Rigging & Animation
Optional
Costly & Tedious
Types of Concept Art
6
Initial Concept Art
Image Source: https://www.behance.net/gallery/5594425/3D-Models-from-Concept-Art
Image Source: https://fgfactory.com/how-to-design-3d-character-from-scratch-to-final-game-model
Image Source: https://www.weasyl.com/~magnus/submissions/55047/lewis-model-sheet
Image Source: https://www.artstation.com/artwork/rRy3em
Initial Sketches
Turnaround Concept Art
3D Modeling-Ready
Input: 3-View Turnaround
Concept Art
G: Goal Model
Output: 3D model
(Volumetric Representation,
NeRF)
- Front, Back & Side faces - Meshes of different LoD can be
generated easily
7
Problem Setting:
Industrial Values
- Speed up the process of character modelling which usually takes
hundreds of hours (source: WallaWalla Studio)
- Speed up the process of skin production
Research Values
- There are a lot of existing works/researches on human or object reconstruction
from 2D image(s), but to the best of our knowledge, reconstruction from concept
art for virtual character is NEW!
- Reconstruction for “virtual” character means only RGB values are available (not
depth), which is challenging computer vision problem by itself
- The shape variation for virtual characters is much greater than daily objects or
human beings. A network that is able to capture the distribution of virtual
characters would need to be extremely resourceful.
8
Method
9
Capture essential information from the concept art
- Source Image Encoding
10
Pixel-aligned
Features
Image Encoder
- Encode local information
- Store at each pixel location
Choose PixelNeRF (CVPR 2021) as our baseline
11
12
Instead of ResNet, we find multi-level (one coarse and one
fine) feature extraction greatly improves the rendering quality.
Image Encoder
We find two-level encoder effective in extracting local information
13
Source images are first downsized to extract
coarse information
Fine feature
Coarse featur
H/8 x W/8 x256 H/2 x W/2 x256
Image Encoder - Simplified Feature Combination
14
Fine feature
Coarse feature MLP
H/2 x W/2 x256
NeRF
Regularize NeRF through
- a Mix of Sampling Methods
15
2 x Neural Radiance Field (NeRF)
16
novel view
Ray Sampling
17
novel view
Ray Sampling
Wi
Coarse Sampling
18
novel view
Ray Sampling
Wi
ΣWj
Wi
Weight Distribution
19
novel view
Ray Sampling
Fine Sampling
Wi
ΣWj
20
NeRF is trained by minimizing the color difference between the
image pixel and the “composited” color along a ray. It is therefore a
common problem for a NeRF to “cheat” by distributing volume
density more evenly along a ray than it should.
Our idea is essentially to encourage NeRF to to assign high volume
density near the surface area, which also allows NeRF to learn the shape
directly.
21
Entropy Loss (InfoNeRF CVPR 2022) Emptiness Loss (SJC CVPR 2023)
novel view
Ray Sampling
Surface Sampling
22
We first distribute the target number of surface sampling points based on the
sub-mesh area distribution, and then, for each sub-meshes, sampling is done
using triangle point picking algorithm.
Pixel-aligned
Features
novel view src view
src view
Ray Sampling
Surface Sampling
23
Mimic how a 3D modelist creates a 3D model
- View Direction attended Feature Combination
24
Pixel-aligned
Features
novel view src view
src view
Ray Sampling
Surface Sampling
25
Pixel-aligned
Features
novel view src view
src view
Early
MLP
Z’2
Z’1
Z’3
Ray Sampling
Surface Sampling
26
Two More
MLP
Blocks
- Feature Vector Similarity
- View Direction Similarity
View Direction Attended Multi-Head Self Attention
27
28
Loss Function
29
30
Simple Guideline for Mesh
Reconstruction from CharNeRF
31
In the original NeRF, mesh is reconstructed through setting view direction to be zero
vector. Nonetheless, the approach nullifies our view-direction attended feature
combination mechanism. Therefore, to fully leverage the power of CharNeRF, we
propose to supply multiple view directions and reconstruct the mesh through the
average volume density.
32
Equivalent to setting view directions to0
Data
33
We collect 95 3D character from online resources
- 75 in training dataset
- 10 in validation dataset
- 10 in test dataset
34
Each 3D Character, we generate
- 303 images (3 views as concept art, 300 cameras evenly distributed
using fibonacci lattice method)
- 1080x1080
- White background
- Pose.txt 4x4 camera to world matrix
1m
2m
4m
35
Image Source: https://www.mixamo.com/#/?page=2&type=Character
- Front, back, side images as concept art
36
Image Source: https://www.mixamo.com/#/?page=2&type=Character
- Intrinsics.json
- focal length: fx, fy
- measured in pixel
- principle point coordinate: cx, cy
- width & height
- aabb_scale
- Obj + Mtl + texture images
37
Results
38
Concept Art from 2D Artists:
39
40
Concept Art from 2D Artists:
Synthetic Concept Art:
Ablation Test
42
CharNeRF 4 2
1
3
43
PixelNeRF
{
With a mix of sampling method
44
Limitation & Future Extension
45
Extension 1 - Improve rendering quality
46
CharNeRF suffers from blurriness at the extremities of the characters,
especially when the viewing angle is far from the concept art views…
47
48
Later research in NeRF leverages diffusion models as prior by lifting its
strong 2D generative power to 3D…
Text to 3D through diffusion model
- Score Jacobian Chaining
(CVPR 2023)
- Dreamfusion (ICLR 2023)
Image to 3D through diffusion model
- 3DiM (ICLR 2023)
- RealFusion 360 (CVPR 2023)
- zero-1-to-3 (ICCV 2023)
49
50
51
NeRFDiff (ICML 2023) finetunes NeRF using conditional diffusion model
Given the similar architecture of CharNeRF to PixelNeRF, the fine tuning
method used in NeRFDiff is a ready-made extension to improve CharNeRF
Extension 2 - Interactivity & Editability
52
53
The Final Character Concept Art typically comes with close-up sketches
for the occluded parts. A model that allows 2D artists to interactively
supply close-up sketches would be an important extension.
Image Source: https://www.artstation.com/artwork/rRy3em
Thank You
54
Q & A
55

More Related Content

PDF
Neural Radiance Fields & Neural Rendering.pdf
PDF
[Paper] GIRAFFE: Representing Scenes as Compositional Generative Neural Featu...
PDF
TransNeRF
PDF
DOMAIN ENGINEERING FOR APPLIED MONOCULAR RECONSTRUCTION OF PARAMETRIC FACES
PDF
Domain Engineering for Applied Monocular Reconstruction of Parametric Faces
PPTX
Scene Representation Networks: Continuous 3D-Structure-Aware Neural Scene Rep...
PPTX
Neural Scene Representation & Rendering: Introduction to Novel View Synthesis
PPTX
Understanding neural radiance fields
Neural Radiance Fields & Neural Rendering.pdf
[Paper] GIRAFFE: Representing Scenes as Compositional Generative Neural Featu...
TransNeRF
DOMAIN ENGINEERING FOR APPLIED MONOCULAR RECONSTRUCTION OF PARAMETRIC FACES
Domain Engineering for Applied Monocular Reconstruction of Parametric Faces
Scene Representation Networks: Continuous 3D-Structure-Aware Neural Scene Rep...
Neural Scene Representation & Rendering: Introduction to Novel View Synthesis
Understanding neural radiance fields

Similar to CharNeRF: 3D Character Generation from Concept Art using Neural Radiance Field (20)

PDF
NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis
PPTX
Volumetric Texture Analysis Based on Three-Dimensional Gaussian Markov Random...
PDF
lecture_16_jiajun.pdf
PDF
Tsvi Lev. Practical Explainability for AI - with examples
PDF
Selective local binary pattern with convolutional neural network for facial ...
PDF
PPTX
Final_From 2D Image To 3D Object.pptx
PPT
3D visualisation of medical images
PPT
3DAS presentation Game Developers Conference
PPT
Making Face
PDF
Generative Network for 3D Face Modelling
PDF
3D Face Modelling of Human Faces using GANs
PPT
Facial modeling animation - Presentation Imagina 2003
PDF
Introduction to 3D Computer Vision and Differentiable Rendering
PDF
Computed Tomography Image Reconstruction in 3D VoxelSpace
PDF
2020 State of the Art of Neural Rendering
PPTX
Tutorial on Generalization in Neural Fields, CVPR 2022 Tutorial on Neural Fie...
PDF
stylegan.pdf
PDF
Visualization of Deep Learning Models (D1L6 2017 UPC Deep Learning for Comput...
PDF
Texture Classification
NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis
Volumetric Texture Analysis Based on Three-Dimensional Gaussian Markov Random...
lecture_16_jiajun.pdf
Tsvi Lev. Practical Explainability for AI - with examples
Selective local binary pattern with convolutional neural network for facial ...
Final_From 2D Image To 3D Object.pptx
3D visualisation of medical images
3DAS presentation Game Developers Conference
Making Face
Generative Network for 3D Face Modelling
3D Face Modelling of Human Faces using GANs
Facial modeling animation - Presentation Imagina 2003
Introduction to 3D Computer Vision and Differentiable Rendering
Computed Tomography Image Reconstruction in 3D VoxelSpace
2020 State of the Art of Neural Rendering
Tutorial on Generalization in Neural Fields, CVPR 2022 Tutorial on Neural Fie...
stylegan.pdf
Visualization of Deep Learning Models (D1L6 2017 UPC Deep Learning for Comput...
Texture Classification
Ad

More from Anand Bhojan (20)

PDF
Metaverse - The 'Killer App' for 5G, 6G and Beyond
PDF
Multimedia Analytics with 5G Edge Nodes
PDF
Adaptive Video Content Manipulation for OLED Display Power Management
PPTX
CloudHide: Towards Latency Hiding Techniques for Thin-client Cloud Gaming
PDF
SuperStreamer: Enabling Progressive Content Streaming in a Game Engine
PDF
Game balancing with ecosystem mechanism
PPTX
Game Balancing with Ecosystem Mechanism
PDF
TIṬAL – Asynchronous multiplayer shooter with procedurally generated maps
PDF
Large-scale Media Processing on Cloud - Cloud Asia 2016 PANEL DISCUSSION
PDF
Introduction to the Special issue on ‘‘Future trends in robotics and autonomo...
PDF
ShowNTell: An easy-to-use tool for answering students’ questions with voice-o...
PDF
ShowNTell: An easy-to-use tool for answering students’ questions with voice-o...
PDF
mumble: Framework for Seamless Message Transfer on Smartphones
PDF
mumble: Framework for Seamless Message Transfer on Smartphones
PDF
Energy Efficient Multi-player Smartphone Gaming using 3D Spatial Subdivisioni...
PDF
ARENA - Dynamic Run-time Map Generation for Multiplayer Shooters [Full Text]
PDF
PARVAI - HVS Aware Adaptive Display Power Management for Mobile Games [Full T...
PDF
Gamelets - Multiplayer Mobile Games with Distributed Micro-Clouds [Full Text]
PDF
Energy Efficient Mobile Applications with Mobile Cloud Computing ( MCC )
PPTX
ARENA - Dynamic Run-time Map Generation for Multiplayer Shooters
Metaverse - The 'Killer App' for 5G, 6G and Beyond
Multimedia Analytics with 5G Edge Nodes
Adaptive Video Content Manipulation for OLED Display Power Management
CloudHide: Towards Latency Hiding Techniques for Thin-client Cloud Gaming
SuperStreamer: Enabling Progressive Content Streaming in a Game Engine
Game balancing with ecosystem mechanism
Game Balancing with Ecosystem Mechanism
TIṬAL – Asynchronous multiplayer shooter with procedurally generated maps
Large-scale Media Processing on Cloud - Cloud Asia 2016 PANEL DISCUSSION
Introduction to the Special issue on ‘‘Future trends in robotics and autonomo...
ShowNTell: An easy-to-use tool for answering students’ questions with voice-o...
ShowNTell: An easy-to-use tool for answering students’ questions with voice-o...
mumble: Framework for Seamless Message Transfer on Smartphones
mumble: Framework for Seamless Message Transfer on Smartphones
Energy Efficient Multi-player Smartphone Gaming using 3D Spatial Subdivisioni...
ARENA - Dynamic Run-time Map Generation for Multiplayer Shooters [Full Text]
PARVAI - HVS Aware Adaptive Display Power Management for Mobile Games [Full T...
Gamelets - Multiplayer Mobile Games with Distributed Micro-Clouds [Full Text]
Energy Efficient Mobile Applications with Mobile Cloud Computing ( MCC )
ARENA - Dynamic Run-time Map Generation for Multiplayer Shooters
Ad

Recently uploaded (20)

PDF
PPT on Performance Review to get promotions
PPTX
Foundation to blockchain - A guide to Blockchain Tech
PPT
Mechanical Engineering MATERIALS Selection
PPTX
Safety Seminar civil to be ensured for safe working.
PDF
TFEC-4-2020-Design-Guide-for-Timber-Roof-Trusses.pdf
PDF
R24 SURVEYING LAB MANUAL for civil enggi
PPTX
Internet of Things (IOT) - A guide to understanding
PPTX
Artificial Intelligence
PPTX
CH1 Production IntroductoryConcepts.pptx
PPTX
UNIT-1 - COAL BASED THERMAL POWER PLANTS
PPTX
Sustainable Sites - Green Building Construction
PPTX
Construction Project Organization Group 2.pptx
PPTX
Geodesy 1.pptx...............................................
PPTX
web development for engineering and engineering
PDF
SM_6th-Sem__Cse_Internet-of-Things.pdf IOT
PDF
Evaluating the Democratization of the Turkish Armed Forces from a Normative P...
PDF
Operating System & Kernel Study Guide-1 - converted.pdf
PDF
III.4.1.2_The_Space_Environment.p pdffdf
PDF
keyrequirementskkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkk
PPTX
additive manufacturing of ss316l using mig welding
PPT on Performance Review to get promotions
Foundation to blockchain - A guide to Blockchain Tech
Mechanical Engineering MATERIALS Selection
Safety Seminar civil to be ensured for safe working.
TFEC-4-2020-Design-Guide-for-Timber-Roof-Trusses.pdf
R24 SURVEYING LAB MANUAL for civil enggi
Internet of Things (IOT) - A guide to understanding
Artificial Intelligence
CH1 Production IntroductoryConcepts.pptx
UNIT-1 - COAL BASED THERMAL POWER PLANTS
Sustainable Sites - Green Building Construction
Construction Project Organization Group 2.pptx
Geodesy 1.pptx...............................................
web development for engineering and engineering
SM_6th-Sem__Cse_Internet-of-Things.pdf IOT
Evaluating the Democratization of the Turkish Armed Forces from a Normative P...
Operating System & Kernel Study Guide-1 - converted.pdf
III.4.1.2_The_Space_Environment.p pdffdf
keyrequirementskkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkk
additive manufacturing of ss316l using mig welding

CharNeRF: 3D Character Generation from Concept Art using Neural Radiance Field

  • 1. CharNeRF: 3D Character Generation from 2D Concept Art (IEEE AIxVR 2024) Eddy Chu1 , Yiyang Chen1 , Chedy Raissi2 , Anand Bhojan1 1 National University of Singapore1 , Riot Games2
  • 2. ● Problem Overview ● Method ● Data Collection ● Results ● Ablation Test ● Limitations & Future Extensions Content 2
  • 4. 3D modeling is essential in VR and AR as it enables creative and realistic expression of ideas, but it’s a costly process! 4
  • 5. Current 3D Modeling in AR/VR/Game 5 Concept Art Image Source: https://www.behance.net/gallery/5594425/3D-Models-from-Concept-Art Image Source: https://zechatactics.com/blog/2021/03/how-to-build-a-zecha/ Image Source: https://substance3d.adobe.com/plugins/mixamo-in-blender/ 3D Models Shading & Texturing Rigging & Animation Optional Costly & Tedious
  • 6. Types of Concept Art 6 Initial Concept Art Image Source: https://www.behance.net/gallery/5594425/3D-Models-from-Concept-Art Image Source: https://fgfactory.com/how-to-design-3d-character-from-scratch-to-final-game-model Image Source: https://www.weasyl.com/~magnus/submissions/55047/lewis-model-sheet Image Source: https://www.artstation.com/artwork/rRy3em Initial Sketches Turnaround Concept Art 3D Modeling-Ready
  • 7. Input: 3-View Turnaround Concept Art G: Goal Model Output: 3D model (Volumetric Representation, NeRF) - Front, Back & Side faces - Meshes of different LoD can be generated easily 7 Problem Setting:
  • 8. Industrial Values - Speed up the process of character modelling which usually takes hundreds of hours (source: WallaWalla Studio) - Speed up the process of skin production Research Values - There are a lot of existing works/researches on human or object reconstruction from 2D image(s), but to the best of our knowledge, reconstruction from concept art for virtual character is NEW! - Reconstruction for “virtual” character means only RGB values are available (not depth), which is challenging computer vision problem by itself - The shape variation for virtual characters is much greater than daily objects or human beings. A network that is able to capture the distribution of virtual characters would need to be extremely resourceful. 8
  • 10. Capture essential information from the concept art - Source Image Encoding 10
  • 11. Pixel-aligned Features Image Encoder - Encode local information - Store at each pixel location Choose PixelNeRF (CVPR 2021) as our baseline 11
  • 12. 12 Instead of ResNet, we find multi-level (one coarse and one fine) feature extraction greatly improves the rendering quality.
  • 13. Image Encoder We find two-level encoder effective in extracting local information 13 Source images are first downsized to extract coarse information Fine feature Coarse featur
  • 14. H/8 x W/8 x256 H/2 x W/2 x256 Image Encoder - Simplified Feature Combination 14 Fine feature Coarse feature MLP H/2 x W/2 x256 NeRF
  • 15. Regularize NeRF through - a Mix of Sampling Methods 15
  • 16. 2 x Neural Radiance Field (NeRF) 16
  • 20. novel view Ray Sampling Fine Sampling Wi ΣWj 20
  • 21. NeRF is trained by minimizing the color difference between the image pixel and the “composited” color along a ray. It is therefore a common problem for a NeRF to “cheat” by distributing volume density more evenly along a ray than it should. Our idea is essentially to encourage NeRF to to assign high volume density near the surface area, which also allows NeRF to learn the shape directly. 21 Entropy Loss (InfoNeRF CVPR 2022) Emptiness Loss (SJC CVPR 2023)
  • 22. novel view Ray Sampling Surface Sampling 22 We first distribute the target number of surface sampling points based on the sub-mesh area distribution, and then, for each sub-meshes, sampling is done using triangle point picking algorithm.
  • 23. Pixel-aligned Features novel view src view src view Ray Sampling Surface Sampling 23
  • 24. Mimic how a 3D modelist creates a 3D model - View Direction attended Feature Combination 24
  • 25. Pixel-aligned Features novel view src view src view Ray Sampling Surface Sampling 25
  • 26. Pixel-aligned Features novel view src view src view Early MLP Z’2 Z’1 Z’3 Ray Sampling Surface Sampling 26
  • 27. Two More MLP Blocks - Feature Vector Similarity - View Direction Similarity View Direction Attended Multi-Head Self Attention 27
  • 28. 28
  • 30. 30
  • 31. Simple Guideline for Mesh Reconstruction from CharNeRF 31
  • 32. In the original NeRF, mesh is reconstructed through setting view direction to be zero vector. Nonetheless, the approach nullifies our view-direction attended feature combination mechanism. Therefore, to fully leverage the power of CharNeRF, we propose to supply multiple view directions and reconstruct the mesh through the average volume density. 32 Equivalent to setting view directions to0
  • 34. We collect 95 3D character from online resources - 75 in training dataset - 10 in validation dataset - 10 in test dataset 34
  • 35. Each 3D Character, we generate - 303 images (3 views as concept art, 300 cameras evenly distributed using fibonacci lattice method) - 1080x1080 - White background - Pose.txt 4x4 camera to world matrix 1m 2m 4m 35 Image Source: https://www.mixamo.com/#/?page=2&type=Character
  • 36. - Front, back, side images as concept art 36 Image Source: https://www.mixamo.com/#/?page=2&type=Character
  • 37. - Intrinsics.json - focal length: fx, fy - measured in pixel - principle point coordinate: cx, cy - width & height - aabb_scale - Obj + Mtl + texture images 37
  • 39. Concept Art from 2D Artists: 39
  • 40. 40 Concept Art from 2D Artists:
  • 43. CharNeRF 4 2 1 3 43 PixelNeRF { With a mix of sampling method
  • 44. 44
  • 45. Limitation & Future Extension 45
  • 46. Extension 1 - Improve rendering quality 46
  • 47. CharNeRF suffers from blurriness at the extremities of the characters, especially when the viewing angle is far from the concept art views… 47
  • 48. 48 Later research in NeRF leverages diffusion models as prior by lifting its strong 2D generative power to 3D… Text to 3D through diffusion model - Score Jacobian Chaining (CVPR 2023) - Dreamfusion (ICLR 2023) Image to 3D through diffusion model - 3DiM (ICLR 2023) - RealFusion 360 (CVPR 2023) - zero-1-to-3 (ICCV 2023)
  • 49. 49
  • 50. 50
  • 51. 51 NeRFDiff (ICML 2023) finetunes NeRF using conditional diffusion model Given the similar architecture of CharNeRF to PixelNeRF, the fine tuning method used in NeRFDiff is a ready-made extension to improve CharNeRF
  • 52. Extension 2 - Interactivity & Editability 52
  • 53. 53 The Final Character Concept Art typically comes with close-up sketches for the occluded parts. A model that allows 2D artists to interactively supply close-up sketches would be an important extension. Image Source: https://www.artstation.com/artwork/rRy3em