SlideShare a Scribd company logo
INTRODUCTION APPLICATIONS
AGENDA CONCLUSION
RESULTS
IMPLEMENTATION
EVALUATION OF CONDITIONAL
IMAGES SYNTHESIS:
GENERATING A PHOTOREALISTIC
IMAGE FROM A FACE SKETCH
Laureanda:
Samantha Gallone
Relatore:
Prof. Andrea De Lorenzo
INTRODUCTION APPLICATIONS
AGENDA CONCLUSION
RESULTS
IMPLEMENTATION
AGENDA
AGENDA
What are GANs?
Ø Structure and functioning
Ø Conditional GANs and applications
What’s next?
Ø Main limitations
Ø Three suggestions for future works
Project implementation
Ø How the dataset has been obtained?
Ø How the networks used are structured?
Evaluation’s results
Ø How the generated images have been tested?
Ø What can be said about their quality?
Agenda
INTRODUCTION APPLICATIONS
AGENDA CONCLUSION
RESULTS
IMPLEMENTATION
INTRODUCTION
INTRODUCTION
Learn a generative model
Generative
Trained in an adversarial setting
Adversarial
Use deep Neural Networks
Networks
What are Generative Adversarial Networks?
INTRODUCTION APPLICATIONS
AGENDA CONCLUSION
RESULTS
IMPLEMENTATION
INTRODUCTION
INTRODUCTION
Conditional GANs
INTRODUCTION APPLICATIONS
AGENDA CONCLUSION
RESULTS
IMPLEMENTATION
APPLICATIONS
APPLICATIONS
Ø Generation of digital characters for:
• video games,
• movies, and
• animations
Ø Generation of photorealistic renderings of
suspect based on sketches obtained thanks to
eyewitness information
Ø Creation of fake identities
Possible applications
INTRODUCTION APPLICATIONS
AGENDA CONCLUSION
RESULTS
IMPLEMENTATION
IMPLEMENTATION
Ø ArtLine + sketch simplification
Ø XDoG edge detector + sketch simplification
Dataset preparation
Ø StyleGAN2
Ø ReStyle + pixel2Style2pixel (pSp)
Network architecture
INTRODUCTION APPLICATIONS
AGENDA CONCLUSION
RESULTS
IMPLEMENTATION
IMPLEMENTATION
IMPLEMENTATION
Ø A very large dataset composed of pair of images (sketch + corresponding image) is needed
Ø Online there are available:
• CUHK Face Sketch FERET Database (CUFSF)
o 1’194 pair of images with both photo of a face and sketch of it
• FFHQ (Flickr-Face-HQ) dataset
o 70’000 face images
o no sketch
Dataset preparation - problem
INTRODUCTION APPLICATIONS
AGENDA CONCLUSION
RESULTS
IMPLEMENTATION
IMPLEMENTATION
IMPLEMENTATION
Ø ArtLine
Ø Learning to Simplify (LtS)/Mastering sketching
Original image ArtLine LtS MSE+GAN pencil1 pencil2
Dataset preparation – 1st approach
INTRODUCTION APPLICATIONS
AGENDA CONCLUSION
RESULTS
IMPLEMENTATION
IMPLEMENTATION
IMPLEMENTATION
Dataset preparation – 2nd approach
Ø Extended Difference of Gaussian (XDoG)
Ø Mastering sketching
Original image MSE+GAN
XDoG
INTRODUCTION APPLICATIONS
AGENDA CONCLUSION
RESULTS
IMPLEMENTATION
IMPLEMENTATION
IMPLEMENTATION
(a) StyleGAN generator
Ø State-of-the-art deep learning generative model
Ø Developed by NVIDIA in 2018 to produce realistic-looking images
StyleGAN2
INTRODUCTION APPLICATIONS
AGENDA CONCLUSION
RESULTS
IMPLEMENTATION
IMPLEMENTATION
IMPLEMENTATION
ReStyle
Ø Novel inversion scheme tasked with encoding real images into the extended 𝒲 + StyleGAN
latent space
INTRODUCTION APPLICATIONS
AGENDA CONCLUSION
RESULTS
IMPLEMENTATION
IMPLEMENTATION
IMPLEMENTATION
pixel2Style2pixel
Ø Encoder network that directly generates a series of style vectors which are fed into a pretrained
StyleGAN generator
INTRODUCTION APPLICATIONS
AGENDA CONCLUSION
RESULTS
IMPLEMENTATION
IMPLEMENTATION
IMPLEMENTATION
ReStyle – simplified encoder architecture
Ø Encoder architecture based on a variation of the pSp encoder
Ø All style features are derived from the final 16x16 feature map.
INTRODUCTION APPLICATIONS
AGENDA CONCLUSION
RESULTS
IMPLEMENTATION RESULTS
RESULTS
Generated images
INTRODUCTION APPLICATIONS
AGENDA CONCLUSION
RESULTS
IMPLEMENTATION RESULTS
RESULTS
Survey’s example
In your opinion, which images has inspired this drawing?
INTRODUCTION APPLICATIONS
AGENDA CONCLUSION
RESULTS
IMPLEMENTATION RESULTS
RESULTS
Results – Correct vs incorrect responses
INTRODUCTION APPLICATIONS
AGENDA CONCLUSION
RESULTS
IMPLEMENTATION RESULTS
RESULTS
Results – Lowest % of correct responses
INTRODUCTION APPLICATIONS
AGENDA CONCLUSION
RESULTS
IMPLEMENTATION RESULTS
RESULTS
Results – Highest % of correct responses
INTRODUCTION APPLICATIONS
AGENDA CONCLUSION
RESULTS
IMPLEMENTATION RESULTS
RESULTS
Ø Spearman correlation coefficient used to determine if there is correlation between the time spent to
answer and the percentage of correct responses
Results – Further analysis
INTRODUCTION APPLICATIONS
AGENDA CONCLUSION
RESULTS
IMPLEMENTATION CONCLUSION
CONCLUSION
Limitations
Ø Not able to generate images of all races equally
Ø It is challenging to generate images of children and young people
Ø It is not able to capture some features like piercings, tattoos and freckles
INTRODUCTION APPLICATIONS
AGENDA CONCLUSION
RESULTS
IMPLEMENTATION CONCLUSION
CONCLUSION
Future works
Solve the current limitations
Apply Stable Diffusion to generate photorealistic images based on a face sketch
Adapt the proposed model to other domains
INTRODUCTION APPLICATIONS
AGENDA CONCLUSION
RESULTS
IMPLEMENTATION
THANK YOU
FOR THE
ATTENTION
Bibliography (I)
1) Ian Goodfellow, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron
Courville, and Yoshua Bengio. GeneraJve adversarial nets. In Z. Ghahramani, M. Welling, C. Cortes, N.
Lawrence, and K.Q. Weinberger, editors, Advances in Neural InformaJon Processing Systems, volume 27.
Curran Associates, Inc., 2014.
2) Yuval Alaluf, Or Patashnik, and Daniel Cohen-Or. Restyle: A residual-based style- gan encoder via iteraJve
refinement. In Proceedings of the IEEE/CVF InternaJonal Conference on Computer Vision (ICCV), October
2021.
3) Elad Richardson, Yuval Alaluf, Or Patashnik, Yotam Nitzan, Yaniv Azar, Stav Shapiro, and Daniel Cohen-Or.
Encoding in style: a stylegan encoder for image-to-image transla- Jon. In IEEE/CVF Conference on
Computer Vision and Pa`ern RecogniJon (CVPR), June 2021.
4) Edgar Simo-Serra, Satoshi Iizuka, and Hiroshi Ishikawa. Mastering Sketching: Adver- sarial AugmentaJon
for Structured PredicJon. ACM TransacJons on Graphics (TOG), 37(1), 2018.
5) Edgar Simo-Serra, Satoshi Iizuka, Kazuma Sasaki, and Hiroshi Ishikawa. Learning to Simplify: Fully
ConvoluJonal Networks for Rough Sketch Cleanup. ACM TransacJons on Graphics (SIGGRAPH), 35(4),
2016.
6) Sven C. Olsen Holger Winnemöller, Jan Eric Kyprianidis. Xdog: An extended difference-of-gaussians
compendium including advanced image stylizaJon. Computers & Graphics, 36, 2012.
Bibliography (II)
7) NVIDIA. Ffhq dataset. https://github.com/NVlabs/ffhq-dataset.
8) Yu-Sheng Lin, Zhe-Yu Liu, Yu-An Chen, Yu-Siang Wang, Ya-Liang Chang, and Win- ston H. Hsu. Xcos: An
explainable cosine metric for face verification task. ACM Trans. Multimedia Comput. Commun. Appl.,
17(3s), nov 2021.
9) Timo Aila Tero Karras, Samuli Laine. A style-based generator architecture for genera- tive adversarial
networks. In IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2019.
10) Omer Tov, Yuval Alaluf, Yotam Nitzan, Or Patashnik, and Daniel Cohen-Or. Designing an encoder for
stylegan image manipulation. arXiv preprint arXiv:2102.02766, 2021.
11) H. J. Wang, Yitong Wang, Zheng Zhou, Xing Ji, Dihong Gong, Jingchao Zhou, Zhifeng Li, and Wei Liu.
Cosface: Large margin cosine loss for deep face recognition. In IEEE/CVF Conference on Computer Vision
and Pattern Recognition, 2018.
INTRODUCTION APPLICATIONS
AGENDA CONCLUSION
RESULTS
IMPLEMENTATION
APPENDIX
INTRODUCTION APPLICATIONS
AGENDA CONCLUSION
RESULTS
IMPLEMENTATION
Explainable Cosine Metric - xCos
Ø It is based on the insight that humans tend to compare
different facial features to determine whether two face
images belong to the same person.
Ø It is built using a grid-based feature extraction
approach, in which each image is divided into
multiple local regions.
Ø It uses the cosine similarity to compute the similarity
score
Ø It includes an attention mechanism that identifies the
specific facial features that contribute the most to the
similarity score
INTRODUCTION APPLICATIONS
AGENDA CONCLUSION
RESULTS
IMPLEMENTATION
Extended Difference of Gaussian
Ø Gaussian filter:
Ø Difference of two Gaussians with different 𝜎:
Ø XDoG :
Gσ(x) =
1
2πσ2
e− x2
2σ2
Dσ,k(x) = Gσ(x) − Gkσ(x) ≈ − (k − 1)σ2
∇2
G
Dσ,k,τ(x) = Gσ(x) − τ·Gkσ(x)
Tϵ,φ(u) =
{
1 u ≥ ϵ
1 + tanh(φ·(u − ϵ)) otherwise
Tϵ,φ(Dσ,k,τ * I)
INTRODUCTION APPLICATIONS
AGENDA CONCLUSION
RESULTS
IMPLEMENTATION
Learning to Simplify (LtS)
Ø Technique to simplify rough sketches
Ø It consists of a Fully Convolutional Network to simplify the image
Ø It has been trained by the authors using pairs of rough and simplified sketches using a weighted
mean square error criterion as loss
INTRODUCTION APPLICATIONS
AGENDA CONCLUSION
RESULTS
IMPLEMENTATION
Mastering sketching
Ø Combines a fully convolutional network for sketch simplification with a discriminator network that
is able to distinguish real line drawings from those generated by the network
Ø It is trained a variation of a conditional GAN where instead of a random input z, it is used a
deterministic prediction
Ø For adversarial training, the prediction model S is trained together with the discriminator model
which is no conditioned on the input x.
S : x ↦ y = S(x)
D : y ↦ D(y) ∈ ℝ

More Related Content

PDF
PPT Image Caption Generator mini project
PDF
Image super resolution using Generative Adversarial Network.
PDF
FACE PHOTO-SKETCH RECOGNITION USING DEEP LEARNING TECHNIQUES - A REVIEW
PPTX
Semantic segmentation with Convolutional Neural Network Approaches
PPTX
[20240902_LabSeminar_Huy]Dynamic Semantic-Based Spatial Graph Convolution Net...
PDF
Video Stitching using Improved RANSAC and SIFT
PDF
SVD and the Netflix Dataset
PDF
Decomposing image generation into layout priction and conditional synthesis
PPT Image Caption Generator mini project
Image super resolution using Generative Adversarial Network.
FACE PHOTO-SKETCH RECOGNITION USING DEEP LEARNING TECHNIQUES - A REVIEW
Semantic segmentation with Convolutional Neural Network Approaches
[20240902_LabSeminar_Huy]Dynamic Semantic-Based Spatial Graph Convolution Net...
Video Stitching using Improved RANSAC and SIFT
SVD and the Netflix Dataset
Decomposing image generation into layout priction and conditional synthesis

Similar to Evaluation of conditional images synthesis: generating a photorealistic image from a face sketch (20)

PDF
IRJET - Face Recognition based Attendance System
PPTX
DDGK: Learning Graph Representations for Deep Divergence Graph Kernels
PDF
A Comparative Case Study on Compression Algorithm for Remote Sensing Images
PDF
Kq3518291832
PPTX
Semantic Segmentation on Satellite Imagery
PDF
FACE COUNTING USING OPEN CV & PYTHON FOR ANALYZING UNUSUAL EVENTS IN CROWDS
PDF
A Comparative study of K-SVD and WSQ Algorithms in Fingerprint Compression Te...
PPTX
[20240415_LabSeminar_Huy]Deciphering Spatio-Temporal Graph Forecasting: A Cau...
PDF
Conception_et_realisation_dun_site_Web_d.pdf
PDF
2020 State of the Art of Neural Rendering
PDF
A deep learning based stereo matching model for autonomous vehicle
PDF
Survey paper on image compression techniques
PDF
Survey on Various Image Denoising Techniques
PDF
IRJET- K-SVD: Dictionary Developing Algorithms for Sparse Representation ...
PDF
A PROJECT REPORT ON REMOVAL OF UNNECESSARY OBJECTS FROM PHOTOS USING MASKING
PPTX
Technical Seminar presentation topic for 8th sem
PDF
Efficient resampling features and convolution neural network model for image ...
PDF
Efficient resampling features and convolution neural network model for image ...
PDF
IMPROVEMENT IN IMAGE DENOISING OF HANDWRITTEN DIGITS USING AUTOENCODERS IN DE...
PDF
IRJET- A Review on Data Dependent Label Distribution Learning for Age Estimat...
IRJET - Face Recognition based Attendance System
DDGK: Learning Graph Representations for Deep Divergence Graph Kernels
A Comparative Case Study on Compression Algorithm for Remote Sensing Images
Kq3518291832
Semantic Segmentation on Satellite Imagery
FACE COUNTING USING OPEN CV & PYTHON FOR ANALYZING UNUSUAL EVENTS IN CROWDS
A Comparative study of K-SVD and WSQ Algorithms in Fingerprint Compression Te...
[20240415_LabSeminar_Huy]Deciphering Spatio-Temporal Graph Forecasting: A Cau...
Conception_et_realisation_dun_site_Web_d.pdf
2020 State of the Art of Neural Rendering
A deep learning based stereo matching model for autonomous vehicle
Survey paper on image compression techniques
Survey on Various Image Denoising Techniques
IRJET- K-SVD: Dictionary Developing Algorithms for Sparse Representation ...
A PROJECT REPORT ON REMOVAL OF UNNECESSARY OBJECTS FROM PHOTOS USING MASKING
Technical Seminar presentation topic for 8th sem
Efficient resampling features and convolution neural network model for image ...
Efficient resampling features and convolution neural network model for image ...
IMPROVEMENT IN IMAGE DENOISING OF HANDWRITTEN DIGITS USING AUTOENCODERS IN DE...
IRJET- A Review on Data Dependent Label Distribution Learning for Age Estimat...
Ad

Recently uploaded (20)

PDF
Operating System & Kernel Study Guide-1 - converted.pdf
PDF
keyrequirementskkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkk
PPTX
OOP with Java - Java Introduction (Basics)
PPTX
Lecture Notes Electrical Wiring System Components
PDF
SM_6th-Sem__Cse_Internet-of-Things.pdf IOT
PPT
Project quality management in manufacturing
PDF
Well-logging-methods_new................
PPTX
additive manufacturing of ss316l using mig welding
PDF
Mohammad Mahdi Farshadian CV - Prospective PhD Student 2026
PPT
Mechanical Engineering MATERIALS Selection
PPTX
CH1 Production IntroductoryConcepts.pptx
PDF
Enhancing Cyber Defense Against Zero-Day Attacks using Ensemble Neural Networks
PDF
Mitigating Risks through Effective Management for Enhancing Organizational Pe...
PPTX
web development for engineering and engineering
PPTX
Recipes for Real Time Voice AI WebRTC, SLMs and Open Source Software.pptx
PPTX
Internet of Things (IOT) - A guide to understanding
PDF
PRIZ Academy - 9 Windows Thinking Where to Invest Today to Win Tomorrow.pdf
PPTX
CARTOGRAPHY AND GEOINFORMATION VISUALIZATION chapter1 NPTE (2).pptx
PPTX
Geodesy 1.pptx...............................................
PDF
July 2025 - Top 10 Read Articles in International Journal of Software Enginee...
Operating System & Kernel Study Guide-1 - converted.pdf
keyrequirementskkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkk
OOP with Java - Java Introduction (Basics)
Lecture Notes Electrical Wiring System Components
SM_6th-Sem__Cse_Internet-of-Things.pdf IOT
Project quality management in manufacturing
Well-logging-methods_new................
additive manufacturing of ss316l using mig welding
Mohammad Mahdi Farshadian CV - Prospective PhD Student 2026
Mechanical Engineering MATERIALS Selection
CH1 Production IntroductoryConcepts.pptx
Enhancing Cyber Defense Against Zero-Day Attacks using Ensemble Neural Networks
Mitigating Risks through Effective Management for Enhancing Organizational Pe...
web development for engineering and engineering
Recipes for Real Time Voice AI WebRTC, SLMs and Open Source Software.pptx
Internet of Things (IOT) - A guide to understanding
PRIZ Academy - 9 Windows Thinking Where to Invest Today to Win Tomorrow.pdf
CARTOGRAPHY AND GEOINFORMATION VISUALIZATION chapter1 NPTE (2).pptx
Geodesy 1.pptx...............................................
July 2025 - Top 10 Read Articles in International Journal of Software Enginee...
Ad

Evaluation of conditional images synthesis: generating a photorealistic image from a face sketch