SlideShare a Scribd company logo
Image Aesthetic Assessment With Deep Learning
Who am I ?
M a c h i n e L e a r n i n g E n g i n e e r
A m i t K u s h w a h a
Image Aesthetics
W H A T D E F I N E S A N I M A G E ?
★ Image aesthetic evaluation aims to classify photos into high quality or low quality from the perspective
of human.
★ Many visual features have been explored under this formulation (handcrafted features), ranging from
low-level image statistics, such as edge distributions and color histograms, to high-level photographic
rules, such as the rule of thirds and golden ratio
Beautiful Capture
“ B E A U T Y I S R E A L L Y I N T H E E Y E O F T H E B E H O L D E R ”
✦ While everyone has different tastes, there are universally accepted
norms when it comes to beauty – things which everyone pretty
much agrees are beautiful, like sunsets or sunrises over the
mountains or the ocean.
A S U B J E C T I V E C O N J E C T U R E
T E C H N I Q U E S O F I M A G E C A P T U R I N G
A S U B J E C T I V E C O N J E C T U R E
✦ Image aesthetics can be affected by the different usages of
lighting, contrast , and image composition
Beautiful Capture
The Two Images
F A C T O R S L I K E L I G H T I N G , C O N T R A S T , A N G L E D E T E R M I N E T H E A E S T H E T I C S
Significance of Aesthetic
Assessment
✦ With the ever increasing user generated content, biggest challenge is to
showcase high quality images to users.
✦ Zomato measures the performance of restaurant for ad sales with click
through rate (CTR) as one of the parameter. Restaurants with High Quality
Display Image have greater CTR.
01
02
04
06
WRITESOMETHING
Photo Gallery
P H O T O S D O H A V E A S T O R Y
Let’s solve Image
Aesthetics with the
most disruptive
technology over the
last five years.
Deep Learning
S T A T E O F T H E A R T I N M A N Y F I E L D S N O W
Fixed Size Input
Constraint
D E E P M O D E L I N P U T L A Y E R
✦ This constraint of fixed size input in Deep Network
compromises the aesthetics of original image. input images
need to be transformed via cropping, scaling, or padding, which
often damages image composition, reduces image resolution,
or causes image distortion, thus compromising the aesthetics of
the original images.
Input Layer of Network
A M A Z I N G I N T H E I R O R I G I N A L A S P E C T R A T I O
Original Image
Transformed Image
R E S I Z I N G L O S E S T H E O R I G I N A L A E S T H E T I C S O F I M A G E
O R I G I N A L R E S I Z E D
Transformed Image
O R I G I N A L R E S I Z E D
R E S I Z I N G L O S E S T H E O R I G I N A L A E S T H E T I C S O F I M A G E
DEEP
LEARNING
Demystifying the Network
✦ The first few layers of networks are either convolution or pooling layer, followed by fully connected
layers.
✦ The fixed length input is only the constraint of fully-connected layer. Be it convolution or pooling,
both perform operations on input, with the spatial information.
Spatial Pyramid
Pooling
✦ Network with another pooling strategy, ‘spatial pyramid pooling’ just before the fully
connected layer, makes it capable of taking input of any size and generate fixed length
representation.
Max Pooling
O U T P U T I S P R O P O R T I O N A L T O I N P U T
Kernel size is fixed
Spatial Pyramid Pooling
Output dimension (Bin Size , Bin Size)
Spatial Pyramid Pooling
Fixed Bin Numbers
Fixed Kernel Size
SPP-Net
CNN
Training a SPP-net
Z O M A T O D A T A S E T U S E D
T R A I N I N G L O S S T R A I N I N G A C C U R A C Y
M A X - A C C U R A C Y - 7 7 . 2
NOREGIONOFINTEREST
Interesting Prediction Results
Spp-Score=0.46
REGIONOFINTERESTISPRESENT
Interesting Prediction Results
Spp-Score=0.94
It’s often neglected, but plays a major role in classification learning.
S C A L E I N V A R I A N T L E A R N I N G
Allows to explore CNN without the constraint of fixed size representation of input.
F I X E D S I Z E I N P U T
Take Aways
S P A T I A L P Y R A M I D P O O L I N G N E T
R E F E R E N C E S
✦ Kaiming He, Xiangyu Zhang, Shaoqing Ren, Jian Sun, “Spatial Pyramid Pooling in Deep
Convolutional Networks for Visual Recognition”
✦ Alex Krizhevsky, Ilya Sutskever, Geoffrey E. Hinton, ImageNet Classification with Deep
Convolutional Neural Networks 2012
Composition-Preserving Deep Photo Aesthetics Assessment
✦ Long Mai, Hailin Jin, Feing Liu, Composition-Preserving Deep Photo Aesthetics
Assessment
h t t p s : / / w w w . g i t h u b . c o m / y a r d s t i c k 1 7 /
h t t p s : / / w w w . l i n k e d i n . c o m / i n / y a r d s t i c k 1 7 /
a m i t _ k u s h w a h a @ o u t l o o k . c o m
Thank You
F i n d m e h e r e

More Related Content

PPTX
Viola-Jones Object Detection
PDF
#10 pydata warsaw object detection with dn ns
PDF
Brain Tumor Classification using Support Vector Machine
PDF
Deep VO and SLAM
PDF
[PR12] intro. to gans jaejun yoo
PDF
Computer vision for transportation
PDF
Comp4010 lecture11 VR Applications
PDF
Gradient Descent, Back Propagation, and Auto Differentiation - Advanced Spark...
Viola-Jones Object Detection
#10 pydata warsaw object detection with dn ns
Brain Tumor Classification using Support Vector Machine
Deep VO and SLAM
[PR12] intro. to gans jaejun yoo
Computer vision for transportation
Comp4010 lecture11 VR Applications
Gradient Descent, Back Propagation, and Auto Differentiation - Advanced Spark...

What's hot (20)

PDF
Logistic route rationalization and optimization
PPTX
Deep learning-for-pose-estimation-wyang-defense
PDF
openCV with python
PDF
GAN in medical imaging
PPTX
Tutorial on convolutional neural networks
PDF
Mask R-CNN
PDF
Designing Augmented Reality Experiences
PPTX
Deep learning for object detection
PPTX
AlexNet(ImageNet Classification with Deep Convolutional Neural Networks)
PDF
Introduction to object detection
PPTX
Final thesis presentation
PDF
[GAN by Hung-yi Lee]Part 1: General introduction of GAN
PDF
Semantic Segmentation - Fully Convolutional Networks for Semantic Segmentation
PDF
Intro to Deep Learning for Computer Vision
PPTX
Sky x technology
PDF
Convolutional neural network
PPT
Action Recognition (Thesis presentation)
PDF
An Introduction to Optimal Transport
PPTX
Convolutional Neural Network (CNN)
PDF
Trusted Computing Base
Logistic route rationalization and optimization
Deep learning-for-pose-estimation-wyang-defense
openCV with python
GAN in medical imaging
Tutorial on convolutional neural networks
Mask R-CNN
Designing Augmented Reality Experiences
Deep learning for object detection
AlexNet(ImageNet Classification with Deep Convolutional Neural Networks)
Introduction to object detection
Final thesis presentation
[GAN by Hung-yi Lee]Part 1: General introduction of GAN
Semantic Segmentation - Fully Convolutional Networks for Semantic Segmentation
Intro to Deep Learning for Computer Vision
Sky x technology
Convolutional neural network
Action Recognition (Thesis presentation)
An Introduction to Optimal Transport
Convolutional Neural Network (CNN)
Trusted Computing Base
Ad

Similar to Image Aesthetic Assessment With Deep Learning (20)

PPTX
Gen Z - Circular Economy.pptx
PPTX
Multi-focus Application Presentation in ICSSE2017
PPT
Automatic image mosaicing an approach based on fft
PDF
Rosa Cano_openexpo2018_sin_texto
PDF
Bringing Cable TV to VR
PDF
Construir productos y transformación digital mezclando tecnología y educación...
PPTX
(EIA) REGULACION. PARA PLATA SOLAR EN MONTERIA
PPT
Atlanta Airport
PPTX
Improvement of strip thickness control through the process of data analytics
PDF
Pintrace: Distributed tracing @Pinterest
PDF
Pintrace: Distributed tracing@Pinterest
PDF
Drones: What’s Next?
PDF
Drones - What's next?
PDF
cofaso Presentation
PDF
08.06.15 training occupant_engagement
PDF
ET-Q2-L1-Graphic Design.Lesson Proper for Quarter 1
PDF
Hypothesis Driven Development at Agile Australia 2015
PPTX
Bluesky - Esri UK Annual Conference 2016
PDF
Studio Art Direct
PDF
JacobSpence - 2016
Gen Z - Circular Economy.pptx
Multi-focus Application Presentation in ICSSE2017
Automatic image mosaicing an approach based on fft
Rosa Cano_openexpo2018_sin_texto
Bringing Cable TV to VR
Construir productos y transformación digital mezclando tecnología y educación...
(EIA) REGULACION. PARA PLATA SOLAR EN MONTERIA
Atlanta Airport
Improvement of strip thickness control through the process of data analytics
Pintrace: Distributed tracing @Pinterest
Pintrace: Distributed tracing@Pinterest
Drones: What’s Next?
Drones - What's next?
cofaso Presentation
08.06.15 training occupant_engagement
ET-Q2-L1-Graphic Design.Lesson Proper for Quarter 1
Hypothesis Driven Development at Agile Australia 2015
Bluesky - Esri UK Annual Conference 2016
Studio Art Direct
JacobSpence - 2016
Ad

Recently uploaded (20)

PDF
Analyzing Impact of Pakistan Economic Corridor on Import and Export in Pakist...
PDF
737-MAX_SRG.pdf student reference guides
PPTX
MET 305 2019 SCHEME MODULE 2 COMPLETE.pptx
PDF
UNIT no 1 INTRODUCTION TO DBMS NOTES.pdf
PDF
Artificial Superintelligence (ASI) Alliance Vision Paper.pdf
PPT
Total quality management ppt for engineering students
PPTX
6ME3A-Unit-II-Sensors and Actuators_Handouts.pptx
PDF
BIO-INSPIRED HORMONAL MODULATION AND ADAPTIVE ORCHESTRATION IN S-AI-GPT
PDF
null (2) bgfbg bfgb bfgb fbfg bfbgf b.pdf
PDF
86236642-Electric-Loco-Shed.pdf jfkduklg
PPT
Occupational Health and Safety Management System
PPT
A5_DistSysCh1.ppt_INTRODUCTION TO DISTRIBUTED SYSTEMS
PPTX
Fundamentals of Mechanical Engineering.pptx
PDF
keyrequirementskkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkk
PDF
Automation-in-Manufacturing-Chapter-Introduction.pdf
PDF
Soil Improvement Techniques Note - Rabbi
PPTX
Nature of X-rays, X- Ray Equipment, Fluoroscopy
PPTX
communication and presentation skills 01
PDF
Mitigating Risks through Effective Management for Enhancing Organizational Pe...
PPT
INTRODUCTION -Data Warehousing and Mining-M.Tech- VTU.ppt
Analyzing Impact of Pakistan Economic Corridor on Import and Export in Pakist...
737-MAX_SRG.pdf student reference guides
MET 305 2019 SCHEME MODULE 2 COMPLETE.pptx
UNIT no 1 INTRODUCTION TO DBMS NOTES.pdf
Artificial Superintelligence (ASI) Alliance Vision Paper.pdf
Total quality management ppt for engineering students
6ME3A-Unit-II-Sensors and Actuators_Handouts.pptx
BIO-INSPIRED HORMONAL MODULATION AND ADAPTIVE ORCHESTRATION IN S-AI-GPT
null (2) bgfbg bfgb bfgb fbfg bfbgf b.pdf
86236642-Electric-Loco-Shed.pdf jfkduklg
Occupational Health and Safety Management System
A5_DistSysCh1.ppt_INTRODUCTION TO DISTRIBUTED SYSTEMS
Fundamentals of Mechanical Engineering.pptx
keyrequirementskkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkk
Automation-in-Manufacturing-Chapter-Introduction.pdf
Soil Improvement Techniques Note - Rabbi
Nature of X-rays, X- Ray Equipment, Fluoroscopy
communication and presentation skills 01
Mitigating Risks through Effective Management for Enhancing Organizational Pe...
INTRODUCTION -Data Warehousing and Mining-M.Tech- VTU.ppt

Image Aesthetic Assessment With Deep Learning

  • 2. Who am I ? M a c h i n e L e a r n i n g E n g i n e e r A m i t K u s h w a h a
  • 3. Image Aesthetics W H A T D E F I N E S A N I M A G E ? ★ Image aesthetic evaluation aims to classify photos into high quality or low quality from the perspective of human. ★ Many visual features have been explored under this formulation (handcrafted features), ranging from low-level image statistics, such as edge distributions and color histograms, to high-level photographic rules, such as the rule of thirds and golden ratio
  • 4. Beautiful Capture “ B E A U T Y I S R E A L L Y I N T H E E Y E O F T H E B E H O L D E R ” ✦ While everyone has different tastes, there are universally accepted norms when it comes to beauty – things which everyone pretty much agrees are beautiful, like sunsets or sunrises over the mountains or the ocean. A S U B J E C T I V E C O N J E C T U R E
  • 5. T E C H N I Q U E S O F I M A G E C A P T U R I N G A S U B J E C T I V E C O N J E C T U R E ✦ Image aesthetics can be affected by the different usages of lighting, contrast , and image composition Beautiful Capture
  • 6. The Two Images F A C T O R S L I K E L I G H T I N G , C O N T R A S T , A N G L E D E T E R M I N E T H E A E S T H E T I C S
  • 7. Significance of Aesthetic Assessment ✦ With the ever increasing user generated content, biggest challenge is to showcase high quality images to users. ✦ Zomato measures the performance of restaurant for ad sales with click through rate (CTR) as one of the parameter. Restaurants with High Quality Display Image have greater CTR. 01 02 04 06 WRITESOMETHING
  • 8. Photo Gallery P H O T O S D O H A V E A S T O R Y
  • 9. Let’s solve Image Aesthetics with the most disruptive technology over the last five years.
  • 10. Deep Learning S T A T E O F T H E A R T I N M A N Y F I E L D S N O W
  • 11. Fixed Size Input Constraint D E E P M O D E L I N P U T L A Y E R ✦ This constraint of fixed size input in Deep Network compromises the aesthetics of original image. input images need to be transformed via cropping, scaling, or padding, which often damages image composition, reduces image resolution, or causes image distortion, thus compromising the aesthetics of the original images. Input Layer of Network
  • 12. A M A Z I N G I N T H E I R O R I G I N A L A S P E C T R A T I O Original Image
  • 13. Transformed Image R E S I Z I N G L O S E S T H E O R I G I N A L A E S T H E T I C S O F I M A G E O R I G I N A L R E S I Z E D
  • 14. Transformed Image O R I G I N A L R E S I Z E D R E S I Z I N G L O S E S T H E O R I G I N A L A E S T H E T I C S O F I M A G E
  • 15. DEEP LEARNING Demystifying the Network ✦ The first few layers of networks are either convolution or pooling layer, followed by fully connected layers. ✦ The fixed length input is only the constraint of fully-connected layer. Be it convolution or pooling, both perform operations on input, with the spatial information.
  • 16. Spatial Pyramid Pooling ✦ Network with another pooling strategy, ‘spatial pyramid pooling’ just before the fully connected layer, makes it capable of taking input of any size and generate fixed length representation.
  • 17. Max Pooling O U T P U T I S P R O P O R T I O N A L T O I N P U T Kernel size is fixed
  • 18. Spatial Pyramid Pooling Output dimension (Bin Size , Bin Size)
  • 20. Fixed Bin Numbers Fixed Kernel Size SPP-Net CNN
  • 21. Training a SPP-net Z O M A T O D A T A S E T U S E D T R A I N I N G L O S S T R A I N I N G A C C U R A C Y M A X - A C C U R A C Y - 7 7 . 2
  • 24. It’s often neglected, but plays a major role in classification learning. S C A L E I N V A R I A N T L E A R N I N G Allows to explore CNN without the constraint of fixed size representation of input. F I X E D S I Z E I N P U T Take Aways S P A T I A L P Y R A M I D P O O L I N G N E T
  • 25. R E F E R E N C E S ✦ Kaiming He, Xiangyu Zhang, Shaoqing Ren, Jian Sun, “Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition” ✦ Alex Krizhevsky, Ilya Sutskever, Geoffrey E. Hinton, ImageNet Classification with Deep Convolutional Neural Networks 2012 Composition-Preserving Deep Photo Aesthetics Assessment ✦ Long Mai, Hailin Jin, Feing Liu, Composition-Preserving Deep Photo Aesthetics Assessment
  • 26. h t t p s : / / w w w . g i t h u b . c o m / y a r d s t i c k 1 7 / h t t p s : / / w w w . l i n k e d i n . c o m / i n / y a r d s t i c k 1 7 / a m i t _ k u s h w a h a @ o u t l o o k . c o m Thank You F i n d m e h e r e

Editor's Notes

  • #8: To change the image behind the Mock up. Select the layer - > Right Click -> Send to Back -> Delete the image -> Drag & Drop your Own Picture -> Send to Back (again)
  • #19: To change the image behind the Mock up. Select the layer - > Right Click -> Send to Back -> Delete the image -> Drag & Drop your Own Picture -> Send to Back (again)
  • #20: To change the image behind the Mock up. Select the layer - > Right Click -> Send to Back -> Delete the image -> Drag & Drop your Own Picture -> Send to Back (again)
  • #25: To change the image behind the Mock up. Select the layer - > Right Click -> Send to Back -> Delete the image -> Drag & Drop your Own Picture -> Send to Back (again)