SlideShare a Scribd company logo
DIVING DEEP INTO SENTIMENT:
UNDERSTANDING FINE-TUNED CNNS
FOR VISUAL SENTIMENT PREDICTION
Víctor Campos Xavier Giró Amaia Salvador Brendan Jou
Outline
1. Introduction
2. Related work
3. Methodology and results
4. Conclusions
5. Future work
2
Introduction: motivation
3
4
Introduction: problem definition
▷ What?
▷ How?
▷ What? Predict the sentiment that an image provokes to a human
▷ How?
5
Introduction: problem definition
▷ What? Predict the sentiment that an image provokes to a human
▷ How?
6
Introduction: problem definition
▷ What? Predict the sentiment that an image provokes to a human
▷ How? Using Convolutional Neural Networks (CNNs)
7
CNN
Introduction: problem definition
8
CNN
Introduction: example
9
CNN
Introduction: example
Outline
1. Introduction
2. Related work
3. Methodology and results
4. Conclusions
5. Future work
10
Related work: low-level descriptors
11
Siersdorfer, S., Minack, E., Deng, F., & Hare, J. (2010, October). Analyzing
and predicting sentiment of images on the social web. In Proceedings of the
international conference on Multimedia (pp. 715-718). ACM.
Machajdik, J., & Hanbury, A. (2010, October). Affective image classification
using features inspired by psychology and art theory. In Proceedings of the
international conference on Multimedia (pp. 83-92). ACM.
12
Borth, D., Ji, R., Chen, T., Breuel, T., & Chang, S. F. (2013, October). Large-scale visual sentiment ontology and detectors using adjective
noun pairs. In Proceedings of the 21st ACM international conference on Multimedia (pp. 223-232). ACM.
Related work: SentiBank
Related work: CNNs for sentiment prediction
13
You, Q., Luo, J., Jin, H., & Yang, J. (2015). Robust image sentiment analysis using progressively trained and domain transferred deep
networks. In The Twenty-Ninth AAAI Conference on Artificial Intelligence (AAAI).
Outline
1. Introduction
2. Related work
3. Methodology and results
a. Convolutional Neural Networks
b. Datasets
c. Experimental setup and results
4. Conclusions
5. Future work
14
Convolutional Neural Networks
15
Krizhevsky, A.; Sutskever, I. & Hinton, G. E.: ImageNet Classification with Deep Convolutional Neural Networks. In: NIPS., 2012
Outline
1. Introduction
2. Related work
3. Methodology and results
a. Convolutional Neural Networks
b. Datasets
c. Experimental setup and results
4. Conclusions
5. Future work
16
Datasets
17
Flickr Twitter
Authors Borth et al. (2013) You et al. (2015)
Size ~500k 1269
Annotation method Textual tags
5 human
annotators
Datasets
18
Size
Flickr
dataset
Quality of the
annotations
Twitter
5-agree
dataset
Datasets
19
Size
Flickr
dataset
Quality of the
annotations
Twitter
5-agree
dataset
Outline
1. Introduction
2. Related work
3. Methodology and results
a. Convolutional Neural Networks
b. Datasets
c. Experimental setup and results
4. Conclusions
5. Future work
20
21
ARCHITECTURE
CaffeNet
Experimental setup: CNN
22
ARCHITECTURE
CaffeNet
SOFTWARE
[Jia’14]
Experimental setup: CNN
Experimental setup: CNN
23
Pre-trained
Model
ARCHITECTURE
CaffeNet
SOFTWARE
[Jia’14]
DATASET
[Deng’09]
Experimental setup: CNN
24
Model
ARCHITECTURE
CaffeNet
SOFTWARE
[Jia’14]
DATASET
[Deng’09]
DATASET
[You’15]
Twitter 5-agree
+
Fine-tuning
Pre-training
Experimental setup: outline
1. Fine-tuning CaffeNet
2. Layer by layer analysis
3. Layer ablation
4. Layer addition
25
Fine-tuning CaffeNet
26
Fine-tuning CaffeNet
27
Fine-tuning CaffeNet
28
Fine-tuning CaffeNet
29
Pre-trained
model
Fine-tuning CaffeNet
30
Experimental setup: outline
1. Fine-tuning CaffeNet
2. Layer by layer analysis
3. Layer ablation
4. Layer addition
31
Layer by layer analysis
32
Layer by layer analysis
33
Experimental setup: outline
1. Fine-tuning CaffeNet
2. Layer by layer analysis
3. Layer ablation
4. Layer addition
34
Layer ablation
35
Raw ablation
2-neuron on top
Layer ablation
36
Layer ablation
37
Layer ablation
38
~16M
params
(~25%)
Experimental setup: outline
1. Fine-tuning CaffeNet
2. Layer by layer analysis
3. Layer ablation
4. Layer addition
39
Layer addition
40
FC8: semantic
information
Layer addition
41
FC8: semantic
information
Outline
1. Introduction
2. Related work
3. Methodology and results
4. Conclusions
5. Future work
42
Conclusions
43
Pre-trained
model
44
CNN
Conclusions
Conclusions
45
Outline
1. Introduction
2. Related work
3. Methodology and results
4. Conclusions
5. Future work
46
Future work
47
Size
Flickr
dataset
Quality of the
annotations
Twitter
dataset
Future work
48
Size
Flickr
dataset
Quality of the
annotations
Twitter
dataset
MVSO
dataset
(†) B. Jou*, T. Chen*, N. Pappas*, M. Redi*, M. Topkara*, and S.-F. Chang. Visual Affect Around the World: A Large-scale Multilingual
Visual Sentiment Ontology. ACM Int'l Conference on Multimedia (MM), 2015.
†
49
Model
ARCHITECTURE
CaffeNet
SOFTWARE
[Jia’14]
DATASET
MVSO [Jou’15]
Future work
Acknowledgements
50
Financial supportTechnical support
Albert Gil Josep Pujal
Diving deep into sentiment: Understanding fine-tuned CNNs for visual sentiment prediction
Diving deep into sentiment: Understanding fine-tuned CNNs for visual sentiment prediction
Data augmentation (oversampling)
53
CNN
Data augmentation (oversampling)
54
CNN
Data augmentation (oversampling)
55
CNN
Data augmentation (oversampling)
56
CNN
Data augmentation (oversampling)
57
CNN
Data augmentation (oversampling)
58
CNN
Data augmentation (oversampling)
59
CNN

More Related Content

PDF
Layer-wise CNN Surgery for Visual Sentiment Prediction
PDF
Temporal Activity Detection in Untrimmed Videos with Recurrent Neural Networks
PDF
Deep Learning for Computer Vision (3/4): Video Analytics @ laSalle 2016
PDF
Video Analysis (D4L2 2017 UPC Deep Learning for Computer Vision)
PDF
Welcome (D1L1 2017 UPC Deep Learning for Computer Vision)
PDF
Deep Convnets for Video Processing (Master in Computer Vision Barcelona, 2016)
PDF
Video Analysis with Convolutional Neural Networks (Master Computer Vision Bar...
PDF
Closing, Course Offer 17/18 & Homework (D5 2017 UPC Deep Learning for Compute...
Layer-wise CNN Surgery for Visual Sentiment Prediction
Temporal Activity Detection in Untrimmed Videos with Recurrent Neural Networks
Deep Learning for Computer Vision (3/4): Video Analytics @ laSalle 2016
Video Analysis (D4L2 2017 UPC Deep Learning for Computer Vision)
Welcome (D1L1 2017 UPC Deep Learning for Computer Vision)
Deep Convnets for Video Processing (Master in Computer Vision Barcelona, 2016)
Video Analysis with Convolutional Neural Networks (Master Computer Vision Bar...
Closing, Course Offer 17/18 & Homework (D5 2017 UPC Deep Learning for Compute...

What's hot (20)

PDF
Deep Learning for Computer Vision: Video Analytics (UPC 2016)
PDF
Advanced Deep Architectures (D2L6 Deep Learning for Speech and Language UPC 2...
PDF
Learning with Videos (D4L4 2017 UPC Deep Learning for Computer Vision)
PDF
Deep Learning for Computer Vision (2/4): Object Analytics @ laSalle 2016
PDF
Multimodal Deep Learning (D4L4 Deep Learning for Speech and Language UPC 2017)
PPTX
Deep Learning - Convolutional Neural Networks - Architectural Zoo
PDF
SSII2021 [SS2] Deepfake Generation and Detection – An Overview (ディープフェイクの生成と検出)
PDF
Intro To Convolutional Neural Networks
PDF
AI&BigData Lab. Артем Чернодуб "Распознавание изображений методом Lazy Deep ...
PDF
Neural Architectures for Still Images - Xavier Giro- UPC Barcelona 2019
PDF
Deep Learning for Computer Vision (1/4): Image Analytics @ laSalle 2016
PPTX
Deep Learning Explained
PDF
Language and Vision (D3L5 2017 UPC Deep Learning for Computer Vision)
PDF
One Perceptron to Rule Them All: Language and Vision
PDF
Deep convnets for global recognition (Master in Computer Vision Barcelona 2016)
PDF
Image classification on Imagenet (D1L4 2017 UPC Deep Learning for Computer Vi...
PDF
Deep Learning - Convolutional Neural Networks
PDF
Self-supervised Visual Learning 2020 - Xavier Giro-i-Nieto - UPC Barcelona
PDF
Il deep learning ed una nuova generazione di AI - Simone Scardapane
PPTX
Andrew Ng, Chief Scientist at Baidu
Deep Learning for Computer Vision: Video Analytics (UPC 2016)
Advanced Deep Architectures (D2L6 Deep Learning for Speech and Language UPC 2...
Learning with Videos (D4L4 2017 UPC Deep Learning for Computer Vision)
Deep Learning for Computer Vision (2/4): Object Analytics @ laSalle 2016
Multimodal Deep Learning (D4L4 Deep Learning for Speech and Language UPC 2017)
Deep Learning - Convolutional Neural Networks - Architectural Zoo
SSII2021 [SS2] Deepfake Generation and Detection – An Overview (ディープフェイクの生成と検出)
Intro To Convolutional Neural Networks
AI&BigData Lab. Артем Чернодуб "Распознавание изображений методом Lazy Deep ...
Neural Architectures for Still Images - Xavier Giro- UPC Barcelona 2019
Deep Learning for Computer Vision (1/4): Image Analytics @ laSalle 2016
Deep Learning Explained
Language and Vision (D3L5 2017 UPC Deep Learning for Computer Vision)
One Perceptron to Rule Them All: Language and Vision
Deep convnets for global recognition (Master in Computer Vision Barcelona 2016)
Image classification on Imagenet (D1L4 2017 UPC Deep Learning for Computer Vi...
Deep Learning - Convolutional Neural Networks
Self-supervised Visual Learning 2020 - Xavier Giro-i-Nieto - UPC Barcelona
Il deep learning ed una nuova generazione di AI - Simone Scardapane
Andrew Ng, Chief Scientist at Baidu
Ad

Viewers also liked (7)

PPTX
Information retrieval dynamic indexing
PDF
Applying Machine Learning to Network Security Monitoring - BayThreat 2013
PDF
The Unreasonable Benefits of Deep Learning
PDF
Elastic Search: Beyond Ordinary Fulltext Search (Webexpo 2011 Prague)
PDF
Video Analysis with Recurrent Neural Networks (Master Computer Vision Barcelo...
PDF
Region-oriented Convolutional Networks for Object Retrieval
PPT
12. Indexing and Hashing in DBMS
Information retrieval dynamic indexing
Applying Machine Learning to Network Security Monitoring - BayThreat 2013
The Unreasonable Benefits of Deep Learning
Elastic Search: Beyond Ordinary Fulltext Search (Webexpo 2011 Prague)
Video Analysis with Recurrent Neural Networks (Master Computer Vision Barcelo...
Region-oriented Convolutional Networks for Object Retrieval
12. Indexing and Hashing in DBMS
Ad

Similar to Diving deep into sentiment: Understanding fine-tuned CNNs for visual sentiment prediction (20)

PPTX
Introduction to computer vision with Convoluted Neural Networks
PPTX
Introduction to computer vision
PPTX
Lecture 29 Convolutional Neural Networks - Computer Vision Spring2015
PDF
Convolutional Neural Networks for Image Classification (Cape Town Deep Learni...
PPTX
intro-to-cnn-April_2020.pptx
PPTX
introduction Convolutional Neural Networks.pptx
PPTX
Deep Learning for Image Processing on 16 June 2025 MITS.pptx
PDF
PyDresden 20170824 - Deep Learning for Computer Vision
PDF
Scene understanding
PPTX
Image captioning
PPTX
PyConZA'17 Deep Learning for Computer Vision
PPTX
Obscenity Detection in Images
PDF
Deep Learning for Computer Vision - ExecutiveML
PDF
SaturdaysAI: CNN in Underwater Robotics and eHealth
PDF
Machine learning for newbies
PPTX
Convolutional-Neural-Networks-Revolutionizing-Computer-Vision (1).pptx
PDF
PPTX
Image classification using convolutional neural network
PDF
物件偵測與辨識技術
DOCX
Automating e government using ai
Introduction to computer vision with Convoluted Neural Networks
Introduction to computer vision
Lecture 29 Convolutional Neural Networks - Computer Vision Spring2015
Convolutional Neural Networks for Image Classification (Cape Town Deep Learni...
intro-to-cnn-April_2020.pptx
introduction Convolutional Neural Networks.pptx
Deep Learning for Image Processing on 16 June 2025 MITS.pptx
PyDresden 20170824 - Deep Learning for Computer Vision
Scene understanding
Image captioning
PyConZA'17 Deep Learning for Computer Vision
Obscenity Detection in Images
Deep Learning for Computer Vision - ExecutiveML
SaturdaysAI: CNN in Underwater Robotics and eHealth
Machine learning for newbies
Convolutional-Neural-Networks-Revolutionizing-Computer-Vision (1).pptx
Image classification using convolutional neural network
物件偵測與辨識技術
Automating e government using ai

More from Universitat Politècnica de Catalunya (20)

PDF
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
PDF
Deep Generative Learning for All
PDF
The Transformer in Vision | Xavier Giro | Master in Computer Vision Barcelona...
PDF
Towards Sign Language Translation & Production | Xavier Giro-i-Nieto
PDF
The Transformer - Xavier Giró - UPC Barcelona 2021
PDF
Learning Representations for Sign Language Videos - Xavier Giro - NIST TRECVI...
PDF
Open challenges in sign language translation and production
PPTX
Generation of Synthetic Referring Expressions for Object Segmentation in Videos
PPTX
Discovery and Learning of Navigation Goals from Pixels in Minecraft
PDF
Learn2Sign : Sign language recognition and translation using human keypoint e...
PDF
Intepretability / Explainable AI for Deep Neural Networks
PDF
Convolutional Neural Networks - Xavier Giro - UPC TelecomBCN Barcelona 2020
PDF
Self-Supervised Audio-Visual Learning - Xavier Giro - UPC TelecomBCN Barcelon...
PDF
Attention for Deep Learning - Xavier Giro - UPC TelecomBCN Barcelona 2020
PDF
Generative Adversarial Networks GAN - Xavier Giro - UPC TelecomBCN Barcelona ...
PDF
Q-Learning with a Neural Network - Xavier Giró - UPC Barcelona 2020
PDF
Language and Vision with Deep Learning - Xavier Giró - ACM ICMR 2020 (Tutorial)
PDF
Image Segmentation with Deep Learning - Xavier Giro & Carles Ventura - ISSonD...
PDF
Curriculum Learning for Recurrent Video Object Segmentation
PDF
Deep Self-supervised Learning for All - Xavier Giro - X-Europe 2020
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All
The Transformer in Vision | Xavier Giro | Master in Computer Vision Barcelona...
Towards Sign Language Translation & Production | Xavier Giro-i-Nieto
The Transformer - Xavier Giró - UPC Barcelona 2021
Learning Representations for Sign Language Videos - Xavier Giro - NIST TRECVI...
Open challenges in sign language translation and production
Generation of Synthetic Referring Expressions for Object Segmentation in Videos
Discovery and Learning of Navigation Goals from Pixels in Minecraft
Learn2Sign : Sign language recognition and translation using human keypoint e...
Intepretability / Explainable AI for Deep Neural Networks
Convolutional Neural Networks - Xavier Giro - UPC TelecomBCN Barcelona 2020
Self-Supervised Audio-Visual Learning - Xavier Giro - UPC TelecomBCN Barcelon...
Attention for Deep Learning - Xavier Giro - UPC TelecomBCN Barcelona 2020
Generative Adversarial Networks GAN - Xavier Giro - UPC TelecomBCN Barcelona ...
Q-Learning with a Neural Network - Xavier Giró - UPC Barcelona 2020
Language and Vision with Deep Learning - Xavier Giró - ACM ICMR 2020 (Tutorial)
Image Segmentation with Deep Learning - Xavier Giro & Carles Ventura - ISSonD...
Curriculum Learning for Recurrent Video Object Segmentation
Deep Self-supervised Learning for All - Xavier Giro - X-Europe 2020

Recently uploaded (20)

PDF
Machine learning based COVID-19 study performance prediction
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PPTX
Spectroscopy.pptx food analysis technology
PPTX
MYSQL Presentation for SQL database connectivity
PDF
Encapsulation_ Review paper, used for researhc scholars
PDF
Empathic Computing: Creating Shared Understanding
DOCX
The AUB Centre for AI in Media Proposal.docx
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PPTX
Machine Learning_overview_presentation.pptx
PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
Network Security Unit 5.pdf for BCA BBA.
PDF
Review of recent advances in non-invasive hemoglobin estimation
PPT
Teaching material agriculture food technology
PPTX
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
Spectral efficient network and resource selection model in 5G networks
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
Machine learning based COVID-19 study performance prediction
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
Spectroscopy.pptx food analysis technology
MYSQL Presentation for SQL database connectivity
Encapsulation_ Review paper, used for researhc scholars
Empathic Computing: Creating Shared Understanding
The AUB Centre for AI in Media Proposal.docx
Building Integrated photovoltaic BIPV_UPV.pdf
Machine Learning_overview_presentation.pptx
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
Unlocking AI with Model Context Protocol (MCP)
Network Security Unit 5.pdf for BCA BBA.
Review of recent advances in non-invasive hemoglobin estimation
Teaching material agriculture food technology
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
Dropbox Q2 2025 Financial Results & Investor Presentation
Per capita expenditure prediction using model stacking based on satellite ima...
Digital-Transformation-Roadmap-for-Companies.pptx
Spectral efficient network and resource selection model in 5G networks
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx

Diving deep into sentiment: Understanding fine-tuned CNNs for visual sentiment prediction