WHERE SHOULD
SALIENCY MODELS
LOOK NEXT ?
Zoya Bylinskii, Adrià Recasens, Ali Borji, Aude Oliva, Antonio Torralba, Frédo Durand “Where Should Saliency
Models Look Next?”, ECCV2016
Hello!I am Junting Pan
I am here because I love to give presentations.
You can find me at junting.pa@gmail.com
Saliency map
Saliency map is a probability distribution map, that
describe where human observers look in images.
It can provide important clues to human image
understanding :
- Main focus
- Action or event
- Participants
Regions of interest to human
1.
Motivation
Let’s start with the first set of slides
Breakthroughs because of ….
× Prediction score increment has benn stable for
looong time since ..
× CNN comes !!
× End to end manner : feat. extraction, feat.
integration & saliency prediction.
Breakthroughs because of ….
× Prediction score increment has benn stable for
looong time since ..
× CNN comes !!
× End to end manner : feat. extraction, feat.
integration & saliency prediction.
Evaluation scores have begun
to saturate
“Have saliency models begun
to converge on human
performance and is saliency
a solved problem ?”
HIGHER
LEVEL
- Text
- Objects of gaze and action
- Locations of motion
- People in image
A picture is worth a
thousand words
A complex idea can be conveyed with just a
single still image, namely making it possible to
absorb large amounts of data quickly.
Where should saliency models look next ? (UPC Reading Group)
2.
RELATED WORKS
And
EVALUATION
SALICON model
CNN applied at 2 different
image scales : small & BIG
BEST MODELS AT mit benchmark
DeepFix
FCN built on top of the
VGG.
10 MOST REPRESENTATIVE images
0,97 of Spearman correlation relative to their ranking on all dataset
images
3.
QUANTIFYING WHERE
people & MODELS LOOK
IN IMAGES
Name all image regions under the fixation map
95 percentile
threshold
651 regions
over 300
images
20 users and
2 MTurk task
Where should saliency models look next ? (UPC Reading Group)
Where should saliency models look next ? (UPC Reading Group)
WHAT CAN MODELS GAIN?
Gains that
model could
have if specific
region were
correctly
predicted
4.
The importance of
people
77%Correctly prediction (DeepFix)
Face saliency is underestimated when
faces are small, non-frontal, or not
centered in an image
Sometimes the actions in a scene are
more salient to human observers than
the participants, but saliency models
can overestimate the relative saliency
of the faces
Not all people are equally important
× Assign importance score to each face (using
fixation gt and predicted map.
× Relative ordering assign by saliency model
does not match by the importance given by
human fixations.
Name all image regions under the fixation map
GrayWhite Black
4.
The informativeness
of text
Understanding the text ...
× The description of a warning or a book are
more informative to observers than the
warning or book title..
× Only piece of English text..
Where should saliency models look next ? (UPC Reading Group)
4.
Object of gaze and
action
Objects and action
× Objects of gaze and/or action are usually
missed
× Detecting objects of action remains a problem
area for saliency model..
Where should saliency models look next ? (UPC Reading Group)
4.
Conclusion
Let’s finish with one slide
Conclusion
Models continue to under-predict crucial image
regions containing people, actions, and text.
These are the regions with greatest semantic
importance in an image, and become essential
for saliency applications
“Have saliency models begun
to converge on human
performance and is saliency
a solved problem ?”
“NoT YET!”
THANKS!Any questions?
Please contact with the authors :)

More Related Content

PPT
6308 Casper Presentationupdated2
PDF
U20 LO2 Composition
PDF
Joint unsupervised learning of deep representations and image clusters
PDF
The Importance of Time in Visual Attention Models
PDF
Visual Saliency Prediction with Deep Learning - Kevin McGuinness - UPC Barcel...
PDF
Image Retrieval using Graph based Visual Saliency
PDF
Efficient fusion of spatio-temporal saliency for frame wise saliency identifi...
PDF
Canosa Saliency Based Decision Support
6308 Casper Presentationupdated2
U20 LO2 Composition
Joint unsupervised learning of deep representations and image clusters
The Importance of Time in Visual Attention Models
Visual Saliency Prediction with Deep Learning - Kevin McGuinness - UPC Barcel...
Image Retrieval using Graph based Visual Saliency
Efficient fusion of spatio-temporal saliency for frame wise saliency identifi...
Canosa Saliency Based Decision Support

Similar to Where should saliency models look next ? (UPC Reading Group) (20)

PDF
Deep Visual Saliency - Kevin McGuinness - UPC Barcelona 2017
PDF
Saliency Detection via Divergence Analysis: A Unified Perspective ICPR 2012
PDF
Visual Saliency Model Using Sift and Comparison of Learning Approaches
PDF
Learning where to look: focus and attention in deep vision
PDF
Guided tour of visual attention
PDF
Attention mechanism in brain and deep neural network
PPTX
Visual Attention Model: Teach a Robot How to Watch the World
PDF
50120130405009
PDF
International Journal of Engineering Research and Development (IJERD)
PDF
Particle filter framework for salient object detection in videos
PDF
The impact of visual saliency prediction in image classification
PDF
A Comprehensive Analysis on Co-Saliency Detection on Learning Approaches in 3...
PDF
Object Detection with Computer Vision
PPTX
(Reading Group) Saliency Detection: A Boolean Map Approach
PDF
Video saliency-recognition by applying custom spatio temporal fusion technique
PPTX
MediaEval 2018: Show and Recall at MediaEval 2018 ViMemNet: Predicting Video ...
PDF
How saccadic models help predict where we look during a visual task? Applicat...
PDF
Assessing Explainability in Deep Learning for Medical Image Analysis
PDF
528 439-449
PDF
Light Roasted Use of Caffe in Yahoo! JAPAN
Deep Visual Saliency - Kevin McGuinness - UPC Barcelona 2017
Saliency Detection via Divergence Analysis: A Unified Perspective ICPR 2012
Visual Saliency Model Using Sift and Comparison of Learning Approaches
Learning where to look: focus and attention in deep vision
Guided tour of visual attention
Attention mechanism in brain and deep neural network
Visual Attention Model: Teach a Robot How to Watch the World
50120130405009
International Journal of Engineering Research and Development (IJERD)
Particle filter framework for salient object detection in videos
The impact of visual saliency prediction in image classification
A Comprehensive Analysis on Co-Saliency Detection on Learning Approaches in 3...
Object Detection with Computer Vision
(Reading Group) Saliency Detection: A Boolean Map Approach
Video saliency-recognition by applying custom spatio temporal fusion technique
MediaEval 2018: Show and Recall at MediaEval 2018 ViMemNet: Predicting Video ...
How saccadic models help predict where we look during a visual task? Applicat...
Assessing Explainability in Deep Learning for Medical Image Analysis
528 439-449
Light Roasted Use of Caffe in Yahoo! JAPAN
Ad

More from Universitat Politècnica de Catalunya (20)

PDF
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
PDF
Deep Generative Learning for All
PDF
The Transformer in Vision | Xavier Giro | Master in Computer Vision Barcelona...
PDF
Towards Sign Language Translation & Production | Xavier Giro-i-Nieto
PDF
The Transformer - Xavier Giró - UPC Barcelona 2021
PDF
Learning Representations for Sign Language Videos - Xavier Giro - NIST TRECVI...
PDF
Open challenges in sign language translation and production
PPTX
Generation of Synthetic Referring Expressions for Object Segmentation in Videos
PPTX
Discovery and Learning of Navigation Goals from Pixels in Minecraft
PDF
Learn2Sign : Sign language recognition and translation using human keypoint e...
PDF
Intepretability / Explainable AI for Deep Neural Networks
PDF
Convolutional Neural Networks - Xavier Giro - UPC TelecomBCN Barcelona 2020
PDF
Self-Supervised Audio-Visual Learning - Xavier Giro - UPC TelecomBCN Barcelon...
PDF
Attention for Deep Learning - Xavier Giro - UPC TelecomBCN Barcelona 2020
PDF
Generative Adversarial Networks GAN - Xavier Giro - UPC TelecomBCN Barcelona ...
PDF
Q-Learning with a Neural Network - Xavier Giró - UPC Barcelona 2020
PDF
Language and Vision with Deep Learning - Xavier Giró - ACM ICMR 2020 (Tutorial)
PDF
Image Segmentation with Deep Learning - Xavier Giro & Carles Ventura - ISSonD...
PDF
Curriculum Learning for Recurrent Video Object Segmentation
PDF
Deep Self-supervised Learning for All - Xavier Giro - X-Europe 2020
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All
The Transformer in Vision | Xavier Giro | Master in Computer Vision Barcelona...
Towards Sign Language Translation & Production | Xavier Giro-i-Nieto
The Transformer - Xavier Giró - UPC Barcelona 2021
Learning Representations for Sign Language Videos - Xavier Giro - NIST TRECVI...
Open challenges in sign language translation and production
Generation of Synthetic Referring Expressions for Object Segmentation in Videos
Discovery and Learning of Navigation Goals from Pixels in Minecraft
Learn2Sign : Sign language recognition and translation using human keypoint e...
Intepretability / Explainable AI for Deep Neural Networks
Convolutional Neural Networks - Xavier Giro - UPC TelecomBCN Barcelona 2020
Self-Supervised Audio-Visual Learning - Xavier Giro - UPC TelecomBCN Barcelon...
Attention for Deep Learning - Xavier Giro - UPC TelecomBCN Barcelona 2020
Generative Adversarial Networks GAN - Xavier Giro - UPC TelecomBCN Barcelona ...
Q-Learning with a Neural Network - Xavier Giró - UPC Barcelona 2020
Language and Vision with Deep Learning - Xavier Giró - ACM ICMR 2020 (Tutorial)
Image Segmentation with Deep Learning - Xavier Giro & Carles Ventura - ISSonD...
Curriculum Learning for Recurrent Video Object Segmentation
Deep Self-supervised Learning for All - Xavier Giro - X-Europe 2020
Ad

Recently uploaded (20)

PPTX
CYBER SECURITY the Next Warefare Tactics
PPTX
New ISO 27001_2022 standard and the changes
PDF
Global Data and Analytics Market Outlook Report
PPTX
eGramSWARAJ-PPT Training Module for beginners
PPT
statistic analysis for study - data collection
PPTX
Caseware_IDEA_Detailed_Presentation.pptx
PPTX
ai agent creaction with langgraph_presentation_
PPTX
Statisticsccdxghbbnhhbvvvvvvvvvv. Dxcvvvhhbdzvbsdvvbbvv ccc
PPTX
FMIS 108 and AISlaudon_mis17_ppt_ch11.pptx
PDF
Loose-Leaf for Auditing & Assurance Services A Systematic Approach 11th ed. E...
PPTX
sac 451hinhgsgshssjsjsjheegdggeegegdggddgeg.pptx
PPTX
SET 1 Compulsory MNH machine learning intro
PDF
ahaaaa shbzjs yaiw jsvssv bdjsjss shsusus s
PDF
Best Data Science Professional Certificates in the USA | IABAC
PPTX
chuitkarjhanbijunsdivndsijvndiucbhsaxnmzsicvjsd
PPTX
statsppt this is statistics ppt for giving knowledge about this topic
PPTX
AI AND ML PROPOSAL PRESENTATION MUST.pptx
PDF
Navigating the Thai Supplements Landscape.pdf
PDF
Tetra Pak Index 2023 - The future of health and nutrition - Full report.pdf
PPTX
Tapan_20220802057_Researchinternship_final_stage.pptx
CYBER SECURITY the Next Warefare Tactics
New ISO 27001_2022 standard and the changes
Global Data and Analytics Market Outlook Report
eGramSWARAJ-PPT Training Module for beginners
statistic analysis for study - data collection
Caseware_IDEA_Detailed_Presentation.pptx
ai agent creaction with langgraph_presentation_
Statisticsccdxghbbnhhbvvvvvvvvvv. Dxcvvvhhbdzvbsdvvbbvv ccc
FMIS 108 and AISlaudon_mis17_ppt_ch11.pptx
Loose-Leaf for Auditing & Assurance Services A Systematic Approach 11th ed. E...
sac 451hinhgsgshssjsjsjheegdggeegegdggddgeg.pptx
SET 1 Compulsory MNH machine learning intro
ahaaaa shbzjs yaiw jsvssv bdjsjss shsusus s
Best Data Science Professional Certificates in the USA | IABAC
chuitkarjhanbijunsdivndsijvndiucbhsaxnmzsicvjsd
statsppt this is statistics ppt for giving knowledge about this topic
AI AND ML PROPOSAL PRESENTATION MUST.pptx
Navigating the Thai Supplements Landscape.pdf
Tetra Pak Index 2023 - The future of health and nutrition - Full report.pdf
Tapan_20220802057_Researchinternship_final_stage.pptx

Where should saliency models look next ? (UPC Reading Group)