SlideShare a Scribd company logo
UNIFESP at MediaEval 2016:
Predicting Media Interestingness Task
Jurandy Almeida
GIBIS Lab, Institute of Science and Technology, Federal University of S˜ao Paulo – UNIFESP
jurandy.almeida@unifesp.br
Introduction
• Developed in the MediaEval 2016 Pre-
dicting Media Interestingness Task
and for its video subtask only.
• The goal is to automatically select the
most interesting video segments ac-
cording to a common viewer.
• The focus is on features derived from
audio-visual content or associated tex-
tual information.
Proposed Approach
It relies on combining learning-to-rank algo-
rithms and exploiting visual information:
1. A simple histogram of motion patterns
is used for processing visual information.
2. A majority voting scheme is used for
combining machine-learned rankers and
predicting the interestingness of videos.
Visual Features
• Low-Level & Mid-Level Features: Not used
• Applying an algorithm to encode visual
properties from video segments.
– “Comparison of Video Sequences with
Histograms of Motion Patterns” [1].
• It relies on three steps:
1. partial decoding;
2. feature extraction;
3. signature generation.
106 111
100 88
91 94
95 90
90 93
96 91
1 1
2 1
2 1
0 3
Previous Current Next
Temporal Spatial
Time Series of Macroblocks
Video Frames
I-frames
Macroblock
Pixel Block
Histogram Distribution
DC coefficient
1: Partial Decoding
2: Feature Extraction
3: Signature Generation
Motion Pattern
0101100110010011
Histograms of Motion Patterns (HMP)
Learning to Rank Strategies
• Ranking SVM [5]: Use the traditional SVM classifier
to learn a ranking function.
• RankNet [2]: Probability distribution metrics as cost
functions to be optimized.
• RankBoost [4]: Regression error on weighted distri-
butions of pairwise rankings.
• ListNet [3]: Extension of RankNet that uses a ranked
list instead of pairwise rankings.
• Majority Voting [6]: The label with the most votes
is selected as the label for a given instance.
Input
Rankers R1 R2 RN
O1 O2 ON
Combining Rankings
Output ˆo
Experimental Protocol
• 4-fold cross validation
• Development data
– 5,054 videos from 52 movie trailers
• Test data
– 2,342 videos from 26 movie trailers
• Mean Average Precision (MAP)
Configurations of Runs
Run Learning-to-Rank Strategy
1 Ranking SVM
2 RankNet
3 RankBoost
4 ListNet
5 Majority Voting
Experimental Results
Results obtained on the development data. Results of the official submitted runs.
Ranking
SVM
RankN
et
RankBoost
ListN
et
M
ajority
Voting
MAP(%)
10
11
12
13
14
15
16
17
18
19
20
0
5
10
15
20
25
MAP(%)
Ranking
SVM
RankN
et
RankBoost
ListN
et
M
ajority
Voting
18.15
16.1716.17 16.56
14.35
AP per movie trailer achieved in each run.
video−52
video−53
video−54
video−55
video−56
video−57
video−58
video−59
video−60
video−61
video−62
video−63
video−64
video−65
video−66
video−67
video−68
video−69
video−70
video−71
video−72
video−73
video−74
video−75
video−76
video−77
0
10
20
30
40
50
60
70
AveragePrecision(%)
Ranking SVM
RankNet
RankBoost
ListNet
Majority Voting
The learning-to-rank algorithms
provide complementary infor-
mation that can be combined by
fusion techniques aiming at pro-
ducing better results.
Remarks
• The proposed approach has explored only
visual properties. Different learning-
to-rank strategies were considered, in-
cluding a fusion of all of them.
• Results demonstrate that the proposed
approach is promising. By combining
learning-to-rank algorithms, it is possible
to make a contribution to better results.
Future Works
The investigation of a smarter strategy for combining learning-to-rank algorithms and considering
other information sources to include more features semantically related to visual content.
Acknowledgements
This research was supported by Brazilian agencies FAPESP, CAPES, and CNPq.
References
[1] J. Almeida, N. J. Leite, and R. S. Torres. Compar-
ison of video sequences with Histograms of Motion
Patterns. In ICIP, pages 3673–3676, 2011.
[2] C. J. C. Burges, T. Shaked, E. Renshaw, A. Lazier,
M. Deeds, N. Hamilton and G. N. Hullender. Learn-
ing to rank using gradient descent. In ICML, pages
89–96, 2005.
[3] Z. Cao, T. Qin, T.-Y. Liu, M.-F. Tsai, and H. Li.
Learning to rank: from pairwise approach to listwise
approach. In ICML, pages 129–136, 2007.
[4] Y. Freund, R. D. Iyer, R. E. Schapire, and Y. Singer.
An efficient boosting algorithm for combining prefer-
ences. Journal of Machine Learning Research, 4:933–
969, 2003.
[5] T. Joachims. Training linear SVMs in linear time. In
ACM SIGKDD, pages 217–226, 2006.
[6] L. Lam and C. Y. Suen. Application of majority vot-
ing to pattern recognition: an analysis of its behavior
and performance. IEEE Trans. Systems, Man, and
Cybernetics, Part A, 27(5):553–568, 1997.

More Related Content

PPTX
Predictive Modeling: Predict Premium Subscriber for a Leading International M...
PDF
(SURVEY) Active Learning
PDF
(SURVEY) Semi Supervised Learning
PDF
Large Scale GAN Training for High Fidelity Natural Image Synthesis
PPTX
Population Based Training of Neural Networks
PDF
A principal component analysis-based feature dimensionality reduction scheme ...
PDF
Audio augmentation
PDF
Content Based Image Retrieval Based on Color: A Survey
Predictive Modeling: Predict Premium Subscriber for a Leading International M...
(SURVEY) Active Learning
(SURVEY) Semi Supervised Learning
Large Scale GAN Training for High Fidelity Natural Image Synthesis
Population Based Training of Neural Networks
A principal component analysis-based feature dimensionality reduction scheme ...
Audio augmentation
Content Based Image Retrieval Based on Color: A Survey

Viewers also liked (14)

PDF
DOCX
DOCX
SRINIVAS_Project Engineer_Construction Superintendent
DOCX
Curriculum Vitae – Gary Hubbard
PDF
Kalle Mallikarjuna
PPTX
M3 game download
DOC
eng-ahmed-ali-CV
PDF
MediaEval 2016 - ININ Submission to Zero Cost ASR Task
PDF
MediaEval 2016 - UPMC at MediaEval2016 Retrieving Diverse Social Images Task
DOC
Resume srivenkatesh instrumentation engineer
PDF
Master Sportwissenschaft studieren | HG Hochschule
PDF
Bachelor Creative Media berufsbegleitend studieren an der H:G Hochschule
PDF
Psychologie studieren (Bachelor of Science)
DOCX
SRINIVAS_Project Engineer_Construction Superintendent
Curriculum Vitae – Gary Hubbard
Kalle Mallikarjuna
M3 game download
eng-ahmed-ali-CV
MediaEval 2016 - ININ Submission to Zero Cost ASR Task
MediaEval 2016 - UPMC at MediaEval2016 Retrieving Diverse Social Images Task
Resume srivenkatesh instrumentation engineer
Master Sportwissenschaft studieren | HG Hochschule
Bachelor Creative Media berufsbegleitend studieren an der H:G Hochschule
Psychologie studieren (Bachelor of Science)
Ad

Similar to MediaEval 2016 - UNIFESP Predicting Media Interestingness Task (20)

PDF
MediaEval 2016 - UNIFESP Predicting Media Interestingness Task
PDF
sourabh_bajaj_resume
PDF
CVPR2022 paper reading - Balanced multimodal learning - All Japan Computer Vi...
PDF
Games to Improve Clinical Practice and Healthcare Administration
PDF
MediaEval 2017 - Interestingness Task: GIBIS at MediaEval 2017: Predicting Me...
PDF
Predicting Engagement in Video Lectures
DOC
Word
PPTX
fINAL Lesson_1_Course_Introduction_v1.pptx
PPT
Programs Coming Together Using ExamSoft to assess interprofessional education...
PPTX
L injection toward effective collaborative filtering using uninteresting items
PDF
Artificial Intelligence based Pattern Recognition
PDF
Reducing Labeling Costs in Sentiment Analysis via Semi-Supervised Learning
PPTX
Multi-modal sources for predictive modeling using deep learning
PDF
Parameter Estimation of GOEL-OKUMOTO Model by Comparing ACO with MLE Method
PDF
IRJET- Software Bug Prediction using Machine Learning Approach
PDF
Robust Tracking Via Feature Mapping Method and Support Vector Machine
PDF
Training and Placement Portal
PPTX
Lead Scores 64.pptxj,jhjyfjyffjufjfkfjgk
PDF
Introducing the HOBBIT platform into the Ontology Alignment Evaluation Campaign
PDF
A Software Measurement Using Artificial Neural Network and Support Vector Mac...
MediaEval 2016 - UNIFESP Predicting Media Interestingness Task
sourabh_bajaj_resume
CVPR2022 paper reading - Balanced multimodal learning - All Japan Computer Vi...
Games to Improve Clinical Practice and Healthcare Administration
MediaEval 2017 - Interestingness Task: GIBIS at MediaEval 2017: Predicting Me...
Predicting Engagement in Video Lectures
Word
fINAL Lesson_1_Course_Introduction_v1.pptx
Programs Coming Together Using ExamSoft to assess interprofessional education...
L injection toward effective collaborative filtering using uninteresting items
Artificial Intelligence based Pattern Recognition
Reducing Labeling Costs in Sentiment Analysis via Semi-Supervised Learning
Multi-modal sources for predictive modeling using deep learning
Parameter Estimation of GOEL-OKUMOTO Model by Comparing ACO with MLE Method
IRJET- Software Bug Prediction using Machine Learning Approach
Robust Tracking Via Feature Mapping Method and Support Vector Machine
Training and Placement Portal
Lead Scores 64.pptxj,jhjyfjyffjufjfkfjgk
Introducing the HOBBIT platform into the Ontology Alignment Evaluation Campaign
A Software Measurement Using Artificial Neural Network and Support Vector Mac...
Ad

More from multimediaeval (20)

PPTX
Classification of Strokes in Table Tennis with a Three Stream Spatio-Temporal...
PDF
HCMUS at MediaEval 2020: Ensembles of Temporal Deep Neural Networks for Table...
PDF
Sports Video Classification: Classification of Strokes in Table Tennis for Me...
PDF
Predicting Media Memorability from a Multimodal Late Fusion of Self-Attention...
PPTX
Essex-NLIP at MediaEval Predicting Media Memorability 2020 Task
PDF
Overview of MediaEval 2020 Predicting Media Memorability task: What Makes a V...
PDF
Fooling an Automatic Image Quality Estimator
PDF
Fooling Blind Image Quality Assessment by Optimizing a Human-Understandable C...
PDF
Pixel Privacy: Quality Camouflage for Social Images
PDF
HCMUS at MediaEval 2020:Image-Text Fusion for Automatic News-Images Re-Matching
PPTX
Efficient Supervision Net: Polyp Segmentation using EfficientNet and Attentio...
PDF
HCMUS at Medico Automatic Polyp Segmentation Task 2020: PraNet and ResUnet++ ...
PDF
Depth-wise Separable Atrous Convolution for Polyps Segmentation in Gastro-Int...
PPTX
Deep Conditional Adversarial learning for polyp Segmentation
PPTX
A Temporal-Spatial Attention Model for Medical Image Detection
PPTX
HCMUS-Juniors 2020 at Medico Task in MediaEval 2020: Refined Deep Neural Netw...
PDF
Fine-tuning for Polyp Segmentation with Attention
PPTX
Bigger Networks are not Always Better: Deep Convolutional Neural Networks for...
PPTX
Insights for wellbeing: Predicting Personal Air Quality Index using Regressio...
PDF
Use Visual Features From Surrounding Scenes to Improve Personal Air Quality ...
Classification of Strokes in Table Tennis with a Three Stream Spatio-Temporal...
HCMUS at MediaEval 2020: Ensembles of Temporal Deep Neural Networks for Table...
Sports Video Classification: Classification of Strokes in Table Tennis for Me...
Predicting Media Memorability from a Multimodal Late Fusion of Self-Attention...
Essex-NLIP at MediaEval Predicting Media Memorability 2020 Task
Overview of MediaEval 2020 Predicting Media Memorability task: What Makes a V...
Fooling an Automatic Image Quality Estimator
Fooling Blind Image Quality Assessment by Optimizing a Human-Understandable C...
Pixel Privacy: Quality Camouflage for Social Images
HCMUS at MediaEval 2020:Image-Text Fusion for Automatic News-Images Re-Matching
Efficient Supervision Net: Polyp Segmentation using EfficientNet and Attentio...
HCMUS at Medico Automatic Polyp Segmentation Task 2020: PraNet and ResUnet++ ...
Depth-wise Separable Atrous Convolution for Polyps Segmentation in Gastro-Int...
Deep Conditional Adversarial learning for polyp Segmentation
A Temporal-Spatial Attention Model for Medical Image Detection
HCMUS-Juniors 2020 at Medico Task in MediaEval 2020: Refined Deep Neural Netw...
Fine-tuning for Polyp Segmentation with Attention
Bigger Networks are not Always Better: Deep Convolutional Neural Networks for...
Insights for wellbeing: Predicting Personal Air Quality Index using Regressio...
Use Visual Features From Surrounding Scenes to Improve Personal Air Quality ...

Recently uploaded (20)

PDF
. Radiology Case Scenariosssssssssssssss
PPTX
ANEMIA WITH LEUKOPENIA MDS 07_25.pptx htggtftgt fredrctvg
PDF
Unveiling a 36 billion solar mass black hole at the centre of the Cosmic Hors...
PPTX
7. General Toxicologyfor clinical phrmacy.pptx
PPTX
ECG_Course_Presentation د.محمد صقران ppt
PPT
The World of Physical Science, • Labs: Safety Simulation, Measurement Practice
PPTX
neck nodes and dissection types and lymph nodes levels
PPTX
BIOMOLECULES PPT........................
PPTX
famous lake in india and its disturibution and importance
PPTX
Vitamins & Minerals: Complete Guide to Functions, Food Sources, Deficiency Si...
PDF
CHAPTER 3 Cell Structures and Their Functions Lecture Outline.pdf
PPTX
Introduction to Cardiovascular system_structure and functions-1
PDF
Lymphatic System MCQs & Practice Quiz – Functions, Organs, Nodes, Ducts
PDF
Warm, water-depleted rocky exoplanets with surfaceionic liquids: A proposed c...
PDF
CAPERS-LRD-z9:AGas-enshroudedLittleRedDotHostingaBroad-lineActive GalacticNuc...
PDF
Biophysics 2.pdffffffffffffffffffffffffff
PDF
Assessment of environmental effects of quarrying in Kitengela subcountyof Kaj...
PDF
HPLC-PPT.docx high performance liquid chromatography
PPTX
The KM-GBF monitoring framework – status & key messages.pptx
PDF
VARICELLA VACCINATION: A POTENTIAL STRATEGY FOR PREVENTING MULTIPLE SCLEROSIS
. Radiology Case Scenariosssssssssssssss
ANEMIA WITH LEUKOPENIA MDS 07_25.pptx htggtftgt fredrctvg
Unveiling a 36 billion solar mass black hole at the centre of the Cosmic Hors...
7. General Toxicologyfor clinical phrmacy.pptx
ECG_Course_Presentation د.محمد صقران ppt
The World of Physical Science, • Labs: Safety Simulation, Measurement Practice
neck nodes and dissection types and lymph nodes levels
BIOMOLECULES PPT........................
famous lake in india and its disturibution and importance
Vitamins & Minerals: Complete Guide to Functions, Food Sources, Deficiency Si...
CHAPTER 3 Cell Structures and Their Functions Lecture Outline.pdf
Introduction to Cardiovascular system_structure and functions-1
Lymphatic System MCQs & Practice Quiz – Functions, Organs, Nodes, Ducts
Warm, water-depleted rocky exoplanets with surfaceionic liquids: A proposed c...
CAPERS-LRD-z9:AGas-enshroudedLittleRedDotHostingaBroad-lineActive GalacticNuc...
Biophysics 2.pdffffffffffffffffffffffffff
Assessment of environmental effects of quarrying in Kitengela subcountyof Kaj...
HPLC-PPT.docx high performance liquid chromatography
The KM-GBF monitoring framework – status & key messages.pptx
VARICELLA VACCINATION: A POTENTIAL STRATEGY FOR PREVENTING MULTIPLE SCLEROSIS

MediaEval 2016 - UNIFESP Predicting Media Interestingness Task

  • 1. UNIFESP at MediaEval 2016: Predicting Media Interestingness Task Jurandy Almeida GIBIS Lab, Institute of Science and Technology, Federal University of S˜ao Paulo – UNIFESP jurandy.almeida@unifesp.br Introduction • Developed in the MediaEval 2016 Pre- dicting Media Interestingness Task and for its video subtask only. • The goal is to automatically select the most interesting video segments ac- cording to a common viewer. • The focus is on features derived from audio-visual content or associated tex- tual information. Proposed Approach It relies on combining learning-to-rank algo- rithms and exploiting visual information: 1. A simple histogram of motion patterns is used for processing visual information. 2. A majority voting scheme is used for combining machine-learned rankers and predicting the interestingness of videos. Visual Features • Low-Level & Mid-Level Features: Not used • Applying an algorithm to encode visual properties from video segments. – “Comparison of Video Sequences with Histograms of Motion Patterns” [1]. • It relies on three steps: 1. partial decoding; 2. feature extraction; 3. signature generation. 106 111 100 88 91 94 95 90 90 93 96 91 1 1 2 1 2 1 0 3 Previous Current Next Temporal Spatial Time Series of Macroblocks Video Frames I-frames Macroblock Pixel Block Histogram Distribution DC coefficient 1: Partial Decoding 2: Feature Extraction 3: Signature Generation Motion Pattern 0101100110010011 Histograms of Motion Patterns (HMP) Learning to Rank Strategies • Ranking SVM [5]: Use the traditional SVM classifier to learn a ranking function. • RankNet [2]: Probability distribution metrics as cost functions to be optimized. • RankBoost [4]: Regression error on weighted distri- butions of pairwise rankings. • ListNet [3]: Extension of RankNet that uses a ranked list instead of pairwise rankings. • Majority Voting [6]: The label with the most votes is selected as the label for a given instance. Input Rankers R1 R2 RN O1 O2 ON Combining Rankings Output ˆo Experimental Protocol • 4-fold cross validation • Development data – 5,054 videos from 52 movie trailers • Test data – 2,342 videos from 26 movie trailers • Mean Average Precision (MAP) Configurations of Runs Run Learning-to-Rank Strategy 1 Ranking SVM 2 RankNet 3 RankBoost 4 ListNet 5 Majority Voting Experimental Results Results obtained on the development data. Results of the official submitted runs. Ranking SVM RankN et RankBoost ListN et M ajority Voting MAP(%) 10 11 12 13 14 15 16 17 18 19 20 0 5 10 15 20 25 MAP(%) Ranking SVM RankN et RankBoost ListN et M ajority Voting 18.15 16.1716.17 16.56 14.35 AP per movie trailer achieved in each run. video−52 video−53 video−54 video−55 video−56 video−57 video−58 video−59 video−60 video−61 video−62 video−63 video−64 video−65 video−66 video−67 video−68 video−69 video−70 video−71 video−72 video−73 video−74 video−75 video−76 video−77 0 10 20 30 40 50 60 70 AveragePrecision(%) Ranking SVM RankNet RankBoost ListNet Majority Voting The learning-to-rank algorithms provide complementary infor- mation that can be combined by fusion techniques aiming at pro- ducing better results. Remarks • The proposed approach has explored only visual properties. Different learning- to-rank strategies were considered, in- cluding a fusion of all of them. • Results demonstrate that the proposed approach is promising. By combining learning-to-rank algorithms, it is possible to make a contribution to better results. Future Works The investigation of a smarter strategy for combining learning-to-rank algorithms and considering other information sources to include more features semantically related to visual content. Acknowledgements This research was supported by Brazilian agencies FAPESP, CAPES, and CNPq. References [1] J. Almeida, N. J. Leite, and R. S. Torres. Compar- ison of video sequences with Histograms of Motion Patterns. In ICIP, pages 3673–3676, 2011. [2] C. J. C. Burges, T. Shaked, E. Renshaw, A. Lazier, M. Deeds, N. Hamilton and G. N. Hullender. Learn- ing to rank using gradient descent. In ICML, pages 89–96, 2005. [3] Z. Cao, T. Qin, T.-Y. Liu, M.-F. Tsai, and H. Li. Learning to rank: from pairwise approach to listwise approach. In ICML, pages 129–136, 2007. [4] Y. Freund, R. D. Iyer, R. E. Schapire, and Y. Singer. An efficient boosting algorithm for combining prefer- ences. Journal of Machine Learning Research, 4:933– 969, 2003. [5] T. Joachims. Training linear SVMs in linear time. In ACM SIGKDD, pages 217–226, 2006. [6] L. Lam and C. Y. Suen. Application of majority vot- ing to pattern recognition: an analysis of its behavior and performance. IEEE Trans. Systems, Man, and Cybernetics, Part A, 27(5):553–568, 1997.