SlideShare a Scribd company logo
CUNI at MediaEval 2012
Search and Hyperlinking Task
  Petra Galuščáková and Pavel Pecina

       Institute of Formal and Applied Linguistics
                      Charles University in Prague
          {galuscakova,pecina}@ufal.mff.cuni.cz
Search and Hyperlinking Task
●   Search and Hyperlinking task
    ●   Search Subtask
         –look up the relevant segment in the set of visual data
    ●   Hyperlinking Subtask
         and then possibly find another video segments related to the
         –
         retrieved one
●   We have participated in the Search Subtask only
●   Both transcripts (LIMSI and LIUM) were used
●   We did not use concept recognition, shot segmentation and face
    detection
Segmentation
●   The exact relevant passage in the recording should be retrieved
    → the transcripts were at first divided into segments
●   The IR system then was used for the retrieval in the collection of
    such segments


●   Two strategies for segmentation:
    ●   Regular segmentation according to the time
    ●   TextTilling
Regular Segmentation
●   Segments of 45, 60, 90 and 120 seconds
●   Segments were partially overlapping
    ●   Each 30 seconds a new segment was created.
    ●   The segment was removed from the list of the retrieved
        segments if it partially overlapped with one of the higher
        ranked segments.
TextTiling Segmentation
●   Good results achieved in RSR MediaEval Track in 2011 [Eskevich et
    al, 2012].
●   The transcripts were at first preprocessed and the sentences
    boundaries (based mainly on the punctuation) were marked.
●   Used settings:
    ●   average number of the words in a sentence was set to 27 and
    ●   average number of the sentences in one segment was set to 9
    ●   Better correspond to the 90 seconds long segments.
Terrier
●   Terrier information retrieval system was used
●   http://terrier.org
●   Wide range of applicable search engines, language models and
    available features
●   The highest score was achieved applying Hiemstra Language
    Model and TF IDF search engine.
●   Terrier settings: we used Porter Stemmer, stopword list, query
    expansion and implicit parameters for both TF IDF search
    engine and Hiemstra language model
Experiments
Results
    Tran.   Eng.    Seg        MRR                   mGAP                      MASP
                          60    30    10    60     30     10     Mod    60      30    10
-   LIMSI   Hiem    No    0.34 0.27   0.10 0.21    0.10    0     0.57    0      0      0
1   LIMSI TFIDF 90s 0.42 0.31         0.15 0.26    0.16   0.03   0.56   0.11   0.08 0.04
2   LIUM    Hiem    60s 0.38 0.34     0.19 0.26    0.17   0.03   0.50   0.11   0.11   0.06
3   LIMSI TFIDF 60s 0.47 0.40         0.19 0.31    0.20   0.04   0.62   0.16   0.14 0.06
4   LIMSI   Hiem    90s 0.47 0.36     0.19 0.29    0.19   0.04   0.64   0.12   0.09 0.04
5   LIMSI   Hiem    TT    0.28 0.26   0.2   0.21   0.16   0.03   0.37   0.16   0.16 0.15

●   Runs 1 and 2 were required, only title field of the query was used
●   Another three runs use also short title field
●   In all of the cases metadata information was added (description
    and tags) to each segment.
Observations
●   The highest MRR and mGAP scores were achieved applying
    regular segmentation.
●   The highest MASP score was achieved using TextTiling
    segmentation
●   The difference between scores achieved by TF IDF engine with 60
    seconds long segments and Hiemstra LM with 90 seconds long
    segments are very small for MRR and mGAP measures but it is
    higher for MASP measure.
Segment Length




●   Shorter segments achieve higher mGAP and MASP scores but this
    dependency is more pronounced for MASP measure
●   MRR score achieves the highest values for the 90 seconds long segments
●   Window size 60 seconds
Future Work
●   We would especially like to aim on the increasing mGAP and MASP
    score in future
    → we would like improve the segmentation precision
●   And use audio and visual information (e.g. shot segmentation)
●   Examine shorter segments
Conclusions
Conclusions
●   Two types of segmentation: regular according to the time and
    TextTiling
●   Terrier IR system, Hiemstra LM and TF IDF search engine
    were used
●   The highest MRR and mGAP scores were achieved using
    regular segmentation (60 and 90 seconds) comparing to
    TextTiling segmentation algorithm which achieved the highest
    MASP scores
●   The dependency of the measures on the length of the
    segments was examined.
Thank you

More Related Content

PPTX
Social Event Detection at MediaEval 2012: Challenges, Dataset and Evaluation
PPTX
Ghent University-IBBT at MediaEval 2012 Search and Hyperlinking: Semantic Sim...
PDF
The CLEF Initiative From 2010 to 2012 and Onwards
PPTX
Brave New Task: Musiclef Multimodal Music Tagging
PDF
Search and Hyperlinking Task at MediaEval 2012
PDF
A Multimodal Approach for Video Geocoding
PDF
DCU Search Runs at MediaEval 2012: Search and Hyperlinking Task
PDF
MediaEval 2012 Opening
Social Event Detection at MediaEval 2012: Challenges, Dataset and Evaluation
Ghent University-IBBT at MediaEval 2012 Search and Hyperlinking: Semantic Sim...
The CLEF Initiative From 2010 to 2012 and Onwards
Brave New Task: Musiclef Multimodal Music Tagging
Search and Hyperlinking Task at MediaEval 2012
A Multimodal Approach for Video Geocoding
DCU Search Runs at MediaEval 2012: Search and Hyperlinking Task
MediaEval 2012 Opening

Similar to CUNI at MediaEval 2012: Search and Hyperlinking Task (10)

PDF
Experiments with Segmentation Strategies for Passage Retrieval in Audio-Visua...
PDF
Multimodal Features for Search and Hyperlinking of Video Content
PDF
CUNI at MediaEval 2014 Search and Hyperlinking Task: Search Task Experiments
PDF
Multimodal Features for Linking Television Content
PDF
Audio Information for Hyperlinking of TV Content
PDF
CUNI at MediaEval 2014 Search and Hyperlinking Task: Search Task Experiments
PDF
CUNI at MediaEval 2014 Search and Hyperlinking Task: Visual and Prosodic Feat...
PDF
CUNI at MediaEval 2013 Similar Segments in Social Speech Task
PDF
DOCX
MultiModal Image Search on Mobile Device
Experiments with Segmentation Strategies for Passage Retrieval in Audio-Visua...
Multimodal Features for Search and Hyperlinking of Video Content
CUNI at MediaEval 2014 Search and Hyperlinking Task: Search Task Experiments
Multimodal Features for Linking Television Content
Audio Information for Hyperlinking of TV Content
CUNI at MediaEval 2014 Search and Hyperlinking Task: Search Task Experiments
CUNI at MediaEval 2014 Search and Hyperlinking Task: Visual and Prosodic Feat...
CUNI at MediaEval 2013 Similar Segments in Social Speech Task
MultiModal Image Search on Mobile Device
Ad

More from MediaEval2012 (20)

PDF
Closing
PPTX
Brave New Task: User Account Matching
PPT
Overview of MediaEval 2012 Visual Privacy Task
PPT
MediaEval 2012 Visual Privacy Task: Privacy and Intelligibility through Pixel...
PPT
MediaEval 2012 Visual Privacy Task: Applying Transform-domain Scrambling to A...
PPTX
Violent Scenes Detection with Large, Brute-forced Acoustic and Visual Feature...
PPTX
mevd2012 esra_
PPTX
Technicolor/INRIA/Imperial College London at the MediaEval 2012 Violent Scene...
PPT
The MediaEval 2012 Affect Task: Violent Scenes Detectio
PPT
NII, Japan at MediaEval 2012 Violent Scenes Detection Affect Task
PDF
LIG at MediaEval 2012 affect task: use of a generic method
PPT
Violence Detection in Video by Large Scale Multi-Scale Local Binary Pattern D...
PPT
ARF @ MediaEval 2012: An Uninformed Approach to Violence Detection in Hollywo...
PPTX
The Shanghai-Hongkong Team at MediaEval2012: Violent Scene Detection Using Tr...
PDF
UNICAMP-UFMG at MediaEval 2012: Genre Tagging Task
PDF
TUD at MediaEval 2012 genre tagging task: Multi-modality video categorization...
PPT
ARF @ MediaEval 2012: Multimodal Video Classification
PPTX
TUB @ MediaEval 2012 Tagging Task: Feature Selection Methods for Bag-of-(visu...
PPTX
KIT at MediaEval 2012 – Content–based Genre Classification with Visual Cues
PPT
Overview of the MediaEval 2012 Tagging Task
Closing
Brave New Task: User Account Matching
Overview of MediaEval 2012 Visual Privacy Task
MediaEval 2012 Visual Privacy Task: Privacy and Intelligibility through Pixel...
MediaEval 2012 Visual Privacy Task: Applying Transform-domain Scrambling to A...
Violent Scenes Detection with Large, Brute-forced Acoustic and Visual Feature...
mevd2012 esra_
Technicolor/INRIA/Imperial College London at the MediaEval 2012 Violent Scene...
The MediaEval 2012 Affect Task: Violent Scenes Detectio
NII, Japan at MediaEval 2012 Violent Scenes Detection Affect Task
LIG at MediaEval 2012 affect task: use of a generic method
Violence Detection in Video by Large Scale Multi-Scale Local Binary Pattern D...
ARF @ MediaEval 2012: An Uninformed Approach to Violence Detection in Hollywo...
The Shanghai-Hongkong Team at MediaEval2012: Violent Scene Detection Using Tr...
UNICAMP-UFMG at MediaEval 2012: Genre Tagging Task
TUD at MediaEval 2012 genre tagging task: Multi-modality video categorization...
ARF @ MediaEval 2012: Multimodal Video Classification
TUB @ MediaEval 2012 Tagging Task: Feature Selection Methods for Bag-of-(visu...
KIT at MediaEval 2012 – Content–based Genre Classification with Visual Cues
Overview of the MediaEval 2012 Tagging Task
Ad

Recently uploaded (20)

PDF
Hindi spoken digit analysis for native and non-native speakers
PPTX
1. Introduction to Computer Programming.pptx
PDF
TrustArc Webinar - Click, Consent, Trust: Winning the Privacy Game
PPTX
Tartificialntelligence_presentation.pptx
PPTX
O2C Customer Invoices to Receipt V15A.pptx
PPT
Module 1.ppt Iot fundamentals and Architecture
PDF
gpt5_lecture_notes_comprehensive_20250812015547.pdf
PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
PPTX
Group 1 Presentation -Planning and Decision Making .pptx
PDF
ENT215_Completing-a-large-scale-migration-and-modernization-with-AWS.pdf
PDF
From MVP to Full-Scale Product A Startup’s Software Journey.pdf
PPTX
TLE Review Electricity (Electricity).pptx
PDF
project resource management chapter-09.pdf
PDF
How ambidextrous entrepreneurial leaders react to the artificial intelligence...
PDF
A contest of sentiment analysis: k-nearest neighbor versus neural network
PPTX
observCloud-Native Containerability and monitoring.pptx
PDF
A comparative study of natural language inference in Swahili using monolingua...
PDF
NewMind AI Weekly Chronicles - August'25-Week II
PDF
August Patch Tuesday
PDF
Architecture types and enterprise applications.pdf
Hindi spoken digit analysis for native and non-native speakers
1. Introduction to Computer Programming.pptx
TrustArc Webinar - Click, Consent, Trust: Winning the Privacy Game
Tartificialntelligence_presentation.pptx
O2C Customer Invoices to Receipt V15A.pptx
Module 1.ppt Iot fundamentals and Architecture
gpt5_lecture_notes_comprehensive_20250812015547.pdf
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
Group 1 Presentation -Planning and Decision Making .pptx
ENT215_Completing-a-large-scale-migration-and-modernization-with-AWS.pdf
From MVP to Full-Scale Product A Startup’s Software Journey.pdf
TLE Review Electricity (Electricity).pptx
project resource management chapter-09.pdf
How ambidextrous entrepreneurial leaders react to the artificial intelligence...
A contest of sentiment analysis: k-nearest neighbor versus neural network
observCloud-Native Containerability and monitoring.pptx
A comparative study of natural language inference in Swahili using monolingua...
NewMind AI Weekly Chronicles - August'25-Week II
August Patch Tuesday
Architecture types and enterprise applications.pdf

CUNI at MediaEval 2012: Search and Hyperlinking Task

  • 1. CUNI at MediaEval 2012 Search and Hyperlinking Task Petra Galuščáková and Pavel Pecina Institute of Formal and Applied Linguistics Charles University in Prague {galuscakova,pecina}@ufal.mff.cuni.cz
  • 2. Search and Hyperlinking Task ● Search and Hyperlinking task ● Search Subtask –look up the relevant segment in the set of visual data ● Hyperlinking Subtask and then possibly find another video segments related to the – retrieved one ● We have participated in the Search Subtask only ● Both transcripts (LIMSI and LIUM) were used ● We did not use concept recognition, shot segmentation and face detection
  • 3. Segmentation ● The exact relevant passage in the recording should be retrieved → the transcripts were at first divided into segments ● The IR system then was used for the retrieval in the collection of such segments ● Two strategies for segmentation: ● Regular segmentation according to the time ● TextTilling
  • 4. Regular Segmentation ● Segments of 45, 60, 90 and 120 seconds ● Segments were partially overlapping ● Each 30 seconds a new segment was created. ● The segment was removed from the list of the retrieved segments if it partially overlapped with one of the higher ranked segments.
  • 5. TextTiling Segmentation ● Good results achieved in RSR MediaEval Track in 2011 [Eskevich et al, 2012]. ● The transcripts were at first preprocessed and the sentences boundaries (based mainly on the punctuation) were marked. ● Used settings: ● average number of the words in a sentence was set to 27 and ● average number of the sentences in one segment was set to 9 ● Better correspond to the 90 seconds long segments.
  • 6. Terrier ● Terrier information retrieval system was used ● http://terrier.org ● Wide range of applicable search engines, language models and available features ● The highest score was achieved applying Hiemstra Language Model and TF IDF search engine. ● Terrier settings: we used Porter Stemmer, stopword list, query expansion and implicit parameters for both TF IDF search engine and Hiemstra language model
  • 8. Results Tran. Eng. Seg MRR mGAP MASP 60 30 10 60 30 10 Mod 60 30 10 - LIMSI Hiem No 0.34 0.27 0.10 0.21 0.10 0 0.57 0 0 0 1 LIMSI TFIDF 90s 0.42 0.31 0.15 0.26 0.16 0.03 0.56 0.11 0.08 0.04 2 LIUM Hiem 60s 0.38 0.34 0.19 0.26 0.17 0.03 0.50 0.11 0.11 0.06 3 LIMSI TFIDF 60s 0.47 0.40 0.19 0.31 0.20 0.04 0.62 0.16 0.14 0.06 4 LIMSI Hiem 90s 0.47 0.36 0.19 0.29 0.19 0.04 0.64 0.12 0.09 0.04 5 LIMSI Hiem TT 0.28 0.26 0.2 0.21 0.16 0.03 0.37 0.16 0.16 0.15 ● Runs 1 and 2 were required, only title field of the query was used ● Another three runs use also short title field ● In all of the cases metadata information was added (description and tags) to each segment.
  • 9. Observations ● The highest MRR and mGAP scores were achieved applying regular segmentation. ● The highest MASP score was achieved using TextTiling segmentation ● The difference between scores achieved by TF IDF engine with 60 seconds long segments and Hiemstra LM with 90 seconds long segments are very small for MRR and mGAP measures but it is higher for MASP measure.
  • 10. Segment Length ● Shorter segments achieve higher mGAP and MASP scores but this dependency is more pronounced for MASP measure ● MRR score achieves the highest values for the 90 seconds long segments ● Window size 60 seconds
  • 11. Future Work ● We would especially like to aim on the increasing mGAP and MASP score in future → we would like improve the segmentation precision ● And use audio and visual information (e.g. shot segmentation) ● Examine shorter segments
  • 13. Conclusions ● Two types of segmentation: regular according to the time and TextTiling ● Terrier IR system, Hiemstra LM and TF IDF search engine were used ● The highest MRR and mGAP scores were achieved using regular segmentation (60 and 90 seconds) comparing to TextTiling segmentation algorithm which achieved the highest MASP scores ● The dependency of the measures on the length of the segments was examined.