Exploiting User Comments for Audio-visual Content Indexing and Retrieval (ECIR'13)

Exploiting User Comments for Audio-visual
Content Indexing and Retrieval

Carsten Eickhoff, Wen Li and Arjen P. de Vries
March 25, 2013

Delft University of Technology

Challenge the future

Overview

• Introduction and statistics

• Harnessing user comments for content indexing

• Dealing with noise

• Retrieval experiments


Example


Content Annotation

• Audio-visual content retrieval relies on textual metadata

• Author-provided titles and descriptions are often not enough

• Collaborative tagging can provide more information


Available Annotation Sources

• Tagging content is a tedious task

• To make it more interesting, tagging is sometimes integrated into
games and reputation schemes

• Still, 58% of the videos in a 10,000-video YouTube sample are
annotated with fewer than 140 characters of text each

• At the same time, comment threads are massive…


Automatic term extraction
You will get kissed on the nearest
possible Friday by the love of your
life.Tomorrow will be the best day
of your life.However,if you don't
post this comment to at least 3
videos,you will die within 2
days.Now uv started reading dis
dunt stop…

omg i luv
that stuff

lol luv it luv
Cute
snoopy


Types of Noise

1. Uninformative comments
omg i luv
that stuff


Types of Noise

1. Uninformative comments

2. Unrelated comments (incl. spam)

You will get kissed on the nearest
possible Friday by the love of your
life.Tomorrow will be the best day
of your life.However,if you don't
post this comment to at least 3
videos,you will die within 2
days.Now uv started reading dis
dunt stop…


Types of Noise

2. Unrelated comments (incl. spam)

3. Misspellings and chat speak

OMG YEAH
LOL1!1!!! i luv
that part u like
robot chicken?


Types of Noise


2. Unrelated comments (incl. spam)

3. Misspellings and chat speak

4. Foreign language utterances

Snoopy est
si mignon!!


LM-based Keyword extraction

• Find terms whose likelihood of occurrence in one video’s comments
is higher than in the collection as a whole

• A notion similar to tf-idf, but within the language modeling (LM)
framework
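
The idea above can be sketched in a few lines. This is a minimal illustration, not the scoring function used in the talk: it assumes a pointwise Kullback-Leibler contribution as the score, additive smoothing with a hypothetical parameter `mu`, and a simple whitespace tokenizer. All function names are made up for the example.

```python
import math
from collections import Counter


def tokenize(text):
    # Hypothetical tokenizer: lowercase, split on whitespace, trim punctuation.
    tokens = (w.strip(".,!?").lower() for w in text.split())
    return [t for t in tokens if t]


def keyword_scores(local_comments, collection_comments, mu=1.0):
    """Score each term by how much more likely it is locally than globally."""
    local = Counter(t for c in local_comments for t in tokenize(c))
    glob = Counter(t for c in collection_comments for t in tokenize(c))
    n_local = sum(local.values())
    n_glob = sum(glob.values())
    vocab_size = len(set(local) | set(glob))
    scores = {}
    for term, tf in local.items():
        p_local = tf / n_local
        # Additive smoothing keeps the global estimate non-zero (assumption).
        p_glob = (glob[term] + mu) / (n_glob + mu * vocab_size)
        scores[term] = p_local * math.log(p_local / p_glob)
    return scores


thread = ["snoopy is so cute", "lol snoopy", "love snoopy lol"]
collection = thread + ["lol that was funny", "lol nice goal",
                       "lol love this song", "cute cat lol"]
scores = keyword_scores(thread, collection)
best_term = max(scores, key=scores.get)
```

Terms such as "lol" are frequent everywhere and score low, while a term concentrated in one thread rises to the top.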


Bursts

• Peaks in commenting activity may contain interesting information


Bursts


[External]: Actor wins an award


Bursts


[Internal]: Controversial comment


Generalized Burst Detection

• Kleinberg [1] measured bursts per term

• We need a more general representation of activity peaks

[1] Jon Kleinberg. Bursty and Hierarchical Structure in Streams, 2003
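
A term-independent activity peak can be illustrated with a simple threshold rule. The sketch below is an assumption for illustration only: it is neither Kleinberg's automaton nor necessarily the detector used in this work. It bins comment timestamps into fixed windows and flags windows whose count exceeds the mean by `k` standard deviations; `window` and `k` are hypothetical parameters.

```python
from collections import Counter
from statistics import mean, pstdev


def detect_bursts(timestamps, window=3600, k=2.0):
    """Return start times of windows with unusually many comments."""
    if not timestamps:
        return []
    start = min(timestamps)
    # Count comments per fixed-size window, regardless of the terms they use.
    counts = Counter(int((t - start) // window) for t in timestamps)
    series = [counts.get(i, 0) for i in range(max(counts) + 1)]
    threshold = mean(series) + k * pstdev(series)
    return [start + i * window for i, c in enumerate(series) if c > threshold]


# One comment per hour for a day, plus 20 extra comments during hour 10.
stamps = [h * 3600 for h in range(24)] + [10 * 3600 + i for i in range(1, 21)]
bursts = detect_bursts(stamps)
```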


Burst and Cause

• Capturing bursts seems to help

• But we also need the cause of each burst

• A mixture of language models accounts for burst and pre-burst
term likelihoods
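
As a rough sketch of such a mixture, the snippet below linearly interpolates two unigram models, one estimated from the comments inside a burst and one from the comments just before it. The interpolation weight `lam` and both function names are assumptions; the slides do not state the exact form of the mixture.

```python
from collections import Counter


def unigram_lm(comments):
    """Maximum-likelihood unigram model over whitespace tokens."""
    counts = Counter(w for c in comments for w in c.lower().split())
    total = sum(counts.values())
    return {w: n / total for w, n in counts.items()}


def mixture_likelihood(term, burst_lm, pre_burst_lm, lam=0.5):
    # Linear interpolation of burst and pre-burst likelihoods (assumed form).
    return lam * burst_lm.get(term, 0.0) + (1 - lam) * pre_burst_lm.get(term, 0.0)


burst_lm = unigram_lm(["oscar winner", "oscar speech"])
pre_burst_lm = unigram_lm(["great speech", "nice video"])
p_oscar = mixture_likelihood("oscar", burst_lm, pre_burst_lm)
p_video = mixture_likelihood("video", burst_lm, pre_burst_lm)
```

A larger `lam` puts more weight on what was said during the burst, a smaller one on the comments that may have triggered it.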


Vocabulary Regularization

• Currently: Discriminative terms are good

• As a result: Misspellings and non-English terms are recommended

• Wikipedia can help identify such cases:

Snoopy


Yeah!!1% Wait, that’s
not a word…
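
One way to sketch the Wikipedia check is to keep only candidate terms that match a known article title. The `wikipedia_titles` set and the function name are hypothetical; the slides do not say how the lookup is implemented, so treat this as an illustration of the filtering idea only.

```python
def regularize(candidates, wikipedia_titles):
    """Keep candidate terms that match a known article title (case-insensitive)."""
    known = {title.lower() for title in wikipedia_titles}
    return [term for term in candidates if term.lower() in known]


# Hypothetical title set; in practice this would come from a Wikipedia dump.
titles = {"Snoopy", "Peanuts"}
kept = regularize(["Snoopy", "Yeah!!1", "luv"], titles)
```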


Data Set

• 10,000 YouTube videos crawled in 2009/10

• Starting from 20 seed queries and following “related videos” links

• 4.7 M user comments

• On average 360 comments per video (σ = 984)


Retrieval experiments

• TREC-style retrieval experiment

• 40 manually constructed topics

• Pooled top 10 results evaluated via crowdsourcing

• BM25F models with fields per source (title, description, etc.)
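
For readers unfamiliar with fielded ranking, here is a minimal BM25F sketch. It assumes the common formulation in which length-normalized term frequencies are weighted per field, summed into a pseudo-frequency and then saturated with `k1`. The field weights, the shared `b`, and the non-negative idf variant are illustrative choices, not the settings used in these experiments.

```python
import math


def bm25f_score(query_terms, doc, collection, weights, b=0.75, k1=1.2):
    """doc and every item of collection map a field name to a token list."""
    n_docs = len(collection)
    avg_len = {f: sum(len(d.get(f, [])) for d in collection) / n_docs
               for f in weights}
    score = 0.0
    for term in query_terms:
        pseudo_tf = 0.0
        for field, weight in weights.items():
            tokens = doc.get(field, [])
            if not tokens or avg_len[field] == 0:
                continue
            # Per-field length normalization, then weighting.
            norm = 1 - b + b * len(tokens) / avg_len[field]
            pseudo_tf += weight * tokens.count(term) / norm
        df = sum(1 for d in collection
                 if any(term in d.get(f, []) for f in weights))
        idf = math.log(1 + (n_docs - df + 0.5) / (df + 0.5))
        score += idf * pseudo_tf / (k1 + pseudo_tf)
    return score


docs = [
    {"title": ["funny", "dog"], "comments": ["snoopy", "is", "cute"]},
    {"title": ["cat", "video"], "comments": ["lol", "nice"]},
]
weights = {"title": 2.0, "comments": 1.0}  # hypothetical field weights
with_match = bm25f_score(["snoopy"], docs[0], docs, weights)
without_match = bm25f_score(["snoopy"], docs[1], docs, weights)
```

In this toy example the first video is found only because the query term occurs in its comments field.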


Retrieval performance



• 40% gain in MAP
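
For reference, MAP (mean average precision) can be computed as in the sketch below. The rankings and relevance sets are toy data, not results from the experiments, and the function names are illustrative.

```python
def average_precision(ranking, relevant):
    """Mean of the precision values at the rank of each relevant document."""
    hits, total = 0, 0.0
    for rank, doc_id in enumerate(ranking, start=1):
        if doc_id in relevant:
            hits += 1
            total += hits / rank
    return total / len(relevant) if relevant else 0.0


def mean_average_precision(runs):
    """runs is a list of (ranking, relevant_set) pairs, one per topic."""
    return sum(average_precision(r, rel) for r, rel in runs) / len(runs)


ap = average_precision(["a", "b", "c"], {"a", "c"})
map_score = mean_average_precision([
    (["a", "b", "c"], {"a", "c"}),
    (["x", "y"], {"y"}),
])
```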


Experiments under Sparsity

• 58% of all video descriptions are shorter than 140 characters

• 50% of all titles are shorter than 35 characters

• We limit our corpus to videos with short titles and/or descriptions

• This affects 77% of all videos in our sample…


Retrieval performance (sparse)

• 54% gain in MAP


Closing the Circle


Conclusion

• User comments can enhance content annotation if we deal with
the domain-inherent noise appropriately

• By modeling bursts in commenting activity, we can find
informative, on-topic comments

• Through the use of Wikipedia, misspellings and foreign language
utterances can be reliably identified


Future Directions

• Additional regularization resources (e.g., Delicious, WordNet)

• New domains (e.g., social media streams linked to TV)

• Content-aware term extraction

• Cold start problem

• Cross-language ability


Thank You!

