Graph-based multimodal clustering for social event detection in large collections of images

Graph-based multimodal clustering for social event
detection in large collections of images
Georgios Petkos, Symeon Papadopoulos, Emmanouil Schinas,
Yiannis Kompatsiaris
Information Technologies Institute (ITI)
Centre for Research & Technologies Hellas (CERTH)
MMM 2014

Overview
• The problem of social event detection
• Existing approaches
• Proposed approach
• Evaluation
• Summary & future work

MMM 2014

#2

Georgios Petkos et al.

the problem

MMM 2014


Social events?
Attended by people and represented by multimedia content shared online
news
demonstration /
riot / speech

personal
wedding /
birthday / drinks

entertainment
concert / play /
sports

MMM 2014

#4


Pope Benedict
2007: iPhone release
2008: Android release
2010: iPad release

Pope Francis

http://petapixel.com/2013/03/14/a-starry-sea-of-cameras-at-the-unveiling-of-pope-francis/

MMM 2014

#5


Social event detection
Social event detection involves the automatic
organization of a multimedia collection C into groups
of items, each (group) of which corresponds to a
distinct event.
Can be treated as a multimodal clustering problem
COLLECTION

EVENT SET
E1
EVENT DETECTION

E2

EN
MMM 2014

#6


existing approaches

MMM 2014


Supervised event detection
• Rationale: use a large number of “known” event assignments
to “learn” how to identify “same event” / “same cluster”
relationships
Two variants:
• Item-to-item: learn whether two items belong to the same
event cluster or not.
– Model Input: the set of per modality distances between two images.

• Item-to-cluster: learn whether a new item belongs to a given
event cluster or not.
– Model input: the set of per modality distances between an image and
a prototype representation of the event.
MMM 2014

#8


Utilizing the “same event” model for clustering
• Item-to-item:
– (Incremental). For each incoming image, average all item-to-item SE
scores for all items in each cluster. Assign to best-matching cluster if
average above threshold or create new cluster (Becker et. al.).
– (Batch). Compute all item-item SE scores between each image and all
other images and form an indicator vector. Cluster indicator vectors
(Petkos et. al.).

• Item-to-cluster:
– (Incremental). For each cluster maintain a multimodal representation.
Compute SE score between each incoming item and the existing
prototype event representations. Assign to best-matching cluster if
above threshold or create new cluster (Becker et. al). Alternatively use
a second model for deciding if a new cluster should be added or not
(Reuter et. al.).
MMM 2014

#9


proposed approach

MMM 2014


Overview of proposed approach

• Item-to-item SE model utilized.
• Candidate neighbours selection step (first appears in (Reuter et. al)) using a set of per
modality indexes.
• Graph representation.
• Community detection on graph. Two variants of the algorithm:
• Batch: SCAN
• Incremental: QCA
MMM 2014

#11


Proposed approach: advantages

• Item-to-cluster methods may suffer from incorrect prototype representations (due to
averaging).
• Candidate neighbours selection step makes the application of the method much more
scalable.
• Graph representation: in order to introduce a scalable item-to-item approach without
averaging.
MMM 2014

#12


evaluation

MMM 2014


Evaluation setup
• Used the dataset of the 2012 SED task of MediaEval
• Ground truth: 7,779 photos clustered around 149
events (18 technical, 79 soccer, 52 Indignados)
• Assess the following aspects:
– accuracy of same-event classification
– compare clustering quality between item-to-cluster and
the two versions of item-to-item (batch & incremental)
– measure contributions of different features
– study generalization abilities of same event model

MMM 2014


Evaluation setup
Features:
• Uploader identity.
• Actual image content:
– GIST
– SURF, aggregated using the VLAD scheme

• Textual features: title, description and tags. Either a TF-IDF or
a BM25 weighting scheme is utilized.
• Time of media creation.
• Location, when available (geodesic distance).
Appropriate indices are utilized in order to rapidly fetch the
candidate neighbours for each modality.
MMM 2014


Evaluation: SE accuracy & clustering quality
• Same event classification accuracy 98.58% (SVM)
– 10K pos/neg training, 10K pos/neg testing (random)

• Clustering quality (NMI): 30/119 training/testing events [10 random splits]
– Incremental same or better than batch
– Item-to-item better than item-to-cluster (significant at 0.95 confidence)
BATCH

INCREMENTAL

ITEM-TO-CLUSTER

AVG

0.924

0.934

0.898

STD

0.019

0.021

0.027

• When non-event photos enter the dataset, NMI degrades quickly
NON-EVENT

BATCH

INCREMENTAL

ITEM-TO-CLUSTER

5%

0.4824

0.5164

0.3954

10%

0.3421

0.3683

0.2899

*

* In the second table, results were obtained using sed2011 for training and sed2012 for testing.
MMM 2014


Evaluation: contribution of features
• Same experiments using limited sets of features
FEATUERS

BATCH

INCREMENTAL

VISUAL

0.8020 ∓ 0.0193

0.8179 ∓ 0.0151

TEXTUAL

0.7925 ∓ 0.0255

0.7792 ∓ 0.0310

VISUAL+TIME

0.9244 ∓ 0.0195

0.9360 ∓ 0.0183

TEXTUAL+TIME

0.9016 ∓ 0.0173

0.9049 ∓ 0.0209

• Repeating the same experiments without the use of
blocking led to significantly worse results
– e.g. 0.030 for visual, 0.7148 for textual

• Time is an extremely important feature
MMM 2014


Evaluation: generalizing same event model
• Train using one event type > test on a different one
• In most cases negative impact
• In few cases, performance is very high!
BATCH
soccer

technical

Indignados

soccer

-

0.8658

0.8494

technical

0.7967

-

0.8977

Indignados

0.9645

0.8456

-

INCREMENTAL
soccer

Indignados

soccer

-

0.8892

0.8667

technical

0.7661

-

0.7735

Indignados
MMM 2014

technical

0.9845

0.8482


summary & future work

MMM 2014


Summary
• Scalable item-to-item multimodal clustering approach for SED
• Key characteristics:
– Item-to-item “same event” model
– Candidate neighbor selection
– Organization of “same event” relationships to a graph
– Efficient graph clustering algorithms: SCAN (batch) / QCA
(incremental)
• In general though, item-to-item approaches are less scalable
than item-to-cluster approaches
MMM 2014

#20


Future work
• Extend method so that non-event images are properly
handled
• Multiple sources of multimedia
• The MediaEval datasets are somewhat limited. Investigate
the effect of crawling / image collection to the quality of
results

MMM 2014

#21


thank you!
questions?
Acknowledgements

MMM 2014


online clustering of same-event graph
QCA maintains community structure incrementally following
graph change operations: node & edge addition (removal
operations not applicable in same event graph): based on the
concept of community attraction forces
Cz
new edge

new node

force from Cu to Cz

A

D

X

force from Cz to Cu

C

Cw

B

Cu

• Depending on a test (computed based on local
graph structure), community structure could
remain the same, X assigned to Cu or A to Cz.
• If A is assigned to Cu, all its neighbours will be
checked for potential reassignment.

MMM 2014

#23


graph clustering :: SCAN

hub
(μ,ε)- core

structural similarity

outlier

• resilient to spurious links (e.g. visual links that connect
unrelated images)
• very fast (scales linearly to the number of edges)
• leaves less-/ and over-connected items out of the clustering
MMM 2014

#24


References
•
•
•

•

•

Reuter, T., & Cimiano, P. (2012, June). Event-based classification of social media
streams. In Proceedings of the 2nd ACM International Conference on Multimedia
Retrieval (p. 22). ACM.
Petkos, G., Papadopoulos, S., & Kompatsiaris, Y. (2012). Social event detection
using multimodal clustering and integrating supervisory signals. In Proceedings of
the 2nd ACM International Conference on Multimedia Retrieval (p. 23). ACM.
Becker, H., Naaman, M. & Gravano, L.. Learning similarity metrics for event
identification in social media. In Proceedings of the third ACM International
Conference on Web search and Data Mining, WSDM ’10, pages 291–300, New
York.
Nguyen, N., Dinh, T., Xuan, Y., & Thai, M.. Adaptive algorithms for detecting
community structure in dynamic social networks. In INFOCOM 2011. 30th IEEE
International Conference on Computer Communications, Joint Conference of the
IEEE Computer and Communications Societies, 10-15 April 2011, Shanghai, China,
pages 2282–2290. IEEE, 2011.
Xu, X., Yuruk, N., Feng, Z. & Schweiger, T.. SCAN: a structural clustering algorithm
for networks. In Proceedings of the 13th ACM SIGKDD, KDD ’07, pages 824–833,
NY, USA, 2007. ACM

MMM 2014

#25


Graph-based multimodal clustering for social event detection in large collections of images

More Related Content

Similar to Graph-based multimodal clustering for social event detection in large collections of images (20)

More from Symeon Papadopoulos (20)

Recently uploaded (20)

Graph-based multimodal clustering for social event detection in large collections of images