Cluster-based Landmark and Event Detection
on Tagged Photo Collections
Symeon Papadopoulos, Christos Zigkolis,
Yiannis Kompatsiaris, Athena Vakali
user generated content creates new opportunities
real-world depicted in users’ online collections
potential for many insights into what people see, do and like
need new tools for content organization
image clustering
clusters → landmarks + events
[figure: example photo clusters labelled landmark and event]
the framework
photos + tags + geo
overview
[figure: four numbered steps (1 to 4) arranged around example clusters labelled landmark, landmark and event]
step 1: create photo similarity graph
visual similarity: SIFT, SURF
tag similarity: co-occurrence, latent semantic indexing
example tags: casa mila, la pedrera
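
As a rough illustration of the tag co-occurrence side of step 1, the sketch below links two photos whenever they share enough tags. The function name `build_tag_graph`, the `min_shared` threshold and the sample photos are assumptions made for this example; the slides do not state the actual edge criterion, and latent semantic indexing is not covered here.

```python
from collections import defaultdict
from itertools import combinations


def build_tag_graph(photo_tags, min_shared=2):
    """Return {photo_id: set(neighbour_ids)}.

    Assumption: an edge links two photos that share at least
    `min_shared` distinct tags.
    """
    # Inverted index: tag -> photos carrying that tag.
    index = defaultdict(set)
    for photo_id, tags in photo_tags.items():
        for tag in set(tags):
            index[tag].add(photo_id)

    # Count shared tags for every pair of photos that co-occur under a tag.
    shared = defaultdict(int)
    for photo_ids in index.values():
        for a, b in combinations(sorted(photo_ids), 2):
            shared[(a, b)] += 1

    graph = {photo_id: set() for photo_id in photo_tags}
    for (a, b), count in shared.items():
        if count >= min_shared:
            graph[a].add(b)
            graph[b].add(a)
    return graph


# Hypothetical sample data.
photos = {
    "p1": ["casa mila", "la pedrera", "barcelona"],
    "p2": ["la pedrera", "casa mila", "gaudi"],
    "p3": ["barcelona", "beach"],
}
graph = build_tag_graph(photos)
```

Here `p1` and `p2` share two tags and become neighbours, while `p3` shares only one tag with `p1` and stays isolated.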
step 2: use graph to cluster the photos
the concept of node structure
neighborhood of node v + node itself = structure of node v
N(v) ∪ {v} = Γ(v)
the concept of structural similarity (1)
structural similarity between nodes v and u:
σ(v, u) = |Γ(v) ∩ Γ(u)| / √(|Γ(v)| · |Γ(u)|)
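
A minimal Python sketch of structural similarity follows. It assumes the denominator is the geometric mean of the two structure sizes, as in the SCAN graph clustering algorithm; the toy graph and the function names are made up for illustration.

```python
import math


def structure(graph, v):
    """Γ(v): the neighbourhood N(v) of node v plus v itself."""
    return graph[v] | {v}


def structural_similarity(graph, v, u):
    """|Γ(v) ∩ Γ(u)| / sqrt(|Γ(v)| * |Γ(u)|), assuming SCAN-style normalisation."""
    gamma_v = structure(graph, v)
    gamma_u = structure(graph, u)
    return len(gamma_v & gamma_u) / math.sqrt(len(gamma_v) * len(gamma_u))


# Hypothetical graph: A, B, C form a triangle; D hangs off C.
graph = {
    "A": {"B", "C"},
    "B": {"A", "C"},
    "C": {"A", "B", "D"},
    "D": {"C"},
}
sim_ab = structural_similarity(graph, "A", "B")
sim_cd = structural_similarity(graph, "C", "D")
```

In this toy graph A and B have identical structures, so their similarity is 1.0, while C and D share only two of their structure nodes and score lower.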
the concept of structural similarity (2)
[figure: example graph with nodes A, B and C, photo cluster 1 and photo cluster 2, annotated with high and low structural similarity]
complexity
graph-based clustering: O(k_m · m), where k_m = average node degree, m = # edges
k-means clustering: O(I · C · n · D), where I = # iterations, C = # clusters, n = # nodes, D = # dimensions
hierarchical agglomerative clustering: O(n² · log n)
step 3: detect landmarks & events
baseline features (Quack et al., CIVR 2008): duration, #users / #photos
[figure: clusters plotted by duration against #users / #photos; example points: [1 day, 2 users / 10 photos] and [2 years, 50 users / 120 photos]]
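
The two baseline features can be computed per cluster roughly as follows. This is a sketch under assumptions: the input layout (a list of `(user_id, taken_at)` pairs), the function name `baseline_features` and the sample cluster are hypothetical, and the slides do not say how the features are turned into a landmark or event decision.

```python
from datetime import datetime


def baseline_features(photos):
    """photos: list of (user_id, taken_at) pairs for one cluster (assumed layout)."""
    times = [taken_at for _, taken_at in photos]
    users = {user for user, _ in photos}
    # Time span covered by the cluster, in days.
    duration_days = (max(times) - min(times)).total_seconds() / 86400
    return {
        "duration_days": duration_days,
        "users_per_photo": len(users) / len(photos),
    }


# Hypothetical cluster: two users, four photos, one evening.
cluster = [
    ("u1", datetime(2010, 6, 5, 20, 0)),
    ("u1", datetime(2010, 6, 5, 21, 30)),
    ("u2", datetime(2010, 6, 5, 22, 0)),
    ("u2", datetime(2010, 6, 5, 23, 0)),
]
features = baseline_features(cluster)
```

For comparison, the slide's two example points give ratios of 2 / 10 = 0.2 and 50 / 120 ≈ 0.42.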
additional features: Landmark Tags, Event Tags
step 4: post-process landmark clusters
cluster merging based on proximity
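
One way to sketch proximity-based merging is to group clusters whose centres lie within a distance threshold of each other. The 50 m threshold, the use of cluster centres, the transitive grouping and the coordinates below are all assumptions for illustration; the slides do not give the actual merging rule.

```python
import math


def haversine_m(a, b):
    """Great-circle distance in metres between two (lat, lon) points in degrees."""
    lat1, lon1, lat2, lon2 = map(math.radians, (a[0], a[1], b[0], b[1]))
    h = (math.sin((lat2 - lat1) / 2) ** 2
         + math.cos(lat1) * math.cos(lat2) * math.sin((lon2 - lon1) / 2) ** 2)
    return 2 * 6371000 * math.asin(math.sqrt(h))


def merge_by_proximity(centers, max_dist_m=50.0):
    """centers: {cluster_id: (lat, lon)}. Returns groups of cluster ids to merge."""
    parent = {c: c for c in centers}

    def find(c):
        # Union-find lookup with path halving.
        while parent[c] != c:
            parent[c] = parent[parent[c]]
            c = parent[c]
        return c

    ids = sorted(centers)
    for i, a in enumerate(ids):
        for b in ids[i + 1:]:
            if haversine_m(centers[a], centers[b]) <= max_dist_m:
                parent[find(a)] = find(b)

    groups = {}
    for c in ids:
        groups.setdefault(find(c), set()).add(c)
    return sorted(groups.values(), key=sorted)


# Illustrative coordinates, not taken from the slides.
centers = {
    "c1": (41.4036, 2.1744),
    "c2": (41.4037, 2.1745),
    "c3": (41.3809, 2.1228),
}
merged = merge_by_proximity(centers)
```

The first two centres are roughly 14 m apart and end up in one group; the third is several kilometres away and stays on its own.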
cluster tag filtering
CLUSTER TAGS: helado, tropical, barcelona, cielos, spain, field, park güell, jaume oller, park, sclupture, el beso
[figure annotations: low frequency tags, generic tags]
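
Tag filtering of this kind can be sketched as dropping tags that appear in too few photos of the cluster and tags on a generic-term list. The `min_count` threshold, the contents of `generic_tags` and the sample photos are assumptions; the slides do not say how generic tags are identified.

```python
from collections import Counter


def filter_cluster_tags(photo_tags, generic_tags, min_count=2):
    """Keep tags used by at least `min_count` photos that are not generic.

    photo_tags: list of tag lists, one per photo in the cluster.
    """
    # Count each tag once per photo.
    counts = Counter(tag for tags in photo_tags for tag in set(tags))
    return [
        tag for tag, count in counts.most_common()
        if count >= min_count and tag not in generic_tags
    ]


# Hypothetical cluster and generic-tag list.
cluster_photos = [
    ["park güell", "barcelona", "spain"],
    ["park güell", "barcelona", "helado"],
    ["park güell", "park", "spain"],
]
kept = filter_cluster_tags(cluster_photos, generic_tags={"barcelona", "spain"})
```

In this example only "park güell" survives: "helado" and "park" are too rare, and "barcelona" and "spain" are on the generic list.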
results
207,750 photos
7,768 users
33,959 unique tags

compare graph-based vs. k-means clustering: user study, geospatial coherence
[figure: examples of high geospatial coherence and low geospatial coherence]
user study
                             VISUAL
                 precision    recall   κ-statistic

   graph-based    1.000       0.110      1.000

   k-means        0.806       0.324      0.226


                              TAG
                 precision    recall   κ-statistic

   graph-based    0.950       0.182      0.820

   k-means        0.848       0.307      0.564
geospatial coherence
                                  VISUAL
                              radius     std. deviation

                graph-based   357 m          1.18 km

                k-means       2.4 km         1.73 km

                                       TAG

                graph-based   456 m          1.15 km

                k-means       767 m          1.76 km
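
The slides do not define how the cluster radius is measured. As one plausible reading, the sketch below takes the mean distance of a cluster's geotagged photos from their centroid, using a flat-earth approximation that is adequate at city scale. The function name and the sample points are hypothetical.

```python
import math

METRES_PER_DEGREE = 111_320.0  # approximate length of one degree of latitude


def cluster_radius_m(points):
    """Mean distance in metres of (lat, lon) points from their centroid.

    Assumption: "radius" means average distance to the centroid.
    """
    lat0 = sum(lat for lat, _ in points) / len(points)
    lon0 = sum(lon for _, lon in points) / len(points)
    # Longitude degrees shrink with the cosine of the latitude.
    lon_scale = METRES_PER_DEGREE * math.cos(math.radians(lat0))
    distances = [
        math.hypot((lat - lat0) * METRES_PER_DEGREE, (lon - lon0) * lon_scale)
        for lat, lon in points
    ]
    return sum(distances) / len(distances)


# Two hypothetical photos 0.001 degrees of latitude apart.
radius = cluster_radius_m([(41.000, 2.0), (41.001, 2.0)])
```

A smaller radius indicates that the photos of a cluster were taken close together, which is what the table reports as higher geospatial coherence.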
classification performance
16% - 23% improvement thanks to tag features
landmark localization accuracy
sagrada familia, cathedral, catholic   15.2 m
la pedrera, casa mila                  31.8 m
parc guell                              9.6 m
boqueria, market, mercado, ramblas     82.1 m
camp nou, fc barcelona, nou camp       18.7 m
event category composition
music, concert, gigs, dj       43.1%
conference, presentation        6.5%
local traditional, parades      4.6%
racing, motorbikes, f1          3.3%
clusttour




                www.clusttour.gr

twitter.com/clusttour       facebook.com/clusttour
