SlideShare a Scribd company logo
IMAGE TAG REFINEMENT ALONG THE ‘WHAT’ DIMENSION USING TAG CATEGORIZATION AND NEIGHBOR VOTING IEEE International Conference on Multimedia & Expo Singapore – July 19-23, 2010 Sihyoung Lee , Wesley De Neve, Yong Man Ro Image and Video Systems Lab Department of Electrical Engineering Korea Advanced Institute of Science and Technology (KAIST) email: ijiat@kaist.ac.kr
Outline Introduction Image tag refinement Experiments Conclusions
Outline Introduction Image tag refinement Experiments Conclusions
Introduction Social media applications allow users to share and manually annotate photos Flickr hosts more than 4 billion photos as of Oct. 2009 more than 2.5 billion photos are uploaded to Facebook each month as of Jan. 2009 Image folksonomies collections of user-provided images annotated with user-supplied tags user-supplied tags are used by most search engines to retrieve images
Question Can we  trust all image tags  in an image folksonomy?
Noisy Tags What are noisy tags? tags that are irrelevant to the image content meaningless tags (e.g., `asdf’ and `grrr’), tags containing typographical errors, imprecise tags, and incorrect tags Why are noisy tags present? people tend to interpret an image by using their personal experience, knowledge, and imagination people tend to assign the same tags to different images that have been taken at the same place or during the same event (i.e., batch tagging)
Problem Statement Need for identifying correct tags and noisy tags in order to boost the effectiveness of image retrieval
Outline Introduction Image tag refinement Experiments Conclusions
Neighbor Voting  (1/3) De facto standard for tag relevance learning X. Li  et al. , “ Learning social tag relevance by neighbor voting ”, IEEE ToM, 11(7), pp. 1310-1322, Nov. 2009 Assumption if different people label visually similar images using  the same tags, then these tags are likely to reflect  objective aspects of the visual image content Algorithm follows a two-step approach retrieval of visual neighbors tag relevance learning by accumulating votes from  the tags assigned to the visual neighbors
Neighbor Voting  (2/3) input image tag relevance learning [Figure adopted from X. Li  et al. ] bridge bicycle perfect MyWinners robyn fishing me bristol court 1 number Sydney bridge Australia architecture bridge tranquil bruges trees NikonE3100 Sydney bridge SuperShot clouds a5PhotosaDay ireland irlanda ingiro northireland irlandadelnord connemara Sweden bridge lake APlusPhot SuperAPlus image folksonomy retrieval of visual neighbors bridge 4 bicycle 0 perfect 0 MyWinners 0
Neighbor Voting  (3/3) Disadvantage: “a one size fits all” approach the same technique is applied to all tags, regardless of the type of these tags How about (additionally) taking into account location information (e.g., GPS information)? time information (available in Exif metadata)? Idea take into account the type of tags in order to allow for more specialized tag relevance learning techniques
Previous Tag Refinement Image folksonomy refined tags Retrieval of visual neighbors Voting Neighbor voting algorithm nikon, leaves, jw, east, clouds, coast, ground, rita, red, rain, newengland, pretty, yellow, summer, portrait, pink, macro, grass, flowers,  etc …
Proposed Tag Refinement Tag categorization WordNet refined tags along the  what  dimension Visual information-based refinement GPS-based refinement  where refined tags along the  where  dimension Time-based refinement  when refined tags along the  when  dimension Refinement using affective content analysis how refined tags along the  how  dimension who refined tags along the  who  dimension Face recognition-based refinement  what Neighbor voting algorithm Retrieval of visual neighbors Image folksonomy Scope of this paper: tag refinement along  the ‘what’ dimension Voting nikon, leaves, jw, east, clouds, coast, ground, rita, red, rain, newengland, pretty, yellow, summer, portrait, pink, macro, grass, flowers,  etc …
Tag Categorization Tags assigned to an image are mapped to the WordNet noun semantic categories The WordNet noun semantic categories are mapped onto five intuitive categories that are relevant for users what ,  when ,  who ,  where , and  how WordNet semantic noun categories Our categories animal, artifact, attribute, body, food, object, phenomenon, plant, shape, something, substance what location, space where person, group who time, event when act, cognition, communication, feeling, motive, possession, process, quantity, relation, state how
Outline Introduction Image tag refinement Experiments Conclusions
Experimental Setup Image set used: MIRFLICKR-25000 images were annotated with 223,537 tags by 9,862 users concept vocabulary of 68,004 unique tags test set 200 images from MIRFLICKR25000 annotated with at least four tags along the  what  dimension Image descriptor MPEG-7 Scalable Color (256-D vector) L 1  distance is adopted to measure the visual distance
Evaluation Metrics Image tag refinement Noise Level ( NL ) NL  denotes the ratio of noisy tags to  all tags in an image folksonomy Image tag recommendation Precision at rank  k  ( [email_address] ) [email_address]  denotes the ratio of recommended tags that are relevant, averaged over all photos
Results for Tag Categorization Most frequent WordNet categories for the user-supplied tags in MIRFLICKR-25000 46% 54% 22% 17% 12% 7% 5% 37% 46% 19% 17% 12% 6%
Objective Performance of Tag Refinement  (1/2) Before tag refinement After refinement without categorization with categorization NL 0.774 0.466 0.249 proposed tag refinement removes more noisy tag assignments than tag refinement only making use of neighbor voting
Objective Performance of Tag Refinement  (2/2) before tag refinement after tag refinement without tag categorization after tag refinement with tag categorization
Subjective Performance of Tag Refinement Image Tags Before tag refinement After tag refinement without categorization with categorization bc,  beach , british, cacade, canada, casio,  cloud ,  clouds , columbia, exf1, firstquality, forest, fpg, fun, hike, hiking,  island , juandefuca,  lagoon , lighthouse, metchosin,  mountain ,  mountains ,  ocean , picnic, play, range,  sand , state, trail, vancouver, victoria, washington,  water ,  wave ,  waves , witty, wittys clouds ,  beach ,  water ,  ocean ,  cloud ,  sand ,  mountains ,  island ,  wave ,  mountain , canada, washington, forest, trail,  waves , hiking clouds ,  beach ,  water ,  ocean ,  cloud ,  sand ,  mountains ,  island ,  mountain , trail
Objective Performance of Image Tag Recommendation Original folksonomy Refined folksonomy without categorization with categorization [email_address] 0.285 0.505 0.680 [email_address] 0.315 0.330 0.405 the  P@1 and P@5 values demonstrate that the proposed tag refinement technique improves the effectiveness of image tag recommendation for non-tagged images
Subjective Performance of Image Tag Recommendation Image Recommended tags Original folksonomy Refined folksonomy without categorization with categorization explore,  sky ,  blue , water, nature,  yellow ,  clouds , nikon, canon, 2007 sky , explore,  blue , nature,  yellow , water, canon,  clouds ,  white , geotagged sky ,  blue , nature,  yellow , water,  clouds ,  building ,  architecture , beach, snow explore,  nature ,  green ,  sky ,  blue , macro, water, nikon,  flower ,  clouds nature ,  green ,  sky ,  blue , explore, macro,  yellow , canon,  flower , water nature ,  green ,  sky ,  blue ,  yellow ,  flower , water,  clouds , bird,  grass sky , explore,  blue ,  clouds ,  nature , water, landscape, nikon, canon, geotagged sky ,  blue ,  clouds ,  nature , explore, water, landscape, hdr,  yellow , canon sky ,  blue ,  clouds ,  nature , water,  yellow , snow,  bird , lake, beach
Outline Introduction Image tag refinement Experiments Conclusions
Conclusions Proposed a tag refinement strategy that makes use of tag categorization and neighbor voting allows taking into account other information sources for tag relevance learning, besides visual information dependent on the nature of the tags Presented experimental results focusing on tag refinement along the  what  dimension Future research design of specialized tag refinement techniques that are able to operate along other dimensions
Thank you! Any questions?

More Related Content

PDF
Face annotation for personal photos using collaborative face recognition in o...
PPT
Background Information & Suggestions for Joint Research Topics IVY Lab & MMLab
PDF
Sparse representation based human action recognition using an action region-a...
PDF
Vine Measurement Study
PDF
Challenges and requirements for a next generation service for video content s...
PPTX
Sub-sampled dictionaries for coarse-to-fine sparse representation-based human...
PPT
Towards data driven estimation of image tag relevance using visually similar ...
PDF
Het machine verstaanbaar boek - de volle kracht van metadata
Face annotation for personal photos using collaborative face recognition in o...
Background Information & Suggestions for Joint Research Topics IVY Lab & MMLab
Sparse representation based human action recognition using an action region-a...
Vine Measurement Study
Challenges and requirements for a next generation service for video content s...
Sub-sampled dictionaries for coarse-to-fine sparse representation-based human...
Towards data driven estimation of image tag relevance using visually similar ...
Het machine verstaanbaar boek - de volle kracht van metadata

Similar to Image Tag Refinement Along the 'What' Dimension using Tag Categorization and Neighbor Voting (20)

PPT
Semantics In Digital Photos A Contenxtual Analysis
PDF
Tag And Tag Based Recommender
PDF
A Graph-based Web Image Annotation for Large Scale Image Retrieval
PDF
Enhanced Watemarked Images by Various Attacks Based on DWT with Differential ...
PPTX
Object Based Image Analysis
PDF
Personalized geo tag recommendation for community contributed images
PPTX
Near-Duplicate Video Retrieval by Aggregating Intermediate CNN Layers
PPT
On the Influence Propagation of Web Videos
PDF
IRJET- Fusion Method for Image Reranking and Similarity Finding based on Topi...
PPTX
Object Detection on Dental X-ray Images using R-CNN
PDF
IRJET- Comparative Analysis of MSE and PSNR Q-Factor with Cameraman, Lena and...
PDF
IRJET- Comparison and Simulation based Analysis of an Optimized Block Mat...
PDF
How might machine learning help advance solar PV research?
PDF
IRJET - Underwater Object Identification using Matlab and Machine
PDF
Slides on Photosynth.net, from my MSc at Imperial
PPT
ITK Tutorial Presentation Slides-946
PPTX
Placing Images with Refined Language Models and Similarity Search with PCA-re...
PDF
IRJET-A Review on User Personalized Tag Based Image Search by Tag Relevance
PDF
A Review on User Personalized Tag Based Image Search by Tag Relevance
PDF
A Review on Deformation Measurement from Speckle Patterns using Digital Image...
Semantics In Digital Photos A Contenxtual Analysis
Tag And Tag Based Recommender
A Graph-based Web Image Annotation for Large Scale Image Retrieval
Enhanced Watemarked Images by Various Attacks Based on DWT with Differential ...
Object Based Image Analysis
Personalized geo tag recommendation for community contributed images
Near-Duplicate Video Retrieval by Aggregating Intermediate CNN Layers
On the Influence Propagation of Web Videos
IRJET- Fusion Method for Image Reranking and Similarity Finding based on Topi...
Object Detection on Dental X-ray Images using R-CNN
IRJET- Comparative Analysis of MSE and PSNR Q-Factor with Cameraman, Lena and...
IRJET- Comparison and Simulation based Analysis of an Optimized Block Mat...
How might machine learning help advance solar PV research?
IRJET - Underwater Object Identification using Matlab and Machine
Slides on Photosynth.net, from my MSc at Imperial
ITK Tutorial Presentation Slides-946
Placing Images with Refined Language Models and Similarity Search with PCA-re...
IRJET-A Review on User Personalized Tag Based Image Search by Tag Relevance
A Review on User Personalized Tag Based Image Search by Tag Relevance
A Review on Deformation Measurement from Speckle Patterns using Digital Image...
Ad

More from Ghent University Global Campus (20)

PPTX
Ghent University Global Campus: Overview
DOCX
GUGC 10th Anniversary Celebration - Progress Report - Celebrating a Decade of...
PPTX
Introduction to the Center for Biosystems and Biotech Data Science at Ghent U...
PDF
GUGC Research Overview (December 2024)
PDF
Towards diagnosis of rotator cuff tears in 3-D MRI using 3-D convolutional ne...
PDF
Investigating the biological relevance in trained embedding representations o...
PDF
Impact of adversarial examples on deep learning models for biomedical image s...
PPTX
Learning Biologically Relevant Features Using Convolutional Neural Networks f...
PPTX
The 5th Aslla Symposium
PPTX
Ghent University Global Campus 101
PDF
Booklet for the First GUGC Research Symposium
PDF
Center for Biotech Data Science at Ghent University Global Campus
PDF
Center for Biotech Data Science at Ghent University Global Campus
PPTX
Learning biologically relevant features using convolutional neural networks f...
PPTX
Towards reading genomic data using deep learning-driven NLP techniques
PPTX
Deep Machine Learning for Making Sense of Biotech Data - From Clean Energy to...
PPTX
GUGC Info Session - Informatics and Bioinformatics
PPTX
Ghent University Global Campus - Sungkyunkwan University: Workshop on Researc...
PPTX
Ghent University and GUGC-K: Overview of Teaching and Research Activities
PDF
Biotech Data Science @ GUGC in Korea: Deep Learning for Prediction of Drug-Ta...
Ghent University Global Campus: Overview
GUGC 10th Anniversary Celebration - Progress Report - Celebrating a Decade of...
Introduction to the Center for Biosystems and Biotech Data Science at Ghent U...
GUGC Research Overview (December 2024)
Towards diagnosis of rotator cuff tears in 3-D MRI using 3-D convolutional ne...
Investigating the biological relevance in trained embedding representations o...
Impact of adversarial examples on deep learning models for biomedical image s...
Learning Biologically Relevant Features Using Convolutional Neural Networks f...
The 5th Aslla Symposium
Ghent University Global Campus 101
Booklet for the First GUGC Research Symposium
Center for Biotech Data Science at Ghent University Global Campus
Center for Biotech Data Science at Ghent University Global Campus
Learning biologically relevant features using convolutional neural networks f...
Towards reading genomic data using deep learning-driven NLP techniques
Deep Machine Learning for Making Sense of Biotech Data - From Clean Energy to...
GUGC Info Session - Informatics and Bioinformatics
Ghent University Global Campus - Sungkyunkwan University: Workshop on Researc...
Ghent University and GUGC-K: Overview of Teaching and Research Activities
Biotech Data Science @ GUGC in Korea: Deep Learning for Prediction of Drug-Ta...
Ad

Recently uploaded (20)

PDF
Architecture types and enterprise applications.pdf
PDF
How ambidextrous entrepreneurial leaders react to the artificial intelligence...
PDF
Microsoft Solutions Partner Drive Digital Transformation with D365.pdf
PPT
Module 1.ppt Iot fundamentals and Architecture
PDF
Getting started with AI Agents and Multi-Agent Systems
PDF
NewMind AI Weekly Chronicles - August'25-Week II
PDF
Zenith AI: Advanced Artificial Intelligence
PPTX
1. Introduction to Computer Programming.pptx
PDF
Getting Started with Data Integration: FME Form 101
PDF
WOOl fibre morphology and structure.pdf for textiles
PPTX
O2C Customer Invoices to Receipt V15A.pptx
PPTX
Chapter 5: Probability Theory and Statistics
PDF
ENT215_Completing-a-large-scale-migration-and-modernization-with-AWS.pdf
PDF
Web App vs Mobile App What Should You Build First.pdf
PDF
Enhancing emotion recognition model for a student engagement use case through...
PDF
project resource management chapter-09.pdf
PDF
Developing a website for English-speaking practice to English as a foreign la...
PPTX
Tartificialntelligence_presentation.pptx
PPTX
observCloud-Native Containerability and monitoring.pptx
PDF
From MVP to Full-Scale Product A Startup’s Software Journey.pdf
Architecture types and enterprise applications.pdf
How ambidextrous entrepreneurial leaders react to the artificial intelligence...
Microsoft Solutions Partner Drive Digital Transformation with D365.pdf
Module 1.ppt Iot fundamentals and Architecture
Getting started with AI Agents and Multi-Agent Systems
NewMind AI Weekly Chronicles - August'25-Week II
Zenith AI: Advanced Artificial Intelligence
1. Introduction to Computer Programming.pptx
Getting Started with Data Integration: FME Form 101
WOOl fibre morphology and structure.pdf for textiles
O2C Customer Invoices to Receipt V15A.pptx
Chapter 5: Probability Theory and Statistics
ENT215_Completing-a-large-scale-migration-and-modernization-with-AWS.pdf
Web App vs Mobile App What Should You Build First.pdf
Enhancing emotion recognition model for a student engagement use case through...
project resource management chapter-09.pdf
Developing a website for English-speaking practice to English as a foreign la...
Tartificialntelligence_presentation.pptx
observCloud-Native Containerability and monitoring.pptx
From MVP to Full-Scale Product A Startup’s Software Journey.pdf

Image Tag Refinement Along the 'What' Dimension using Tag Categorization and Neighbor Voting

  • 1. IMAGE TAG REFINEMENT ALONG THE ‘WHAT’ DIMENSION USING TAG CATEGORIZATION AND NEIGHBOR VOTING IEEE International Conference on Multimedia & Expo Singapore – July 19-23, 2010 Sihyoung Lee , Wesley De Neve, Yong Man Ro Image and Video Systems Lab Department of Electrical Engineering Korea Advanced Institute of Science and Technology (KAIST) email: ijiat@kaist.ac.kr
  • 2. Outline Introduction Image tag refinement Experiments Conclusions
  • 3. Outline Introduction Image tag refinement Experiments Conclusions
  • 4. Introduction Social media applications allow users to share and manually annotate photos Flickr hosts more than 4 billion photos as of Oct. 2009 more than 2.5 billion photos are uploaded to Facebook each month as of Jan. 2009 Image folksonomies collections of user-provided images annotated with user-supplied tags user-supplied tags are used by most search engines to retrieve images
  • 5. Question Can we trust all image tags in an image folksonomy?
  • 6. Noisy Tags What are noisy tags? tags that are irrelevant to the image content meaningless tags (e.g., `asdf’ and `grrr’), tags containing typographical errors, imprecise tags, and incorrect tags Why are noisy tags present? people tend to interpret an image by using their personal experience, knowledge, and imagination people tend to assign the same tags to different images that have been taken at the same place or during the same event (i.e., batch tagging)
  • 7. Problem Statement Need for identifying correct tags and noisy tags in order to boost the effectiveness of image retrieval
  • 8. Outline Introduction Image tag refinement Experiments Conclusions
  • 9. Neighbor Voting (1/3) De facto standard for tag relevance learning X. Li et al. , “ Learning social tag relevance by neighbor voting ”, IEEE ToM, 11(7), pp. 1310-1322, Nov. 2009 Assumption if different people label visually similar images using the same tags, then these tags are likely to reflect objective aspects of the visual image content Algorithm follows a two-step approach retrieval of visual neighbors tag relevance learning by accumulating votes from the tags assigned to the visual neighbors
  • 10. Neighbor Voting (2/3) input image tag relevance learning [Figure adopted from X. Li et al. ] bridge bicycle perfect MyWinners robyn fishing me bristol court 1 number Sydney bridge Australia architecture bridge tranquil bruges trees NikonE3100 Sydney bridge SuperShot clouds a5PhotosaDay ireland irlanda ingiro northireland irlandadelnord connemara Sweden bridge lake APlusPhot SuperAPlus image folksonomy retrieval of visual neighbors bridge 4 bicycle 0 perfect 0 MyWinners 0
  • 11. Neighbor Voting (3/3) Disadvantage: “a one size fits all” approach the same technique is applied to all tags, regardless of the type of these tags How about (additionally) taking into account location information (e.g., GPS information)? time information (available in Exif metadata)? Idea take into account the type of tags in order to allow for more specialized tag relevance learning techniques
  • 12. Previous Tag Refinement Image folksonomy refined tags Retrieval of visual neighbors Voting Neighbor voting algorithm nikon, leaves, jw, east, clouds, coast, ground, rita, red, rain, newengland, pretty, yellow, summer, portrait, pink, macro, grass, flowers, etc …
  • 13. Proposed Tag Refinement Tag categorization WordNet refined tags along the what dimension Visual information-based refinement GPS-based refinement where refined tags along the where dimension Time-based refinement when refined tags along the when dimension Refinement using affective content analysis how refined tags along the how dimension who refined tags along the who dimension Face recognition-based refinement what Neighbor voting algorithm Retrieval of visual neighbors Image folksonomy Scope of this paper: tag refinement along the ‘what’ dimension Voting nikon, leaves, jw, east, clouds, coast, ground, rita, red, rain, newengland, pretty, yellow, summer, portrait, pink, macro, grass, flowers, etc …
  • 14. Tag Categorization Tags assigned to an image are mapped to the WordNet noun semantic categories The WordNet noun semantic categories are mapped onto five intuitive categories that are relevant for users what , when , who , where , and how WordNet semantic noun categories Our categories animal, artifact, attribute, body, food, object, phenomenon, plant, shape, something, substance what location, space where person, group who time, event when act, cognition, communication, feeling, motive, possession, process, quantity, relation, state how
  • 15. Outline Introduction Image tag refinement Experiments Conclusions
  • 16. Experimental Setup Image set used: MIRFLICKR-25000 images were annotated with 223,537 tags by 9,862 users concept vocabulary of 68,004 unique tags test set 200 images from MIRFLICKR25000 annotated with at least four tags along the what dimension Image descriptor MPEG-7 Scalable Color (256-D vector) L 1 distance is adopted to measure the visual distance
  • 17. Evaluation Metrics Image tag refinement Noise Level ( NL ) NL denotes the ratio of noisy tags to all tags in an image folksonomy Image tag recommendation Precision at rank k ( [email_address] ) [email_address] denotes the ratio of recommended tags that are relevant, averaged over all photos
  • 18. Results for Tag Categorization Most frequent WordNet categories for the user-supplied tags in MIRFLICKR-25000 46% 54% 22% 17% 12% 7% 5% 37% 46% 19% 17% 12% 6%
  • 19. Objective Performance of Tag Refinement (1/2) Before tag refinement After refinement without categorization with categorization NL 0.774 0.466 0.249 proposed tag refinement removes more noisy tag assignments than tag refinement only making use of neighbor voting
  • 20. Objective Performance of Tag Refinement (2/2) before tag refinement after tag refinement without tag categorization after tag refinement with tag categorization
  • 21. Subjective Performance of Tag Refinement Image Tags Before tag refinement After tag refinement without categorization with categorization bc, beach , british, cacade, canada, casio, cloud , clouds , columbia, exf1, firstquality, forest, fpg, fun, hike, hiking, island , juandefuca, lagoon , lighthouse, metchosin, mountain , mountains , ocean , picnic, play, range, sand , state, trail, vancouver, victoria, washington, water , wave , waves , witty, wittys clouds , beach , water , ocean , cloud , sand , mountains , island , wave , mountain , canada, washington, forest, trail, waves , hiking clouds , beach , water , ocean , cloud , sand , mountains , island , mountain , trail
  • 22. Objective Performance of Image Tag Recommendation Original folksonomy Refined folksonomy without categorization with categorization [email_address] 0.285 0.505 0.680 [email_address] 0.315 0.330 0.405 the P@1 and P@5 values demonstrate that the proposed tag refinement technique improves the effectiveness of image tag recommendation for non-tagged images
  • 23. Subjective Performance of Image Tag Recommendation Image Recommended tags Original folksonomy Refined folksonomy without categorization with categorization explore, sky , blue , water, nature, yellow , clouds , nikon, canon, 2007 sky , explore, blue , nature, yellow , water, canon, clouds , white , geotagged sky , blue , nature, yellow , water, clouds , building , architecture , beach, snow explore, nature , green , sky , blue , macro, water, nikon, flower , clouds nature , green , sky , blue , explore, macro, yellow , canon, flower , water nature , green , sky , blue , yellow , flower , water, clouds , bird, grass sky , explore, blue , clouds , nature , water, landscape, nikon, canon, geotagged sky , blue , clouds , nature , explore, water, landscape, hdr, yellow , canon sky , blue , clouds , nature , water, yellow , snow, bird , lake, beach
  • 24. Outline Introduction Image tag refinement Experiments Conclusions
  • 25. Conclusions Proposed a tag refinement strategy that makes use of tag categorization and neighbor voting allows taking into account other information sources for tag relevance learning, besides visual information dependent on the nature of the tags Presented experimental results focusing on tag refinement along the what dimension Future research design of specialized tag refinement techniques that are able to operate along other dimensions
  • 26. Thank you! Any questions?