SlideShare a Scribd company logo
Combining Multimedia and SemanticsOscar Corcho (ocorcho@fi.upm.es)Universidad Politécnica de Madridhttp://www.oeg-upm.net/LACNEM 2010, Cali, ColombiaSeptember 9th 2010Credits: Adrián Siles, Mariano Rico, Víctor Méndez, Hector Andrés García-Silva, María del Carmen Suárez-Figueroa, Ghislain Atemezing, Raphaël TroncyWorkdistributedunderthelicenseCreativeCommonsAttribution-Noncommercial-Share Alike3.0http://www.slideshare.net/ocorcho
2Asunción Gómez PérezOntologyEngineering Group. Whomwe areDirector: A. Gómez-PérezResearch Group (37 people)2 Full Professor4 AssociateProfessors1 AssistantProfessor3 Postdocs17 PhD Students8 MScStudents2 Software Engineers Management (4 people)2 Project Managers1 SystemAdministrator1 Secretary 50+ PastCollaborators 10+ visitors
Research Areas20042008199519972000
Beforewestart…Howmany of youhaveeverheardabouttheword “Ontology”?And howmany of you do actuallyknowwhatitmeans?4
Comingtotermswithontologies and semanticsAn ontology is an engineering artifact, which provides: A vocabulary of termsA set of explicit assumptions regarding the intended meaning of the vocabulary. Almost always including concepts and their classificationAlmost always including properties between conceptsShared understanding of a domain of interest Agreement on the meaning of termsFormal and machine manipulable model of a domain of interestBesides...The meaning (semantics) of such terms is formally specifiedNew terms can be formed by combining existing onesCan also specify relationships between terms in multiple ontologies5
Example: Anontologyaboutsatellites6
OutlineIntroductionWhat I willbetalkingabout and what I willnot…Therewereseveraloptionsthat I exploredbeforeselectingtheonethatyouwillbehearing in thistalk…7
Option 1: The Semantic GapThe lack of coincidencebetweentheinformationthatone can extractfromthesensory data and theinterpretationthatthesame data has for a user in a givensituation8A.W.M. Smeulders, M. Worring, S. Santini, A. Gupta, R. Jain: Content-based image retrieval at the end of the early years, IEEE PAMI, 1349–1380, 2000.However, I alreadyassumedthatEbroulwouldbetalking a lotaboutit in hisopeningkeynote (as he did). Besides, I havenotworked at allonthelow-levelpart, so itmaybedifficultfor me toprovideyouwith a goodinsightonthe (many) open problems in thisarea
Option 2: MPEG-7 and the Semantic WebISO standard since December 2001Main components:Descriptors (Ds) and Description Schemes (DSs)DDL (XML Schema + extensions)Concern all types of mediaA good number of ontologies developed around it9Part 5 – MDSMultimedia Description Schemes
Option 2: MPEG-7 and the Semantic WebHowever, thetalkmay:	Be a bit boring and tootechnicalMaylackthemix of state of the art and visionthataninvitedtalkshouldnormallyhave	And MPEG-7 isnotusedtoomuch…So I willcoveronlysomeaspects of thislater, when I talkabout multimedia ontologies.10
Option 3: Canonical Processes of Media Production (and semantics, obviously)For example….http://www.cewe-photobook.comApplication for authoring digital photo booksAutomatic selection, sorting and ordering of photosContext analysis methods: timestamp, annotation, etc.Content analysis methods: color histograms, edge detection, etc.Customized layout and backgroundPrint by the European leader photo finisher company11Credits: Raphaël Troncy, LyndaHardman
CeWe Color PhotoBook ProcessesMy winter ski holidays with my friendsCredits: Raphaël Troncy, LyndaHardman
CeWe Color PhotoBook ProcessesCredits: Raphaël Troncy, LyndaHardman
CeWe Color PhotoBook ProcessesCredits: Raphaël Troncy, LyndaHardman
CeWe Color PhotoBook ProcessesCredits: Raphaël Troncy, LyndaHardman
CeWe Color PhotoBook ProcessesCredits: Raphaël Troncy, LyndaHardman
CeWe Color PhotoBook ProcessesCredits: Raphaël Troncy, LyndaHardman
Semantics can be important in the processCredits: Raphaël Troncy, LyndaHardman
Option 3: Canonical Processes of Media ProductionHowever, some of youprobablyattended Raphaël Troncy’stalklastyear (available in slideshare)19
In summary…I decidedtotalkaboutsomethingthat I havebeenworking in forthelastcouple of years, and which combinesSemantics (of course, thisisthekeyexpertise of ourgroup)Mainlyannotation, Linked Data and a bit of Multimedia OntologyEngineeringSocial networks, collaboration, sharing and collectiveintelligenceExploiting home networks and online multimedia sitesAnd, obviously, multimediaAnd hence I stillleaveoutmanyinterestingtopics (e.g., semanticsin user interfaces)20
OutlineIntroductionWhat I willbetalkingabout and what I willnotSem-UPnP-GridSharing multimedia contentacrosshomesthroughsemanticannotationsCredits: Mariano Rico and Adrián Siles (UPM), Víctor Méndez and José Manuel Gómez-Pérez (iSOCO), José Manuel Palacios and Mónica Pérez (TID)Sem4TagsTagdisambiguation in FlickrM3 Ontology(onlyif time permits)A semanticbackboneforour multimedia-relatedworkConclusions and outlook21
InternetMotivationMultimedia resources in Web2.0 are stored in centralised servers.You lose some of yourrights as anauthorwhenyouuploadtheseresourcestothese servers.Privacyproblems.Poorannotations and metadata.Theseresourcescannotbesharedwithotherresources in your home.22UpGrid
Multimedia Content SharingwithUpGrid23Annotation:“Ángel onthebeach”Reasoning: “Ángel is my son”
 “Pedro is my brother”
 “Juan is my brother”
-------------------------------
Ángel is my nephewJuanP2PSemantic-basedquery:“multimedia contentrelatedto my nephew”Annotation:“Ángel playing soccer”PedroAdditionalsemanticinformation: “Ángel is my son”
 “Pedro is my brother”Additionalsemanticinformation:“Juan is my brother”
Architecture
Architecture (anotherviewonit)
SnapshotsfromtheapplicationCheckhttp://www.youtube.com/results?search_query=UPnPGrid
SummaryAneffectivemeansforsharing multimedia contentsacrosshomes, avoiding Web2.0 siteswhereyourrightsmaybecompromisedHowever, itisstill a prototype, and no serioususabilitytesting has been doneMuchworkstillneeded in ordertogointo a real systemAnd endusersfinditdifficulttoprovideannotationsDo you imagine yourparents and grandparentsannotatingphotos and videos likethat?Let’sseehowthiscouldbeamelioratedwiththenextpart of ourpresentation.27
OutlineIntroductionWhat I willbetalkingabout and what I willnotSem-UPnP-GridSharing multimedia contentacrosshomesthroughsemanticannotationsSem4TagsTagdisambiguation in FlickrCredits: Héctor Andrés García SilvaM3 Ontology(onlyif time permits)A semanticbackboneforour multimedia-relatedworkConclusions and outlook28Egresado de laUniversidad del Valle
IntroductionSocial Tagging SystemsWeb 2.0 applications Applications for storing, sharing, and discovering information resources.Users assign tagsto identify information resourcesTags are used to search/discover resources29
IntroductionFolksonomyEmerging classification scheme from social tagging systems Folk: People, Taxonomy: ClassificationRepresented by: Users, Tags, ResourcesTaxonomyFolksonomyTop-down
Controlled Vocabulary
Hierarchical structure
Exclusive/Restrictive
Expensive to maintain
Bottom-up (user created)
No fixed vocabulary
No Hierarchical structure
No Exclusive/Flexible
Low cost30
IntroductionWhy is tagging so popular?Reduce cognitive burdens it’s easy to useUsers don´t need any special skill or experienceThe benefits of tagging are immediateFuture retrievalContribution and sharingAttract AttentionSelf PresentationOpinion Expression31
IntroductionHoweverTags can be ambiguous Polysemy: partyas a celebration as opposed to partyas a political organizationSynonym: party and celebration Morphological variations:     party, parties, partying, partyignPluralsAcronymsConjugated verbsMisspellingCompound wordsPolitical party, PoliticalParty, Political_party,    Political-Party, etc.Detail/granularity levelA general tag as partyin contrast to a specific tag as banquet.32
MotivationThe problem: Morphological variations, synonyms, granularity, and polysemy hamper information retrieval processes based on folksonomies. Systems ignore resources tagged with morphological variationsor synonyms of that tag, as well as the resources tagged with more generic or more specific tags710.659 results8.661.581 Results33
    When searching with polysemous tags, all the resources tagged with that tag are retrieved without taking into account the tag sense the user was looking for.	(e.g., Query flickr with bank results in photos about financial institutions, river edges, fog banks, and sand banks, etc. )34Motivation
MotivationWhat if we associate tags with semantic entities?http://morpheus.cs.umbc.edu/aks1/ontosem.owl #non-work-activityWe can avoid the aforementioned pitfalls#organization#special-occasion#political-entity#party#Celebration#political-party#Coalition#federation#Birthday#Anniversaryuk, tories, party, conservative, speech party, balloons, colors, bar, crowd35
State of the Art: Semantic Grounding of Cross-Lingual FolksonomiesGarcia HA, Corcho O, Alani H, Gómez-Pérez A. Review of the state of the art: Discovering and Associating Semantics to Folksonomies. Knowledge Engineering Review (in press)None of the analyzed approaches deals with multilingual tags36
Semantic Grounding of Cross-Lingual FolksonomiesMSR: a Multilingual Sense Repository based on Wikipedia and enriched with semantic information taken from DBpedia.Terms and frequencyBancoBankhttp://dbpedia.org/resource/BankTerms and frequencyBancoCardumenSwarmhttp://dbpedia.org/resource/SwarmBanco de Arenahttp://dbpedia.org/resource/SandBankTerms and frequencySandbank37
Semantic Grounding of Cross-Lingual FolksonomiesSem4Tags: A process for Associating Semantics to Tags.38Dinero,Calle,Santander,Money,Madrid,Atm, cajeroEuropeEuro FinanceCentral bankawesomePicNikon ..BankBancohttp://dbpedia.org/resource/Bank
Semantic Grounding of Cross-Lingual FolksonomiesDisambiguation activityThe candidate senses and the tag context are represented as vectors. The vector components are the set of most frequent terms in each Wikipedia page representing a sense.For each sense the values of the vector are calculated using TF-IDF.For the tag context the values in each position are 1 or 0 if the corresponding term appears in the tag context. The tag context vector is compared against each sense vector using the cosine of the angle as similarity measure. The most similar sense to the tag context is selected as the one representing the meaning of the analyzed tag3939
Semantic Grounding of Cross-Lingual FolksonomiesDisambiguation activityWe use the information of the wikipedia default sense for a term. Sim(TagContext, Sensei)= λ*Cosine + β*defaultSenseWe experimentally defined β = 0,2 and λ = 0.8We attempt to use DBpedia semantic information in the disambiguation activity:Sim(TagContext, Sensei)= λ*Cosine + β*defaultSense + δ*SemanticInfoStudies have shown that tags in flickr refers mainly to: Locations, Time, Given Names, Potography related subjects among others. We use DBpedia and YAGO relations to classify the senses according to this categories.However, we found that not all the senses related to a term have the same amount of relations. (e.g. Madrid is not a city)40
Let’s try ithttp://robinson.dia.fi.upm.es:8080/SemanticTagsWebApp/index.jspWhatdoes “bernabeu” mean ifitscontextis…?estadio, madrid, fútbol41
ExperimentBaseline: Directly associate tags with DBpedia resourcesLook for spaces and replace them with  ' _‘.For tags in English:Create a URI of the form http://en.wikipedia.org/wiki/tagQuery DBpedia using the http://xmlns.com/foaf/0.1/page relationFor tags in Spanish:Create a URI of the form http://es.wikipedia.org/wiki/tagQuery DBpedia using the http://dbpedia.org/property/wikipage-es relation42
ExperimentApproaches:Baseline: Selection of the sense without a disambiguation activity.Sem4Tags: For each sense we use the whole Wikipedia article as source for frequentterms.Sem4TagsAC: Same as Sem4Tags including the selection of the Active Context.Sem4TagsAbs: For each sense we use the Wikipedia article abstract (extracted from DBpedia) as source for frequent terms.Sem4TagsAbsAC: Same as Sem4TagsAbs including the selection of the Active Context.43
ExperimentInitial Data SetWide range of Users, photos, and tags.764 photos uploaded by 719 users to Flickr that have been tagged with tags describing tourist places in Spain12.4 (+/- 7.85) tags per photo9484 tagging activities (TAS) : <user,photo,tag>4135 distinct tags where usedProcessed Data SetFrom each photo we processed on average 2 tags 2260 taggingactivities (TAS)44
ExperimentEvaluation Campaign41 EvaluatorsEvaluate semantic associations produce by each approach: <user; tag; photo; DBpedia resource; language>Three different evaluators evaluated each semantic association.Questions:Able to identify the tag meaning (known or Unknown)Tag language (English, Spanish, Both, other)The tag correspond to a Named entityAccording to the identified tag language they evaluate the semantic association in terms ofHighly related, Related, Not Related.45
ExperimentResultsEvaluators identified the semantics of the 87% of TAS (known)62.6 % of TAS were considered in English87.7% of TAS were considered in SpanishAgreement among evaluators (Fleiss’ kappa statistics):k=0.76 for highly relatedK=0.71 for the related case/highly related case46
ExperimentPrecision and RecallforHighlyRelevantresults47EnglishSpanish
ExperimentConclusionsBaseline obtained high precision, however it was able to find semantic resources for just a fraction of the analyzed data set:Baseline: 27.7% in English and 19.4% in Spanish.Sem4Tags: 79.1 % in English and 81.4% in SpanishAll approaches obtained better precision with named entities than with unnamed entities. Sem4Tags and Sem4TagsAC are the approaches that obtained the best results in terms of Precision and Recall. Sometimes Sem4TagsAC obtains better P@1 values but the improvements are supported by no or low statistical evidence. Sem4TagsAbs and Sem4TagsAbs are clearly the worst approaches. 48
OutlineIntroductionWhat I willbetalkingabout and what I willnotSem-UPnP-GridSharing multimedia contentacrosshomesthroughsemanticannotationsSem4TagsTagdisambiguation in FlickrM3 Ontology(onlyif time permits)A semanticbackboneforour multimedia-relatedworkConclusions and outlook49
50Ontología M3
There are already multimedia ontologiesMDS Upper Layer represented in RDFS2001: HunterLater on: link to the ABC upper ontologyMDS fully represented in OWL-DL2004: Tsinaraki et al., DS-MIRF modelMPEG-7 fully represented in OWL-DL2005: Garcia and Celma, Rhizomik modelFully automatic translation of the whole standardMDS and Visual parts represented in OWL-DL2007: Arndt et al., COMM model Re-engineering MPEG-7 using DOLCE design patternsHowever, their requirements are not always clear nor have they been developed with clear methodological guidelines

More Related Content

PDF
Provenance for Multimedia
PPTX
Linked Statistical Data 101
PPTX
Aplicando los principios de Linked Data en AEMET
PPTX
STARS4ALL general presentation at ALAN2016
PDF
RuleML 2015: Ontology Reasoning using Rules in an eHealth Context
PPTX
Educando sobre datos abiertos: desde el colegio a la universidad
PDF
RuleML2015 : Hybrid Relational and Graph Reasoning
PPTX
Ojo Al Data 100 - Call for sharing session at IODC 2016
Provenance for Multimedia
Linked Statistical Data 101
Aplicando los principios de Linked Data en AEMET
STARS4ALL general presentation at ALAN2016
RuleML 2015: Ontology Reasoning using Rules in an eHealth Context
Educando sobre datos abiertos: desde el colegio a la universidad
RuleML2015 : Hybrid Relational and Graph Reasoning
Ojo Al Data 100 - Call for sharing session at IODC 2016

Similar to Combining Multimedia and Semantics (LACNEM2010) (20)

PPT
Ontologies for multimedia: the Semantic Culture Web
PPT
Introduction to Semantic Web for GIS Practitioners
PDF
Creating Semantic Mashups Bridging Web 2 0 And The Semantic Web Presentation 1
PDF
Creating Semantic Mashups Bridging Web 2 0 And The Semantic Web Presentation 1
PPTX
From semantic platforms to semantic apps
PDF
The Semantic Web: What IAs Need to Know About Web 3.0
PPT
Netflix presentation final
PPT
Realizing a Semantic Web Application - ICWE 2010 Tutorial
PPT
Developing A Semantic Web Application - ISWC 2008 tutorial
PDF
Semantic Web applications for mobility and social interaction
PDF
The Redlink way towards a Semantic CMS
PDF
Semantic CMS using the Redlink Platform by Thomas Kurz
PDF
Introduction to the Semantic Web
PPTX
Making things findable
PPT
Intro semanticweb
PPT
Semantic technologies at work
PPTX
The Social Semantic Web
PPTX
Semantic web
PPT
Intelligent expert systems for location planning
PPT
Semantic web
Ontologies for multimedia: the Semantic Culture Web
Introduction to Semantic Web for GIS Practitioners
Creating Semantic Mashups Bridging Web 2 0 And The Semantic Web Presentation 1
Creating Semantic Mashups Bridging Web 2 0 And The Semantic Web Presentation 1
From semantic platforms to semantic apps
The Semantic Web: What IAs Need to Know About Web 3.0
Netflix presentation final
Realizing a Semantic Web Application - ICWE 2010 Tutorial
Developing A Semantic Web Application - ISWC 2008 tutorial
Semantic Web applications for mobility and social interaction
The Redlink way towards a Semantic CMS
Semantic CMS using the Redlink Platform by Thomas Kurz
Introduction to the Semantic Web
Making things findable
Intro semanticweb
Semantic technologies at work
The Social Semantic Web
Semantic web
Intelligent expert systems for location planning
Semantic web
Ad

More from Oscar Corcho (20)

PPTX
Organisational Interoperability in Practice at Universidad Politécnica de Madrid
PPTX
Introducción a los Datos Abiertos - Open Data Day 2020
PPTX
Open Data (and Software, and other Research Artefacts) - A proper management
PDF
Adiós a los ficheros, hola a los grafos de conocimientos estadísticos
PPTX
Ontology Engineering at Scale for Open City Data Sharing
PPTX
Situación de las iniciativas de Open Data internacionales (y algunas recomen...
PPTX
STARS4ALL - Contaminación Lumínica
PPTX
Towards Reproducible Science: a few building blocks from my personal experience
PPTX
Publishing Linked Statistical Data: Aragón, a case study
PPTX
An initial analysis of topic-based similarity among scientific documents base...
PPTX
Generación de datos estadísticos enlazados del Instituto Aragonés de Estadística
PPTX
Presentación de la red de excelencia de Open Data y Smart Cities
PPTX
Why do they call it Linked Data when they want to say...?
PPTX
Linked Statistical Data: does it actually pay off?
PPTX
Slow-cooked data and APIs in the world of Big Data: the view from a city per...
PPTX
Research Objects for improved sharing and reproducibility
PPTX
(Big) Data (Science) Skills
PPTX
Big Data - El Futuro a través de los Datos
PPTX
EKAW2014 Keynote: Ontology Engineering for and by the Masses: are we already ...
PPTX
Aspectos técnicos de la ontología PPROC
Organisational Interoperability in Practice at Universidad Politécnica de Madrid
Introducción a los Datos Abiertos - Open Data Day 2020
Open Data (and Software, and other Research Artefacts) - A proper management
Adiós a los ficheros, hola a los grafos de conocimientos estadísticos
Ontology Engineering at Scale for Open City Data Sharing
Situación de las iniciativas de Open Data internacionales (y algunas recomen...
STARS4ALL - Contaminación Lumínica
Towards Reproducible Science: a few building blocks from my personal experience
Publishing Linked Statistical Data: Aragón, a case study
An initial analysis of topic-based similarity among scientific documents base...
Generación de datos estadísticos enlazados del Instituto Aragonés de Estadística
Presentación de la red de excelencia de Open Data y Smart Cities
Why do they call it Linked Data when they want to say...?
Linked Statistical Data: does it actually pay off?
Slow-cooked data and APIs in the world of Big Data: the view from a city per...
Research Objects for improved sharing and reproducibility
(Big) Data (Science) Skills
Big Data - El Futuro a través de los Datos
EKAW2014 Keynote: Ontology Engineering for and by the Masses: are we already ...
Aspectos técnicos de la ontología PPROC
Ad

Recently uploaded (20)

PDF
Chapter 3 Spatial Domain Image Processing.pdf
PDF
Machine learning based COVID-19 study performance prediction
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PDF
NewMind AI Weekly Chronicles - August'25 Week I
PDF
Empathic Computing: Creating Shared Understanding
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
MIND Revenue Release Quarter 2 2025 Press Release
PDF
Unlocking AI with Model Context Protocol (MCP)
PPTX
Programs and apps: productivity, graphics, security and other tools
PPTX
Big Data Technologies - Introduction.pptx
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PPTX
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
Chapter 3 Spatial Domain Image Processing.pdf
Machine learning based COVID-19 study performance prediction
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
NewMind AI Weekly Chronicles - August'25 Week I
Empathic Computing: Creating Shared Understanding
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
Reach Out and Touch Someone: Haptics and Empathic Computing
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
Agricultural_Statistics_at_a_Glance_2022_0.pdf
MIND Revenue Release Quarter 2 2025 Press Release
Unlocking AI with Model Context Protocol (MCP)
Programs and apps: productivity, graphics, security and other tools
Big Data Technologies - Introduction.pptx
Dropbox Q2 2025 Financial Results & Investor Presentation
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
20250228 LYD VKU AI Blended-Learning.pptx
Understanding_Digital_Forensics_Presentation.pptx
Diabetes mellitus diagnosis method based random forest with bat algorithm

Combining Multimedia and Semantics (LACNEM2010)

  • 1. Combining Multimedia and SemanticsOscar Corcho (ocorcho@fi.upm.es)Universidad Politécnica de Madridhttp://www.oeg-upm.net/LACNEM 2010, Cali, ColombiaSeptember 9th 2010Credits: Adrián Siles, Mariano Rico, Víctor Méndez, Hector Andrés García-Silva, María del Carmen Suárez-Figueroa, Ghislain Atemezing, Raphaël TroncyWorkdistributedunderthelicenseCreativeCommonsAttribution-Noncommercial-Share Alike3.0http://www.slideshare.net/ocorcho
  • 2. 2Asunción Gómez PérezOntologyEngineering Group. Whomwe areDirector: A. Gómez-PérezResearch Group (37 people)2 Full Professor4 AssociateProfessors1 AssistantProfessor3 Postdocs17 PhD Students8 MScStudents2 Software Engineers Management (4 people)2 Project Managers1 SystemAdministrator1 Secretary 50+ PastCollaborators 10+ visitors
  • 4. Beforewestart…Howmany of youhaveeverheardabouttheword “Ontology”?And howmany of you do actuallyknowwhatitmeans?4
  • 5. Comingtotermswithontologies and semanticsAn ontology is an engineering artifact, which provides: A vocabulary of termsA set of explicit assumptions regarding the intended meaning of the vocabulary. Almost always including concepts and their classificationAlmost always including properties between conceptsShared understanding of a domain of interest Agreement on the meaning of termsFormal and machine manipulable model of a domain of interestBesides...The meaning (semantics) of such terms is formally specifiedNew terms can be formed by combining existing onesCan also specify relationships between terms in multiple ontologies5
  • 7. OutlineIntroductionWhat I willbetalkingabout and what I willnot…Therewereseveraloptionsthat I exploredbeforeselectingtheonethatyouwillbehearing in thistalk…7
  • 8. Option 1: The Semantic GapThe lack of coincidencebetweentheinformationthatone can extractfromthesensory data and theinterpretationthatthesame data has for a user in a givensituation8A.W.M. Smeulders, M. Worring, S. Santini, A. Gupta, R. Jain: Content-based image retrieval at the end of the early years, IEEE PAMI, 1349–1380, 2000.However, I alreadyassumedthatEbroulwouldbetalking a lotaboutit in hisopeningkeynote (as he did). Besides, I havenotworked at allonthelow-levelpart, so itmaybedifficultfor me toprovideyouwith a goodinsightonthe (many) open problems in thisarea
  • 9. Option 2: MPEG-7 and the Semantic WebISO standard since December 2001Main components:Descriptors (Ds) and Description Schemes (DSs)DDL (XML Schema + extensions)Concern all types of mediaA good number of ontologies developed around it9Part 5 – MDSMultimedia Description Schemes
  • 10. Option 2: MPEG-7 and the Semantic WebHowever, thetalkmay: Be a bit boring and tootechnicalMaylackthemix of state of the art and visionthataninvitedtalkshouldnormallyhave And MPEG-7 isnotusedtoomuch…So I willcoveronlysomeaspects of thislater, when I talkabout multimedia ontologies.10
  • 11. Option 3: Canonical Processes of Media Production (and semantics, obviously)For example….http://www.cewe-photobook.comApplication for authoring digital photo booksAutomatic selection, sorting and ordering of photosContext analysis methods: timestamp, annotation, etc.Content analysis methods: color histograms, edge detection, etc.Customized layout and backgroundPrint by the European leader photo finisher company11Credits: Raphaël Troncy, LyndaHardman
  • 12. CeWe Color PhotoBook ProcessesMy winter ski holidays with my friendsCredits: Raphaël Troncy, LyndaHardman
  • 13. CeWe Color PhotoBook ProcessesCredits: Raphaël Troncy, LyndaHardman
  • 14. CeWe Color PhotoBook ProcessesCredits: Raphaël Troncy, LyndaHardman
  • 15. CeWe Color PhotoBook ProcessesCredits: Raphaël Troncy, LyndaHardman
  • 16. CeWe Color PhotoBook ProcessesCredits: Raphaël Troncy, LyndaHardman
  • 17. CeWe Color PhotoBook ProcessesCredits: Raphaël Troncy, LyndaHardman
  • 18. Semantics can be important in the processCredits: Raphaël Troncy, LyndaHardman
  • 19. Option 3: Canonical Processes of Media ProductionHowever, some of youprobablyattended Raphaël Troncy’stalklastyear (available in slideshare)19
  • 20. In summary…I decidedtotalkaboutsomethingthat I havebeenworking in forthelastcouple of years, and which combinesSemantics (of course, thisisthekeyexpertise of ourgroup)Mainlyannotation, Linked Data and a bit of Multimedia OntologyEngineeringSocial networks, collaboration, sharing and collectiveintelligenceExploiting home networks and online multimedia sitesAnd, obviously, multimediaAnd hence I stillleaveoutmanyinterestingtopics (e.g., semanticsin user interfaces)20
  • 21. OutlineIntroductionWhat I willbetalkingabout and what I willnotSem-UPnP-GridSharing multimedia contentacrosshomesthroughsemanticannotationsCredits: Mariano Rico and Adrián Siles (UPM), Víctor Méndez and José Manuel Gómez-Pérez (iSOCO), José Manuel Palacios and Mónica Pérez (TID)Sem4TagsTagdisambiguation in FlickrM3 Ontology(onlyif time permits)A semanticbackboneforour multimedia-relatedworkConclusions and outlook21
  • 22. InternetMotivationMultimedia resources in Web2.0 are stored in centralised servers.You lose some of yourrights as anauthorwhenyouuploadtheseresourcestothese servers.Privacyproblems.Poorannotations and metadata.Theseresourcescannotbesharedwithotherresources in your home.22UpGrid
  • 23. Multimedia Content SharingwithUpGrid23Annotation:“Ángel onthebeach”Reasoning: “Ángel is my son”
  • 24. “Pedro is my brother”
  • 25. “Juan is my brother”
  • 27. Ángel is my nephewJuanP2PSemantic-basedquery:“multimedia contentrelatedto my nephew”Annotation:“Ángel playing soccer”PedroAdditionalsemanticinformation: “Ángel is my son”
  • 28. “Pedro is my brother”Additionalsemanticinformation:“Juan is my brother”
  • 32. SummaryAneffectivemeansforsharing multimedia contentsacrosshomes, avoiding Web2.0 siteswhereyourrightsmaybecompromisedHowever, itisstill a prototype, and no serioususabilitytesting has been doneMuchworkstillneeded in ordertogointo a real systemAnd endusersfinditdifficulttoprovideannotationsDo you imagine yourparents and grandparentsannotatingphotos and videos likethat?Let’sseehowthiscouldbeamelioratedwiththenextpart of ourpresentation.27
  • 33. OutlineIntroductionWhat I willbetalkingabout and what I willnotSem-UPnP-GridSharing multimedia contentacrosshomesthroughsemanticannotationsSem4TagsTagdisambiguation in FlickrCredits: Héctor Andrés García SilvaM3 Ontology(onlyif time permits)A semanticbackboneforour multimedia-relatedworkConclusions and outlook28Egresado de laUniversidad del Valle
  • 34. IntroductionSocial Tagging SystemsWeb 2.0 applications Applications for storing, sharing, and discovering information resources.Users assign tagsto identify information resourcesTags are used to search/discover resources29
  • 35. IntroductionFolksonomyEmerging classification scheme from social tagging systems Folk: People, Taxonomy: ClassificationRepresented by: Users, Tags, ResourcesTaxonomyFolksonomyTop-down
  • 45. IntroductionWhy is tagging so popular?Reduce cognitive burdens it’s easy to useUsers don´t need any special skill or experienceThe benefits of tagging are immediateFuture retrievalContribution and sharingAttract AttentionSelf PresentationOpinion Expression31
  • 46. IntroductionHoweverTags can be ambiguous Polysemy: partyas a celebration as opposed to partyas a political organizationSynonym: party and celebration Morphological variations: party, parties, partying, partyignPluralsAcronymsConjugated verbsMisspellingCompound wordsPolitical party, PoliticalParty, Political_party, Political-Party, etc.Detail/granularity levelA general tag as partyin contrast to a specific tag as banquet.32
  • 47. MotivationThe problem: Morphological variations, synonyms, granularity, and polysemy hamper information retrieval processes based on folksonomies. Systems ignore resources tagged with morphological variationsor synonyms of that tag, as well as the resources tagged with more generic or more specific tags710.659 results8.661.581 Results33
  • 48. When searching with polysemous tags, all the resources tagged with that tag are retrieved without taking into account the tag sense the user was looking for. (e.g., Query flickr with bank results in photos about financial institutions, river edges, fog banks, and sand banks, etc. )34Motivation
  • 49. MotivationWhat if we associate tags with semantic entities?http://morpheus.cs.umbc.edu/aks1/ontosem.owl #non-work-activityWe can avoid the aforementioned pitfalls#organization#special-occasion#political-entity#party#Celebration#political-party#Coalition#federation#Birthday#Anniversaryuk, tories, party, conservative, speech party, balloons, colors, bar, crowd35
  • 50. State of the Art: Semantic Grounding of Cross-Lingual FolksonomiesGarcia HA, Corcho O, Alani H, Gómez-Pérez A. Review of the state of the art: Discovering and Associating Semantics to Folksonomies. Knowledge Engineering Review (in press)None of the analyzed approaches deals with multilingual tags36
  • 51. Semantic Grounding of Cross-Lingual FolksonomiesMSR: a Multilingual Sense Repository based on Wikipedia and enriched with semantic information taken from DBpedia.Terms and frequencyBancoBankhttp://dbpedia.org/resource/BankTerms and frequencyBancoCardumenSwarmhttp://dbpedia.org/resource/SwarmBanco de Arenahttp://dbpedia.org/resource/SandBankTerms and frequencySandbank37
  • 52. Semantic Grounding of Cross-Lingual FolksonomiesSem4Tags: A process for Associating Semantics to Tags.38Dinero,Calle,Santander,Money,Madrid,Atm, cajeroEuropeEuro FinanceCentral bankawesomePicNikon ..BankBancohttp://dbpedia.org/resource/Bank
  • 53. Semantic Grounding of Cross-Lingual FolksonomiesDisambiguation activityThe candidate senses and the tag context are represented as vectors. The vector components are the set of most frequent terms in each Wikipedia page representing a sense.For each sense the values of the vector are calculated using TF-IDF.For the tag context the values in each position are 1 or 0 if the corresponding term appears in the tag context. The tag context vector is compared against each sense vector using the cosine of the angle as similarity measure. The most similar sense to the tag context is selected as the one representing the meaning of the analyzed tag3939
  • 54. Semantic Grounding of Cross-Lingual FolksonomiesDisambiguation activityWe use the information of the wikipedia default sense for a term. Sim(TagContext, Sensei)= λ*Cosine + β*defaultSenseWe experimentally defined β = 0,2 and λ = 0.8We attempt to use DBpedia semantic information in the disambiguation activity:Sim(TagContext, Sensei)= λ*Cosine + β*defaultSense + δ*SemanticInfoStudies have shown that tags in flickr refers mainly to: Locations, Time, Given Names, Potography related subjects among others. We use DBpedia and YAGO relations to classify the senses according to this categories.However, we found that not all the senses related to a term have the same amount of relations. (e.g. Madrid is not a city)40
  • 55. Let’s try ithttp://robinson.dia.fi.upm.es:8080/SemanticTagsWebApp/index.jspWhatdoes “bernabeu” mean ifitscontextis…?estadio, madrid, fútbol41
  • 56. ExperimentBaseline: Directly associate tags with DBpedia resourcesLook for spaces and replace them with ' _‘.For tags in English:Create a URI of the form http://en.wikipedia.org/wiki/tagQuery DBpedia using the http://xmlns.com/foaf/0.1/page relationFor tags in Spanish:Create a URI of the form http://es.wikipedia.org/wiki/tagQuery DBpedia using the http://dbpedia.org/property/wikipage-es relation42
  • 57. ExperimentApproaches:Baseline: Selection of the sense without a disambiguation activity.Sem4Tags: For each sense we use the whole Wikipedia article as source for frequentterms.Sem4TagsAC: Same as Sem4Tags including the selection of the Active Context.Sem4TagsAbs: For each sense we use the Wikipedia article abstract (extracted from DBpedia) as source for frequent terms.Sem4TagsAbsAC: Same as Sem4TagsAbs including the selection of the Active Context.43
  • 58. ExperimentInitial Data SetWide range of Users, photos, and tags.764 photos uploaded by 719 users to Flickr that have been tagged with tags describing tourist places in Spain12.4 (+/- 7.85) tags per photo9484 tagging activities (TAS) : <user,photo,tag>4135 distinct tags where usedProcessed Data SetFrom each photo we processed on average 2 tags 2260 taggingactivities (TAS)44
  • 59. ExperimentEvaluation Campaign41 EvaluatorsEvaluate semantic associations produce by each approach: <user; tag; photo; DBpedia resource; language>Three different evaluators evaluated each semantic association.Questions:Able to identify the tag meaning (known or Unknown)Tag language (English, Spanish, Both, other)The tag correspond to a Named entityAccording to the identified tag language they evaluate the semantic association in terms ofHighly related, Related, Not Related.45
  • 60. ExperimentResultsEvaluators identified the semantics of the 87% of TAS (known)62.6 % of TAS were considered in English87.7% of TAS were considered in SpanishAgreement among evaluators (Fleiss’ kappa statistics):k=0.76 for highly relatedK=0.71 for the related case/highly related case46
  • 62. ExperimentConclusionsBaseline obtained high precision, however it was able to find semantic resources for just a fraction of the analyzed data set:Baseline: 27.7% in English and 19.4% in Spanish.Sem4Tags: 79.1 % in English and 81.4% in SpanishAll approaches obtained better precision with named entities than with unnamed entities. Sem4Tags and Sem4TagsAC are the approaches that obtained the best results in terms of Precision and Recall. Sometimes Sem4TagsAC obtains better P@1 values but the improvements are supported by no or low statistical evidence. Sem4TagsAbs and Sem4TagsAbs are clearly the worst approaches. 48
  • 63. OutlineIntroductionWhat I willbetalkingabout and what I willnotSem-UPnP-GridSharing multimedia contentacrosshomesthroughsemanticannotationsSem4TagsTagdisambiguation in FlickrM3 Ontology(onlyif time permits)A semanticbackboneforour multimedia-relatedworkConclusions and outlook49
  • 65. There are already multimedia ontologiesMDS Upper Layer represented in RDFS2001: HunterLater on: link to the ABC upper ontologyMDS fully represented in OWL-DL2004: Tsinaraki et al., DS-MIRF modelMPEG-7 fully represented in OWL-DL2005: Garcia and Celma, Rhizomik modelFully automatic translation of the whole standardMDS and Visual parts represented in OWL-DL2007: Arndt et al., COMM model Re-engineering MPEG-7 using DOLCE design patternsHowever, their requirements are not always clear nor have they been developed with clear methodological guidelines
  • 66. 52Knowledge ResourcesOntological ResourcesO. Design Patterns34O. Repositories and Registries56FlogicRDF(S)OWLOntological ResourceReuse O. Aligning O. Merging 562Ontology DesignPattern ReuseNon Ontological ResourceReuse436Non Ontological Resources2Ontological ResourceReengineering7GlossariesDictionariesLexicons5Non Ontological ResourceReengineering46ClassificationSchemasThesauriTaxonomiesAlignments2RDF(S)1FlogicO. ConceptualizationO. ImplementationO. FormalizationO. SpecificationSchedulingOWL8Ontology Restructuring(Pruning, Extension, Specialization, Modularization)9O. Localization1,2,3,4,5,6,7,8, 9Ontology Support Activities: Knowledge Acquisition (Elicitation); Documentation; Configuration Management; Evaluation (V&V); AssessmentNeOnMethodology
  • 67. 53Ontology Requirements Specification (I)Non-functional ontology requirements:
  • 68. Characteristics not related to the ontology content 54Ontology Requirements Specification (II)Functional ontology requirements:
  • 69. Content specific requirements referred to the particular knowledge to be represented by the ontology
  • 71. in the form ofCQs
  • 72. in the form of sentences (General Characteristics)
  • 73. Strategies: (1) Top-Down, (2) Bottom-Up, and (3) Middle out55Ontology Requirements Specification (III): Functional Requirements on M3Perspectiva MultidominioPerspectiva MultimediaPerspectiva Multilenguaje
  • 74. 56Ontology Requirements Specification (IV): ORSDPerspectiva MultidominioM3Perspectiva MultimediaPerspectiva Multilenguaje
  • 75. 57Knowledge ResourcesOntological ResourcesO. Design Patterns34O. Repositories and Registries56FlogicRDF(S)OWLOntological ResourceReuse O. Aligning O. Merging 562Ontology DesignPattern ReuseNon Ontological ResourceReuse436Non Ontological Resources2Ontological ResourceReengineering7GlossariesDictionariesLexicons5Non Ontological ResourceReengineering46ClassificationSchemasThesauriTaxonomiesAlignments2RDF(S)1FlogicO. ConceptualizationO. ImplementationO. FormalizationO. SpecificationSchedulingOWL8Ontology Restructuring(Pruning, Extension, Specialization, Modularization)9O. Localization1,2,3,4,5,6,7,8, 9Ontology Support Activities: Knowledge Acquisition (Elicitation); Documentation; Configuration Management; Evaluation (V&V); AssessmentNeOnMethodology
  • 76. 58Scheduling using gOnttgOnttLife cycle model selectionI need to schedule the development of the M3 ontolgy networkScenarios selection
  • 78. 60Knowledge ResourcesOntological ResourcesO. Design Patterns34O. Repositories and Registries56FlogicRDF(S)OWLOntological ResourceReuse O. Aligning O. Merging 562Ontology DesignPattern ReuseNon Ontological ResourceReuse436Non Ontological Resources2Ontological ResourceReengineering7GlossariesDictionariesLexicons5Non Ontological ResourceReengineering46ClassificationSchemasThesauriTaxonomiesAlignments2RDF(S)1FlogicO. ConceptualizationO. ImplementationO. FormalizationO. SpecificationSchedulingOWL8Ontology Restructuring(Pruning, Extension, Specialization, Modularization)9O. Localization1,2,3,4,5,6,7,8, 9Ontology Support Activities: Knowledge Acquisition (Elicitation); Documentation; Configuration Management; Evaluation (V&V); AssessmentNeOnMethodology
  • 79. 61Reusing Ontological Resources: Comparative Analysis (I)
  • 80. 62Reusing Ontological Resources: Comparative Analysis
  • 81. OutlineIntroductionWhat I willbetalkingabout and what I willnotSem-UPnP-GridSharing multimedia contentacrosshomesthroughsemanticannotationsSem4TagsTagdisambiguation in FlickrM3 Ontology(onlyif time permits)A semanticbackboneforour multimedia-relatedworkConclusions and outlook63
  • 82. Conclusions and outlookWeallagreethat…Multimedia UGC has beenone of thebasis of Web2.0The use of semantics can provide…Betterunderstanding of thedomain and of theircontentHeavyweight: addressingthesemantic gap automaticallyLigthweight: allowinguserstoannotateMiddleweight: from free tagstoknowledgeBetterexploratorynavigation and serendipityInterconnecting multimedia contentwiththeLinked Data cloudHowever, privacyissues are still a majorbarrierfor a largeruptake, especiallyforsomepopulationsegmentsAllowing P2P exchangebetween “known” homes, whileexploitingsemantic-basedsearch64
  • 83. Combining Multimedia and SemanticsOscar Corcho (ocorcho@fi.upm.es)Universidad Politécnica de Madridhttp://www.oeg-upm.net/LACNEM 2010, Cali, ColombiaSeptember 9th 2010Credits: Adrián Siles, Mariano Rico, Víctor Méndez, Hector Andrés García-Silva, María del Carmen Suárez-Figueroa, Ghislain Atemezing, Raphaël TroncyWorkdistributedunderthelicenseCreativeCommonsAttribution-Noncommercial-Share Alike3.0http://www.slideshare.net/ocorcho