Berner Fachhochschule | Haute école spécialisée bernoise | Bern University of Applied Sciences
Semi-automatic Tagging of Images on Wikimedia Commons
Wikidata Con 2023, 28-29 October 2023, Online & Taiwan on-site conference
Beat Estermann
▶ Bern University of Applied Sciences, Bern Academy of the Arts
Homepage image of the ISA Tool by Islahaddow, using File:2017 06 Ali- 00213.jpg by user:Alimdaihli, CC BY-SA 4.0 (Wikimedia Commons)
The text of this slide deck is made available under a CC BY 4.0 License.
Background / Motivation
▶ How to leverage the power of artificial intelligence for the tagging of images?
▶ Student Projects at Bern University of Applied Sciences in 2020 and 2021:
• Leverage deep learning approaches for the tagging of images.
• Link concepts to Wikidata.
• Develop custom models to escape the “black box” of proprietary algorithms.
• The way forward is not fully automatic tagging, but semi-automatic tagging with a human in the loop.
• Combine the CAT Tool and the ISA Tool on Wikimedia Commons for a start.
• In addition to image recognition, also leverage existing image metadata.
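As a rough illustration of what "linking concepts to Wikidata" can look like in practice, the sketch below builds a request for Wikidata's public `wbsearchentities` API action and parses candidate items out of a response. It is not the student projects' code: the function names `build_search_url` and `parse_candidates` are hypothetical, and the sample response in the usage note is hand-written for illustration rather than captured from the live API.

```python
import json
from urllib.parse import urlencode

WIKIDATA_API = "https://www.wikidata.org/w/api.php"


def build_search_url(label, language="en", limit=5):
    """Build a wbsearchentities request URL for a free-text label."""
    params = {
        "action": "wbsearchentities",
        "search": label,
        "language": language,
        "type": "item",
        "limit": limit,
        "format": "json",
    }
    return f"{WIKIDATA_API}?{urlencode(params)}"


def parse_candidates(response_text):
    """Return (QID, label, description) tuples from a wbsearchentities response."""
    payload = json.loads(response_text)
    return [
        (hit["id"], hit.get("label", ""), hit.get("description", ""))
        for hit in payload.get("search", [])
    ]


if __name__ == "__main__":
    print(build_search_url("house cat"))
    # Hand-written sample payload, for illustration only.
    sample = '{"search": [{"id": "Q146", "label": "house cat"}]}'
    print(parse_candidates(sample))
```

A label produced by an image-recognition model would be passed to `build_search_url`, the URL fetched with any HTTP client, and the returned candidates offered to a human for confirmation rather than saved automatically.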
Switch Connectome Project
Purpose: Building a FAIR Knowledge Graph of Open Data for Research & Education
Try the Beta: https://opendatanavigator.switch.ch
Support: Open Data Navigator & API Project Memberships
© SWITCH
Who is SWITCH?
The SWITCH Foundation is the Swiss national infrastructure service provider for higher education and research. The core aspects of the foundation’s mission are to enable, maintain and promote a secure and networked research and education infrastructure in Switzerland.
https://www.switch.ch/
© SWITCH
SWITCH InnoLab “Image to Concept” (2022-2023)
▶ Project Partners:
• SWITCH
• Wikimedia Sverige / software developers & power user of the ISA Tool
• Bern University of Applied Sciences / Bern University of the Grisons
▶ Project Goal:
• Implement a prototypical solution for the semi-automatic tagging of images on Wikimedia Commons, using:
  • the ISA Tool
  • Google Cloud Vision
  • a newly developed algorithm for the extraction of entities based on metadata
Project Website: https://commons.wikimedia.org/wiki/Commons:ISA_Tool/Image_to_Concept
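The semi-automatic workflow described in the project goal can be sketched as follows: candidate tags from an image-recognition source and from a metadata-based source are merged, weak candidates are dropped, and nothing is saved until a human accepts it. This is a minimal sketch under stated assumptions, not the project's implementation: the `Suggestion` class, the 0.5 score threshold, the "keep the highest score per item" merging rule, and the example scores are all hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Suggestion:
    qid: str      # Wikidata item ID of the suggested concept
    label: str    # human-readable label shown to the contributor
    source: str   # "vision" or "metadata" (hypothetical source names)
    score: float  # confidence in [0, 1]; the scale is an assumption


def merge_suggestions(vision, metadata, min_score=0.5):
    """Merge both sources, keeping the best-scoring suggestion per item."""
    best = {}
    for s in list(vision) + list(metadata):
        if s.score < min_score:
            continue  # drop weak candidates before they reach the human
        current = best.get(s.qid)
        if current is None or s.score > current.score:
            best[s.qid] = s
    return sorted(best.values(), key=lambda s: s.score, reverse=True)


def confirm(suggestions, accepted_qids):
    """Human-in-the-loop step: keep only the items the contributor accepted."""
    return [s.qid for s in suggestions if s.qid in accepted_qids]


if __name__ == "__main__":
    vision = [
        Suggestion("Q146", "house cat", "vision", 0.9),
        Suggestion("Q144", "dog", "vision", 0.3),
    ]
    metadata = [
        Suggestion("Q146", "house cat", "metadata", 0.7),
        Suggestion("Q70", "Bern", "metadata", 0.8),
    ]
    merged = merge_suggestions(vision, metadata)
    print([s.label for s in merged])
    print(confirm(merged, {"Q70"}))
```

Keeping the confirmation step separate from the merging step reflects the "human in the loop" principle: the algorithms only rank candidates, and the contributor decides which tags are actually written.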
The ISA Tool (Live Demo…)
The Technical Solution
Link to File (Wikimedia Commons, User: Beat Estermann, CC BY-SA)
Current State of Implementation
▶ Test Version of the enhanced Tool deployed at: https://isa-dev.toolforge.org/
• Tag suggestions from the algorithms not always working.
• Saving tags not always working.
• The situation is currently worse than during user testing in winter 2022-2023.
A series of potential causes for the issues encountered has been identified; bug fixing has been delayed due to a lack of resources.
▶ Production Version of the Tool available at: https://isa.toolforge.org/ (without the new features)
• This version was down between May and August 2023 due to prolonged maintenance issues.
• Up and running again since mid-August 2023.
Key Learnings
▶ Resolve the remaining performance and reliability issues!
▶ Increase the visibility and take-up of the tool among potential contributors.
▶ Assess and monitor the relevance of the ISA Tool in comparison to other tools (for adding Structured Data on Commons).
▶ Engage in a dialogue with various stakeholders on what constitutes “good” tagging of images.
▶ Further improve/complement the algorithms used for semi-automatic tagging.
▶ Develop alternatives to the current requirement of uploading all media files to Wikimedia Commons (the free license requirement is too restrictive for many research use cases).
▶ Clarify roles and responsibilities with regard to deployment, operations, and maintenance (SLAs).
Reflections with regard to the wider Community Discussion
Movement Strategy
▶ For R&D projects, we need reliable partner organizations.
▶ The respective movement entities need to be empowered for this role.
▶ Maintenance and development of important tools & services need to be backed up by an organizational commitment.
▶ We should become experts in navigating between the “volunteer” and the “paid staff” worlds; we need to develop a shared professional culture encompassing both.
Outlook
▶ Include Structured Data on Commons in the SWITCH Research Data Connectome
▶ Fix the remaining bugs of the enhanced version of the ISA Tool
▶ Further develop the Metadata-to-Concept Algorithm (students’ project at University of Virginia – School of Data Science)
Contact
Beat Estermann
Bern Academy of the Arts
Representative Open Science Board & Digital Humanities
beat.estermann@hkb.bfh.ch
Opendata.ch
Member of the Board
beat.estermann@opendata.ch
