SlideShare a Scribd company logo
When automated
analysis goes wrong
Tristan Roddis
Cogapp
MCN conference 2017
Tristan Roddis. Cogapp
In this session
• Successes
• FAILURES
Tristan Roddis. Cogapp
Failure 1: Interesting
images
Tristan Roddis. Cogapp
The problem
Adrian Hindle, Tristan Roddis. Cogapp
"Nourishment for the Ailing" and "Nourishment for the Healthy"
http://www.qdl.qa/en/archive/qnlhc/9549.20
Adrian Hindle, Tristan Roddis. Cogapp
"Nourishment for the Ailing" and "Nourishment for the Healthy"
http://www.qdl.qa/en/archive/qnlhc/9549.19
Adrian Hindle, Tristan Roddis. Cogapp
Tristan Roddis
Tristan Roddis
Tristan Roddis. Cogapp
Success 1: Interesting
images
Adrian Hindle, Tristan Roddis. Cogapp
• Automatically tag, organize, and search visual
content with machine learning.
• Create concepts
Adrian Hindle, Tristan Roddis. Cogapp
• Existing concepts:
• Illustration
• Drawing
Adrian Hindle, Tristan Roddis. Cogapp
• Negative concept:
• Text
• Manuscripts
Adrian Hindle, Tristan Roddis. Cogapp
• Combination of two custom concepts
• arabic_manuscript
• arabic_ manuscript_with_image
Adrian Hindle, Tristan Roddis. Cogapp
45 images
45 images
Adrian Hindle, Tristan Roddis. Cogapp
26 images
26 images
Adrian Hindle, Tristan Roddis. Cogapp
• Python script to create sets and train
• Test and train
Adrian Hindle
Tristan Roddis
Adrian Hindle, Tristan Roddis. Cogapp
Adrian Hindle
Tristan Roddis
Adrian Hindle
Tristan Roddis
Tristan Roddis
Tristan Roddis. Cogapp
Other uses
• Blank sheets vs. writing
• Handwritten vs. typewritten
• [your collection-specific question here]
Tristan Roddis. Cogapp
Failure 2: term
extraction
Tristan Roddis. Cogapp
Failure 2: term
extraction
Tristan Roddis. Cogapp
Failure 2: term
extraction
Tristan Roddis. Cogapp
Failure 2: term
extraction
Tristan Roddis. Cogapp
Success 2: add a
human
Tristan Roddis. Cogapp
Failure 3: Finding similar
images
Tristan Roddis. Cogapp
Scikit-image
Tristan Roddis. Cogapp
Scikit-image
Tristan Roddis. Cogapp
Scikit-image
Tristan Roddis. Cogapp
Scikit-image
Tristan Roddis. Cogapp
Scikit-learn
• Machine learning is not easy
• Especially for this problem
Tristan Roddis. Cogapp
Success 3: Finding similar
images
Tristan Roddis. Cogapp
Term extraction
• Clarifai
• Google Vision API
• Microsoft Computer Vision
Tristan Roddis. Cogapp
Clarifai
Tristan Roddis. Cogapp
Google Vision API
Tristan Roddis. Cogapp
MS Computer vision
Tristan Roddis. Cogapp
Images
• Sourced from Nationalmuseum Sweden
• Using Europeana API for discovery
• 2000 images
• http://labs.cogapp.com/iiif-ml/
When automated analysis goes wrong
When automated analysis goes wrong
Tristan Roddis. Cogapp
Success 4:
Optical music recognition
When automated analysis goes wrong
http://labs.cogapp.com/nls-omr/wavs/91387296.wav
http://labs.cogapp.com/nls-omr/
Tristan Roddis. Cogapp
Failure 4: Automated
captions
A vase of flowers on a display
Emanuel Swedenborg
When automated analysis goes wrong
Emanuel Swedenborgsitting in front of
a laptop
Person using a phone
Person sitting in a chair talking on a cell
phone
A man and a woman taking a selfie
A cat with its mouth open
A pizza sitting on top of a window
A man riding a bear in the water
A teddy bear
A group of sheep standing on top of a
horse
A group of sheep standing on top of a
book
Tristan Roddis. Cogapp
Conclusions
• Embrace failure
• Find workarounds
• Add humans to the mix
Thank you.
Questions?
http://labs.cogapp.com/iiif-ml
http://labs.cogapp.com/nls-omr
Tristan Roddis
tristanr@cogapp.com

More Related Content

PDF
When automated analysis goes wrong by Tristan Roddis - EuropeanaTech Conferen...
PPTX
Kaihan AceTech presentation April 2014
PPTX
QED Learning Ecosystem
PDF
Wardley Mapping Maturity
PPT
The First Five Traits of Risk Management Excellence
PPT
Summer 2014 Use of media learn360 in the classroom
PPTX
Data Science: The Main Course @ KCDC 2016
PPTX
Thinking Through Digital: Top tips for designing projects and working with te...
When automated analysis goes wrong by Tristan Roddis - EuropeanaTech Conferen...
Kaihan AceTech presentation April 2014
QED Learning Ecosystem
Wardley Mapping Maturity
The First Five Traits of Risk Management Excellence
Summer 2014 Use of media learn360 in the classroom
Data Science: The Main Course @ KCDC 2016
Thinking Through Digital: Top tips for designing projects and working with te...

More from Cogapp (20)

PDF
Digital Strategy for Museums 2016
PPT
Challenges and opportunities for presenting Arabic manuscripts online
PPTX
Creating compelling user interfaces
PPTX
Everything you ever wanted to know about IIIF but were too afraid to ask
PDF
MW16 IIIF: Unshackle Your Images
PDF
Transform your museum with Agile
PDF
Digital: from Idea to Audience - Opportunities for Digital Excellence with a ...
PDF
IIIF The International Image Interoperability Framework at MCN2015
PDF
Making Meaning with Online Collections: Ten Top Tips
PDF
Agile for Beginners and Thrillseekers - MuseumNext 2015 Geneva
PDF
Scaling Drupal on Amazon Web Services (DrupalCamp Brighton)
PDF
Building Multilingual Websites with Drupal (DrupalCamp Brighton)
PDF
Test-driven Development with Drupal and Codeception (DrupalCamp Brighton)
PDF
Rich storytelling with Drupal, Paragraphs and Islandora DAMS (DrupalCamp Brig...
PDF
Leading Digital Transformation in the Cultural Sector: A resource for managers
PDF
MuseumNext 2014: Masterclass. Opportunities for digital excellence with a lit...
KEY
Cogapp's World Usability Day Slides
KEY
Cogapp Open Studios 2012 - Rethinking Interfaces
PPTX
Cogapp Open Studios 2012 - Adventures with Linked Data
PDF
Cogapp open studios - Magnificent Maps
Digital Strategy for Museums 2016
Challenges and opportunities for presenting Arabic manuscripts online
Creating compelling user interfaces
Everything you ever wanted to know about IIIF but were too afraid to ask
MW16 IIIF: Unshackle Your Images
Transform your museum with Agile
Digital: from Idea to Audience - Opportunities for Digital Excellence with a ...
IIIF The International Image Interoperability Framework at MCN2015
Making Meaning with Online Collections: Ten Top Tips
Agile for Beginners and Thrillseekers - MuseumNext 2015 Geneva
Scaling Drupal on Amazon Web Services (DrupalCamp Brighton)
Building Multilingual Websites with Drupal (DrupalCamp Brighton)
Test-driven Development with Drupal and Codeception (DrupalCamp Brighton)
Rich storytelling with Drupal, Paragraphs and Islandora DAMS (DrupalCamp Brig...
Leading Digital Transformation in the Cultural Sector: A resource for managers
MuseumNext 2014: Masterclass. Opportunities for digital excellence with a lit...
Cogapp's World Usability Day Slides
Cogapp Open Studios 2012 - Rethinking Interfaces
Cogapp Open Studios 2012 - Adventures with Linked Data
Cogapp open studios - Magnificent Maps
Ad

Recently uploaded (20)

PDF
Item # 2 - 934 Patterson Specific Use Permit (SUP)
PDF
2025 Shadow report on Ukraine's progression regarding Chapter 29 of the acquis
PDF
Population Estimates 2025 Regional Snapshot 08.11.25
PPTX
GSA Q+A Follow-Up To EO's, Requirements & Timelines
PDF
Abhay Bhutada and Other Visionary Leaders Reinventing Governance in India
PDF
It Helpdesk Solutions - ArcLight Group
PDF
ISO-9001-2015-gap-analysis-checklist-sample.pdf
PPTX
Portland FPDR Oregon Legislature 2025.pptx
DOCX
Alexistogel: Solusi Tepat untuk Anda yang Cari Bandar Toto Macau Resmi
PDF
Item # 5 - 5307 Broadway St final review
PDF
PPT Item #s 2&3 - 934 Patterson SUP & Final Review
PPTX
SOMANJAN PRAMANIK_3500032 2042.pptx
PPT
generalgeologygroundwaterchapt11-181117073208.ppt
PDF
ISO-9001-2015-internal-audit-checklist2-sample.pdf
DOCX
EAPP.docxdffgythjyuikuuiluikluikiukuuuuuu
PDF
Items # 6&7 - 900 Cambridge Oval Right-of-Way
PPTX
DFARS Part 249 - Termination Of Contracts
PDF
Storytelling youth indigenous from Bolivia 2025.pdf
PPTX
sepsis.pptxMNGHGBDHSB KJHDGBSHVCJB KJDCGHBYUHFB SDJKFHDUJ
PPTX
Inferenceahaiajaoaakakakakakakakakakakakakaka
Item # 2 - 934 Patterson Specific Use Permit (SUP)
2025 Shadow report on Ukraine's progression regarding Chapter 29 of the acquis
Population Estimates 2025 Regional Snapshot 08.11.25
GSA Q+A Follow-Up To EO's, Requirements & Timelines
Abhay Bhutada and Other Visionary Leaders Reinventing Governance in India
It Helpdesk Solutions - ArcLight Group
ISO-9001-2015-gap-analysis-checklist-sample.pdf
Portland FPDR Oregon Legislature 2025.pptx
Alexistogel: Solusi Tepat untuk Anda yang Cari Bandar Toto Macau Resmi
Item # 5 - 5307 Broadway St final review
PPT Item #s 2&3 - 934 Patterson SUP & Final Review
SOMANJAN PRAMANIK_3500032 2042.pptx
generalgeologygroundwaterchapt11-181117073208.ppt
ISO-9001-2015-internal-audit-checklist2-sample.pdf
EAPP.docxdffgythjyuikuuiluikluikiukuuuuuu
Items # 6&7 - 900 Cambridge Oval Right-of-Way
DFARS Part 249 - Termination Of Contracts
Storytelling youth indigenous from Bolivia 2025.pdf
sepsis.pptxMNGHGBDHSB KJHDGBSHVCJB KJDCGHBYUHFB SDJKFHDUJ
Inferenceahaiajaoaakakakakakakakakakakakakaka
Ad

When automated analysis goes wrong

Editor's Notes

  • #3: Digital agency based in UK. Work internationally. Would love to work for you.
  • #4: Present some experiments and research we’ve been doing. Look at three problems that can be solved using automated image analysis: interesting items, color extraction, finding similar images In all cases: images only. Deliberately not using metadata: manifests only for providing lists of images to analyse. Present our findings: some positive, full disclosure: some negative
  • #5: Adrian We manage the Qatar Digital Library website, that currently has nearly a million scanned pages. Set ourselves the challenge of automating the process of finding “visually interesting” document pages Not sure what it is but we know what it’s not (not text, not blank, not bindings...
  • #6: Over 600 pages Several dozen manuscripts available, so tens of thousands of images. And constantly adding more, so not easily achievable by humans. Impractical
  • #7: We have some info but not for all, this is on the logical, hard to extract info or missing
  • #8: Examples
  • #12: Tristan First approach we tried was colour analysis More colours used in illustrations than in plain black script Imagaa extracts foreground/background colours, “color variance”
  • #14: Between variance 11 and 17. Hugely mixed. Gave up: tried a different tack.
  • #15: Adrian We manage the Qatar Digital Library website, that currently has nearly a million scanned pages. Set ourselves the challenge of automating the process of finding “visually interesting” document pages Not sure what it is but we know what it’s not (not text, not blank, not bindings...
  • #16: Adrian
  • #17: Lots of concepts: food, colour, focus ... Picking up stains
  • #18: Still not good results
  • #19: Call this what you want Tried with one - not very good Build up two training sets: one with positive results, one with negative results
  • #23: IIIF Collection picked 10 random archives and then 10 images per archive Red -> interest Trained it a couple times (good results) 1 error here updated the set
  • #24: Example for one archive
  • #25: And it works
  • #26: And again
  • #27: Created a manifest - Mirador
  • #29: Last point: you can apply this technique of negative/positive training sets to _any_ visual problem. We demonstrated one version of this particular to our collection, but I’m sure you can think of similar questions about your own.
  • #30: Tristan Colour extraction is the low-hanging fruit of automated image analysis Very easy to do via scripts or APIs
  • #32: “a machine quite literally just said the Met’s collection is full of shit”
  • #33: Tristan Colour extraction is the low-hanging fruit of automated image analysis Very easy to do via scripts or APIs
  • #34: Paris, Texas vs Paris, France
  • #35: Adrian Last problem we looked Tried two approaches
  • #36: Collection of algorithms for image processing Skeletonize: each pixel removed if doesn’t break connectivity, then segment colour Censure: feature detector (scale invariant center-surround detector) Daisy: local image descriptor based on gradient orientation histograms These are good to find the same image (with noise...)
  • #37: Mean squared error: square of the diff between px in A and B, sum and divide by nb of pixels Structural similarity: measure similarity (viewed as a quality measure) Results as expected
  • #38: Very different
  • #39: These 2 are more similar
  • #40: Then started working with ML Harder than it looks (ok if you are an ML scientist) Not quick Tried something else
  • #41: Adrian Last problem we looked Tried two approaches
  • #42: Tristan 10 days before conference. Tried different approach.
  • #46: Tristan: only indexed if confidence value is > 0.75 Elasticsearch with Searchkit interface
  • #47: Focus on left-hand tags
  • #48: Different tags
  • #50: Different tags
  • #52: Different tags
  • #53: Adrian Last problem we looked Tried two approaches
  • #54: MS captioning. Works well
  • #67: Each AI has a “personality” of sorts