Assessing Subject Metadata for Images
Hannah Marie Marshall, hmm88@cornell.edu
Metadata Librarian for Image Collections
Cornell University Library
ARLIS/NA+VRA 2016
March 11, 2016
Seattle, Washington
Background
Assessment Goals
• Determine retrieval rates
• Determine the search utility
• Primary Terms
• “What is the image of?”
• Secondary Terms
• “What is the image about?”
• Tertiary Terms
• “How does the image
communicate to the viewer?”
Challenges of subject analysis for
images
• "Image indexing is a complex socio-cognitive process that
involves processing sensory input through classifying,
abstracting, and mapping sensory data into concepts and
entities often expressed through socially-defined and
culturally-justified linguistic labels and identifiers"
(Heidorn, 1999)
• "Concept-based indexing has the advantage of providing
higher-level analysis of the image content but is expensive to
implement and suffers from a lack of inter-indexer
consistency due to the subjective nature of image
interpretation" (Chen & Rasmussen, 1999)
Findings – types of terms
Search Utility
• Primary Terms
• “What is the image of?”
• Secondary Terms
• “What is the image about?”
• Tertiary Terms
• “How does the image
communicate to the viewer?”
• Non-subject Terms
• Descriptive terms that don’t
address the subject matter of the
work (e.g., worktype,
materials/techniques,
style/period)
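Chart: Types of terms (existing data vs. users)
• Primary Terms: existing data 64%, users 34%
• Secondary Terms: existing data 12%, users 13%
• Tertiary Terms: existing data 19%, users 16%
• Non-Subject Terms: existing data 5%, users 37%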
Findings – types of terms
Search Utility
• Higher levels of correspondence
for images of two-dimensional
works
• Higher retrieval rates
• Higher search utility
• Users were 2.5 times more likely
to use non-subject terms to
describe and search for images
of three-dimensional works (and
non-representational/abstract
works)
• Pottery, jewelry, sculpture
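Chart: 2D works vs. 3D works (share of terms by type)
• 2D works, existing data: Primary 71.7%, Secondary 15.3%, Tertiary 13%, Non-Subject 0%
• 2D works, users: Primary 45.3%, Secondary 16%, Tertiary 19%, Non-Subject 19.7%
• 3D works, existing data: Primary 47.2%, Secondary 5%, Tertiary 32%, Non-Subject 15.8%
• 3D works, users: Primary 26.4%, Secondary 8.2%, Tertiary 16.8%, Non-Subject 48.6%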
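The "2.5 times" figure can be checked against the chart values. The sketch below assumes one particular reading of the chart: the four series are Primary, Secondary, Tertiary, and Non-Subject terms, and the columns are 2D existing data, 2D users, 3D existing data, and 3D users. That reading is inferred rather than stated on the slide, though each column sums to 100% under it. The dictionary keys are illustrative names only.

```python
# Assumed reading of the 2D vs. 3D chart (percent of terms by type).
# The column/series mapping is inferred from the slide, not stated on it.
shares = {
    ("2D", "existing"): {"primary": 71.7, "secondary": 15.3, "tertiary": 13.0, "non_subject": 0.0},
    ("2D", "users"): {"primary": 45.3, "secondary": 16.0, "tertiary": 19.0, "non_subject": 19.7},
    ("3D", "existing"): {"primary": 47.2, "secondary": 5.0, "tertiary": 32.0, "non_subject": 15.8},
    ("3D", "users"): {"primary": 26.4, "secondary": 8.2, "tertiary": 16.8, "non_subject": 48.6},
}

# Sanity check: under this reading every column should total 100%.
for column in shares.values():
    assert abs(sum(column.values()) - 100.0) < 1e-6

# Ratio of users' non-subject share for 3D works to that for 2D works.
ratio = shares[("3D", "users")]["non_subject"] / shares[("2D", "users")]["non_subject"]
print(round(ratio, 2))
```

Under this reading the ratio comes out just under 2.5, which is consistent with the bullet above.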
Findings – types of terms
Search Utility
• Users were 2.5 times more likely
to use non-subject terms to
describe and search for images
of three-dimensional works (and
non-representational/abstract
works)
• Pottery, jewelry, sculpture
[Chart: Most common types of non-subject access points. Categories: Worktype, Style/Period, Materials/Techniques, Culture. Axis: 0% to 60%.]
Findings – literal terms
Retrieval Rates
• Literal matches = successful
image retrieval
• Non-matches = unsuccessful
image retrieval
• Successful retrieval = 8.5%
• Unsuccessful retrieval =
91.5%
[Chart: Correspondence between existing metadata and users’ search terms. Series: Non-matches, Literal Matches.]
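One way a literal-match retrieval rate like this could be computed is sketched below. The slides do not say how terms were normalized or whether matching was done per image or across the whole sample, so the case-insensitive exact match, the function name `literal_match_rate`, and the sample terms are all assumptions made for illustration.

```python
def literal_match_rate(user_terms, metadata_terms):
    """Fraction of user search terms that literally appear in the existing metadata.

    Assumption: a "literal match" is a case-insensitive exact match
    after trimming surrounding whitespace.
    """
    if not user_terms:
        return 0.0
    normalized = {term.strip().lower() for term in metadata_terms}
    hits = sum(1 for term in user_terms if term.strip().lower() in normalized)
    return hits / len(user_terms)


# Hypothetical record: cataloger-assigned terms vs. terms a user supplied.
existing = ["Portrait", "Woman", "Oil painting"]
searched = ["woman", "hat", "sadness", "portrait"]

rate = literal_match_rate(searched, existing)
print(f"Successful retrieval: {rate:.1%}")
print(f"Unsuccessful retrieval: {1 - rate:.1%}")
```

A stricter or looser definition of "literal" (for example, stemming or matching on substrings) would change the rate, so the normalization step is the main design choice to pin down before comparing figures.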
Findings – literal terms
Retrieval Rates
• Of that 8.5%...
• Primary Terms (75%)
• “What is the image of?”
• Secondary Terms (3%)
• “What is the image about?”
• Tertiary Terms (16%)
• “How does the image
communicate to the viewer?”
• Non-subject Terms (6%)
• Other descriptive metadata that
does not address subject
meaning (e.g., materials and
techniques)
[Chart: Corresponding literal terms broken down by type. Series: Primary Terms, Secondary Terms, Tertiary Terms, Non-Subject Terms.]
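To see what each term type contributes overall, the breakdown can be multiplied through by the 8.5% match rate. This is a rough sketch that assumes the four percentages are shares of the literal matches and that the 8.5% is measured against all user search terms; the variable names are illustrative.

```python
# Figures taken from the slides; the interpretation of their bases is an assumption.
match_rate = 0.085  # literal matches as a share of user search terms
breakdown = {
    "primary": 0.75,
    "secondary": 0.03,
    "tertiary": 0.16,
    "non_subject": 0.06,
}

# Share of all user search terms that matched literally, by term type.
overall = {term_type: match_rate * share for term_type, share in breakdown.items()}

for term_type, share in overall.items():
    print(f"{term_type}: {share:.2%} of all user search terms")
```

Read this way, primary-term matches account for roughly 6 in every 100 user search terms, and the other three types together for roughly 2.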
Conclusions
• Primary terms yield the greatest
search utility and higher levels
of successful image retrieval.
• High numbers of non-subject
terms applied to images of
three-dimensional and non-
representational works suggest
that subject metadata is a weak
access point for them.
Thank you!
