SlideShare a Scribd company logo
Create It Once, Use It
Again…and Again…andAgain…
Cross-platform Repurposing of Archival
Metadata
Andrea Payant
Sara Skindelien
Liz Woolcott
Utah State University
Carol Ou
Katherine Rankin
University of Nevada, Las Vegas
Cory Nimer
Brigham Young University
The Missing Link
Metadata Conversion Workflows for Everyone
Andrea Payant
Metadata Specialist
andrea.payant@usu.edu
Sara Skindelien
Special Collections Assistant
sara.skindelien@usu.edu
Liz Woolcott
Head, Cataloging & Metadata
Liz.woolcott@usu.edu
CIMA Annual Conference
2016
PilotProject
Working conditions
• No archival management system
• Hand coded EAD guides
• Legacy finding aids
• No consistent use of spreadsheets
• Digital repository for archival
material
• Contribute to two consortiums
• Need to meet both standards
PilotProject
What we needed
• Streamline/automate metadata
creation
• Link digitized images between EAD
and CONTENTdm
• Make work flexible
• Work can be done by anyone
(library staff, student workers,
curators)
•Lower the tech barrier
• XML transformations require in-
depth training – is there another
way?
• Document procedures
PilotProject
SCA-Digital (SCA-D) Workflow Group
• What/Who
• Group composed of Special Collection and Archives staff, Digital Initiatives
staff, and Metadata staff
• Purpose
• Streamline workflows between Special Collections and Digital Initiatives
• Primary focus on metadata creation – most time consuming of tasks
• Timeline
• 2014-2015
• Results (View report: https://usu.box.com/s/fqyn5usd9b4wf6pcg7466bwt3oeam4q6)
• Developed two workflows
• Automation of EAD to Dublin Core and
• Digital content linking
• Digital Assessment Checklist
• Tackled two retro metadata projects
Two processes, step-by-step
Workflow for converting HTML finding aid inventory into Dublin
Core: https://workflowy.com/s/mRrejmDAtj
Workflow for Digital Content Linking:
https://workflowy.com/s/Ekz41aSze2
Converting HTML Finding Aids
to Dublin Core for Batch
Uploading
Repurposing EAD Container Lists
Problem: We needed a simple, low tech option to convert our legacy finding
aids into Dublin Core compliant metadata for digitization.
Solution: Opted for “copy/paste” process because it was by far the easiest
method to develop and teach. EVERYBODY can copy/paste.
Tools:
Methods:
Microsoft Office (Excel specifically), Oxygen XML Editor, &
CONTENTdm
In less than 10 easy steps we adjusted data using common Excel
spreadsheet formulas and batch imported the data into the digital
collection management system
Step1:Copyinventoryfromonlinefindingaid
Or is it?
Just a plain old, run-of-
the mill spreadsheet.
The copied inventory from the finding aid pasted into our Excel spreadsheet
template under the Raw HTML sheet.
Step 2: Isolate the title from the identifier:
Insert a column
Enter formula =RIGHT(ColumnRow,
LEN(ColumnRow)-7)
The Missing Link: Metadata Conversion Workflows for Everyone
PilotProject
EditColumnstoseparateBox,FolderIteminformationfromTitle
Step 3: Create another column for identifiers. Highlight
the first three rows & grab the black square in row 3 and
drag down to the last line of text to autofill consecutive
numbers.
The identifiers have
now been separated
from the title into their
own column.
Step4:CopycorrespondingcolumnsfromHTMLsheettotheEADsheet
Beware: Make sure you select
Paste Special when copying
columns so just the data is copied
& not the formulas.
Add the Collection Name, Collection Number and Collection URL at
the top for automatic exporting to Dublin Core sheet.
Step5:Insertcollectioninformation
The Missing Link: Metadata Conversion Workflows for Everyone
Review the Dublin Core sheet for
complete exportation.
Step 7: Save Excel spreadsheet as a
new tab delimited file.
Step 6: Filenames, provided by the
Digital Initiatives staff, are added for
each item.
Step 8: Open in a text editor such as
Notepad and save the file again for
batch uploading into CONTENTdm.
Batch Linking Digital Content
Batch Linking Digital Content
OVERVIEW
 Procedure 1 – Exporting and Spreadsheet Clean-Up
o Outcome: Create a tab delimited file – re-purpose existing metadata
 Procedure 2 – Mail Merge
o Outcome: Use metadata to create container lists in xml for EAD finding
aids and complete batch linking
 Procedure 3 – Uploading the Finding Aid
o Outcome: Perform quality control and upload to Archives West
Batch Linking Digital Content
Procedure 1 – Exporting and Spreadsheet Clean-Up
• Export metadata from CONTENTdm
• Open the tab delimited file in Excel and edit as needed
Batch Linking Digital Content
Procedure 2 – Mail Merge
• Use an xml container list template - copy & paste into a new Word document
• Use mail merge feature in Word to automatically populate container list fields
from your source file
• Edit the merged document
Batch Linking Digital Content
Procedure 3 – Uploading the Finding Aid
• Copy & Paste new container list from Word into the <dsc> section of the
master xml document
What we learned
- Training needs
• Be prepared to teach/re-teach
• Helping them see the bigger picture
 How are users going to access the material
 How will these descriptions look in all applicable systems (CDM, Archives
West, etc.)
- Develop and train everyone on Best Practices
- Fluency with Excel
• Excel will mess with dates – make sure this formatted correctly
- Compliance with multiple standards
• DACs allows “circa” dates, RDA prefers “approximate”, ISO standards do not
• Need to be machine-readable and human readable
- Future applications of this process will change (ie. adopting
ArchivesSpace)
Want to try it out?
Workflow for Digital Content Linking:
https://workflowy.com/s/Ekz41aSze2
Workflow for converting HTML finding aid inventory into Dublin Core:
https://workflowy.com/s/mRrejmDAtj
Visit our Blog/Find our presentation slides here:
http://usucataloging.wix.com/usucatalogers
Questions?
Andrea Payant
Metadata Specialist
andrea.payant@usu.edu
Sara Skindelien
Special Collections Assistant
sara.skindelien@usu.edu
Liz Woolcott
Head, Cataloging & Metadata
Liz.woolcott@usu.edu

More Related Content

PPTX
ARK de Triumph: Linking Finding Aids & Digital Libraries Using a Low-Tech App...
PPTX
Transparent Licenses: Making user rights clear (OLA Super Conference 2015)
PDF
Getting the Most out of CORAL
PPTX
Discovery layer decisions, configurations and strategies
PPTX
Getting on the Same Page: Aligning ERM and LIbGuides Content
PPTX
The Orlando Project Visual Workflow
PDF
AALL 2015: Hands on Linked Data Tools for Catalogers: MarcEdit and MARCNext
PPTX
Preparing Catalogers for Linked data
ARK de Triumph: Linking Finding Aids & Digital Libraries Using a Low-Tech App...
Transparent Licenses: Making user rights clear (OLA Super Conference 2015)
Getting the Most out of CORAL
Discovery layer decisions, configurations and strategies
Getting on the Same Page: Aligning ERM and LIbGuides Content
The Orlando Project Visual Workflow
AALL 2015: Hands on Linked Data Tools for Catalogers: MarcEdit and MARCNext
Preparing Catalogers for Linked data

What's hot (19)

PPTX
Linked Data at Smithsonian Libraries
PPTX
COMPanion Corporation Alexandria by Nancy Garcia, Luis Mercado, Elizabeth Tan...
PPTX
IR Metadata in the Library Catalog: Our experience with ETDs
PPT
Document management #RWIRW
PDF
Georgia Tech Drupal Users Group - February 2015 Meeting
PDF
#OSSPARIS19 : Comment ONLYOFFICE aide à organiser les travaux de recherches ...
PPTX
Show 'Em What You've Got: Exposing Finding Aids with ArchivesSpace
PPTX
Don’t make me think: biodiversity data publishing made easy
PPTX
PaLA2010 Annual Cultivating Technical Services
PDF
2020 Vision (Dubious Design Decisions)
PPTX
Walk this way: Online content platform migration experiences and collaboration
PDF
Some NoSQL
PPTX
Informatics and data analysis - McMahon - MEWE 2013
PDF
Database Systems - Lecture Week 1
Linked Data at Smithsonian Libraries
COMPanion Corporation Alexandria by Nancy Garcia, Luis Mercado, Elizabeth Tan...
IR Metadata in the Library Catalog: Our experience with ETDs
Document management #RWIRW
Georgia Tech Drupal Users Group - February 2015 Meeting
#OSSPARIS19 : Comment ONLYOFFICE aide à organiser les travaux de recherches ...
Show 'Em What You've Got: Exposing Finding Aids with ArchivesSpace
Don’t make me think: biodiversity data publishing made easy
PaLA2010 Annual Cultivating Technical Services
2020 Vision (Dubious Design Decisions)
Walk this way: Online content platform migration experiences and collaboration
Some NoSQL
Informatics and data analysis - McMahon - MEWE 2013
Database Systems - Lecture Week 1
Ad

Similar to The Missing Link: Metadata Conversion Workflows for Everyone (20)

PPT
ALA Interoperability
PPTX
Reengineering PDF-Based Documents Targeting Complex Software Specifications
PPTX
1-Introduction to Data Structures beginner.pptx
PDF
Lecture_1_Intro.pdf
PPTX
Unit - I Intro. to OOP Concepts and Control Structure -OOP and CG (2024 Patte...
PPTX
PPTX
Enterprise Data World 2018 - Building Cloud Self-Service Analytical Solution
PDF
ASMUG February 2015 Knowledge Event
PPT
Database Management & Models
PPTX
data structures and its importance
PDF
Day 4 - Excel Automation and Data Manipulation
PPTX
Data Science Process.pptx
PPTX
A machine learning and data science pipeline for real companies
PDF
Euclid Data Model 101 - Episode 01: Overview
PPTX
Database Management System
PPT
Xml Publisher And Reporting To Excel
PDF
Informatics_Practices_SrSec_2024-25.pdf.
PPT
Database Management System Processing.ppt
PDF
Scoping Level of Effort and Getting the Right Resources for the Job
ALA Interoperability
Reengineering PDF-Based Documents Targeting Complex Software Specifications
1-Introduction to Data Structures beginner.pptx
Lecture_1_Intro.pdf
Unit - I Intro. to OOP Concepts and Control Structure -OOP and CG (2024 Patte...
Enterprise Data World 2018 - Building Cloud Self-Service Analytical Solution
ASMUG February 2015 Knowledge Event
Database Management & Models
data structures and its importance
Day 4 - Excel Automation and Data Manipulation
Data Science Process.pptx
A machine learning and data science pipeline for real companies
Euclid Data Model 101 - Episode 01: Overview
Database Management System
Xml Publisher And Reporting To Excel
Informatics_Practices_SrSec_2024-25.pdf.
Database Management System Processing.ppt
Scoping Level of Effort and Getting the Right Resources for the Job
Ad

More from Andrea Payant (20)

PPTX
Avoiding a Level of Discontent in Finding Aids: An Analysis of User Engagemen...
PPTX
On Your MARC, Get Set, Code!
PPTX
Let's Get Digital!
PPTX
Where's the Data?
PPTX
Mitigating the Risk: identifying Strategic University Partnerships for Compli...
PPTX
Just Keep Cataloging: How One Cataloging Unit Changed Their Workflows to Fit ...
PPTX
But Were We Successful: Using Online Asynchronous Focus Groups to Evaluate Li...
PPTX
Assessment and Visualization Tools for Technical Services
PPTX
Research Data Management at USU
PPTX
liwalaawiiloxhbakaa (How We Lived): The Grant Bulltail Absáalooke (Crow Natio...
PPTX
Crowdsourcing Metadata Practices at USU
PPTX
Homeward Bound: How to Move an Entire Cataloging Unit to Remote Work
PPTX
MARC-y MARC and the Coding Bunch
PPTX
Outside In: Retooling Cataloging Outreach Efforts
PPTX
Charting Communication: Assessment and Visualization Tools for Mapping the Co...
PPTX
Memes of Resistance, Election Reflections, and Voices from Drug Court: Social...
PPTX
Giving Credit Where Credit is Due: Author and Funder IDs
PPTX
VOCAB for Collaboration: How “Work Language” Can Help You Win at Teamwork
PPTX
Can You Scan This For Me? Making the Most of Patron Digitization Request in t...
PPTX
Wisdom of the Crowd: Successful Ways to Engage the Public in Metadata Creation
Avoiding a Level of Discontent in Finding Aids: An Analysis of User Engagemen...
On Your MARC, Get Set, Code!
Let's Get Digital!
Where's the Data?
Mitigating the Risk: identifying Strategic University Partnerships for Compli...
Just Keep Cataloging: How One Cataloging Unit Changed Their Workflows to Fit ...
But Were We Successful: Using Online Asynchronous Focus Groups to Evaluate Li...
Assessment and Visualization Tools for Technical Services
Research Data Management at USU
liwalaawiiloxhbakaa (How We Lived): The Grant Bulltail Absáalooke (Crow Natio...
Crowdsourcing Metadata Practices at USU
Homeward Bound: How to Move an Entire Cataloging Unit to Remote Work
MARC-y MARC and the Coding Bunch
Outside In: Retooling Cataloging Outreach Efforts
Charting Communication: Assessment and Visualization Tools for Mapping the Co...
Memes of Resistance, Election Reflections, and Voices from Drug Court: Social...
Giving Credit Where Credit is Due: Author and Funder IDs
VOCAB for Collaboration: How “Work Language” Can Help You Win at Teamwork
Can You Scan This For Me? Making the Most of Patron Digitization Request in t...
Wisdom of the Crowd: Successful Ways to Engage the Public in Metadata Creation

Recently uploaded (20)

PDF
FourierSeries-QuestionsWithAnswers(Part-A).pdf
PDF
Sports Quiz easy sports quiz sports quiz
PDF
O5-L3 Freight Transport Ops (International) V1.pdf
PPTX
PPT- ENG7_QUARTER1_LESSON1_WEEK1. IMAGERY -DESCRIPTIONS pptx.pptx
PDF
VCE English Exam - Section C Student Revision Booklet
PDF
Complications of Minimal Access Surgery at WLH
PDF
Physiotherapy_for_Respiratory_and_Cardiac_Problems WEBBER.pdf
PPTX
Introduction_to_Human_Anatomy_and_Physiology_for_B.Pharm.pptx
PDF
Computing-Curriculum for Schools in Ghana
PPTX
Cell Types and Its function , kingdom of life
PPTX
GDM (1) (1).pptx small presentation for students
PPTX
Final Presentation General Medicine 03-08-2024.pptx
PDF
01-Introduction-to-Information-Management.pdf
PPTX
Pharmacology of Heart Failure /Pharmacotherapy of CHF
PDF
Basic Mud Logging Guide for educational purpose
PPTX
Cell Structure & Organelles in detailed.
PDF
grade 11-chemistry_fetena_net_5883.pdf teacher guide for all student
PPTX
Renaissance Architecture: A Journey from Faith to Humanism
PPTX
PPH.pptx obstetrics and gynecology in nursing
PPTX
human mycosis Human fungal infections are called human mycosis..pptx
FourierSeries-QuestionsWithAnswers(Part-A).pdf
Sports Quiz easy sports quiz sports quiz
O5-L3 Freight Transport Ops (International) V1.pdf
PPT- ENG7_QUARTER1_LESSON1_WEEK1. IMAGERY -DESCRIPTIONS pptx.pptx
VCE English Exam - Section C Student Revision Booklet
Complications of Minimal Access Surgery at WLH
Physiotherapy_for_Respiratory_and_Cardiac_Problems WEBBER.pdf
Introduction_to_Human_Anatomy_and_Physiology_for_B.Pharm.pptx
Computing-Curriculum for Schools in Ghana
Cell Types and Its function , kingdom of life
GDM (1) (1).pptx small presentation for students
Final Presentation General Medicine 03-08-2024.pptx
01-Introduction-to-Information-Management.pdf
Pharmacology of Heart Failure /Pharmacotherapy of CHF
Basic Mud Logging Guide for educational purpose
Cell Structure & Organelles in detailed.
grade 11-chemistry_fetena_net_5883.pdf teacher guide for all student
Renaissance Architecture: A Journey from Faith to Humanism
PPH.pptx obstetrics and gynecology in nursing
human mycosis Human fungal infections are called human mycosis..pptx

The Missing Link: Metadata Conversion Workflows for Everyone

  • 1. Create It Once, Use It Again…and Again…andAgain… Cross-platform Repurposing of Archival Metadata Andrea Payant Sara Skindelien Liz Woolcott Utah State University Carol Ou Katherine Rankin University of Nevada, Las Vegas Cory Nimer Brigham Young University
  • 2. The Missing Link Metadata Conversion Workflows for Everyone Andrea Payant Metadata Specialist andrea.payant@usu.edu Sara Skindelien Special Collections Assistant sara.skindelien@usu.edu Liz Woolcott Head, Cataloging & Metadata Liz.woolcott@usu.edu CIMA Annual Conference 2016
  • 3. PilotProject Working conditions • No archival management system • Hand coded EAD guides • Legacy finding aids • No consistent use of spreadsheets • Digital repository for archival material • Contribute to two consortiums • Need to meet both standards
  • 4. PilotProject What we needed • Streamline/automate metadata creation • Link digitized images between EAD and CONTENTdm • Make work flexible • Work can be done by anyone (library staff, student workers, curators) •Lower the tech barrier • XML transformations require in- depth training – is there another way? • Document procedures
  • 5. PilotProject SCA-Digital (SCA-D) Workflow Group • What/Who • Group composed of Special Collection and Archives staff, Digital Initiatives staff, and Metadata staff • Purpose • Streamline workflows between Special Collections and Digital Initiatives • Primary focus on metadata creation – most time consuming of tasks • Timeline • 2014-2015 • Results (View report: https://usu.box.com/s/fqyn5usd9b4wf6pcg7466bwt3oeam4q6) • Developed two workflows • Automation of EAD to Dublin Core and • Digital content linking • Digital Assessment Checklist • Tackled two retro metadata projects
  • 6. Two processes, step-by-step Workflow for converting HTML finding aid inventory into Dublin Core: https://workflowy.com/s/mRrejmDAtj Workflow for Digital Content Linking: https://workflowy.com/s/Ekz41aSze2
  • 7. Converting HTML Finding Aids to Dublin Core for Batch Uploading
  • 8. Repurposing EAD Container Lists Problem: We needed a simple, low tech option to convert our legacy finding aids into Dublin Core compliant metadata for digitization. Solution: Opted for “copy/paste” process because it was by far the easiest method to develop and teach. EVERYBODY can copy/paste. Tools: Methods: Microsoft Office (Excel specifically), Oxygen XML Editor, & CONTENTdm In less than 10 easy steps we adjusted data using common Excel spreadsheet formulas and batch imported the data into the digital collection management system
  • 10. Or is it? Just a plain old, run-of- the mill spreadsheet.
  • 11. The copied inventory from the finding aid pasted into our Excel spreadsheet template under the Raw HTML sheet.
  • 12. Step 2: Isolate the title from the identifier: Insert a column Enter formula =RIGHT(ColumnRow, LEN(ColumnRow)-7)
  • 14. PilotProject EditColumnstoseparateBox,FolderIteminformationfromTitle Step 3: Create another column for identifiers. Highlight the first three rows & grab the black square in row 3 and drag down to the last line of text to autofill consecutive numbers.
  • 15. The identifiers have now been separated from the title into their own column.
  • 16. Step4:CopycorrespondingcolumnsfromHTMLsheettotheEADsheet Beware: Make sure you select Paste Special when copying columns so just the data is copied & not the formulas.
  • 17. Add the Collection Name, Collection Number and Collection URL at the top for automatic exporting to Dublin Core sheet. Step5:Insertcollectioninformation
  • 19. Review the Dublin Core sheet for complete exportation.
  • 20. Step 7: Save Excel spreadsheet as a new tab delimited file. Step 6: Filenames, provided by the Digital Initiatives staff, are added for each item. Step 8: Open in a text editor such as Notepad and save the file again for batch uploading into CONTENTdm.
  • 22. Batch Linking Digital Content OVERVIEW  Procedure 1 – Exporting and Spreadsheet Clean-Up o Outcome: Create a tab delimited file – re-purpose existing metadata  Procedure 2 – Mail Merge o Outcome: Use metadata to create container lists in xml for EAD finding aids and complete batch linking  Procedure 3 – Uploading the Finding Aid o Outcome: Perform quality control and upload to Archives West
  • 23. Batch Linking Digital Content Procedure 1 – Exporting and Spreadsheet Clean-Up • Export metadata from CONTENTdm • Open the tab delimited file in Excel and edit as needed
  • 24. Batch Linking Digital Content Procedure 2 – Mail Merge • Use an xml container list template - copy & paste into a new Word document • Use mail merge feature in Word to automatically populate container list fields from your source file • Edit the merged document
  • 25. Batch Linking Digital Content Procedure 3 – Uploading the Finding Aid • Copy & Paste new container list from Word into the <dsc> section of the master xml document
  • 26. What we learned - Training needs • Be prepared to teach/re-teach • Helping them see the bigger picture  How are users going to access the material  How will these descriptions look in all applicable systems (CDM, Archives West, etc.) - Develop and train everyone on Best Practices - Fluency with Excel • Excel will mess with dates – make sure this formatted correctly - Compliance with multiple standards • DACs allows “circa” dates, RDA prefers “approximate”, ISO standards do not • Need to be machine-readable and human readable - Future applications of this process will change (ie. adopting ArchivesSpace)
  • 27. Want to try it out? Workflow for Digital Content Linking: https://workflowy.com/s/Ekz41aSze2 Workflow for converting HTML finding aid inventory into Dublin Core: https://workflowy.com/s/mRrejmDAtj Visit our Blog/Find our presentation slides here: http://usucataloging.wix.com/usucatalogers
  • 28. Questions? Andrea Payant Metadata Specialist andrea.payant@usu.edu Sara Skindelien Special Collections Assistant sara.skindelien@usu.edu Liz Woolcott Head, Cataloging & Metadata Liz.woolcott@usu.edu

Editor's Notes

  • #4: No ArchivesSpace Hand code EAD (or use template) Used CONTENTdm as digital repository Contributor to two consortiums, need to meet both standards ArchivesWest (for EADs) MWDL (for digital content) Some batch loading of digital content Relied on spreadsheets populated row by row
  • #7: Sara and Andrea will be demonstrating two processes – converting HTML finding aid inventories into Dublin Core metadata and Digital Content Linking. All the step-by-step procedures are available at the links above, if you want to try them out later. We will also show these links at the end of the presentation.
  • #9: In the days before standardization, finding aid formats were as unique as the people creating them. This made legacy finding aids difficult to convert into spreadsheets. In addition, we also found that XML stylesheets vary with each collection. We needed a simple, low tech option to convert our legacy finding aids into Dublin Core compliant data for digitization. After extensive research, we opted for the copy/paste method process because it was by far the easiest method to develop and teach. Everybody can copy the html table-formatted container list and paste it into an Excel spreadsheet. We also wanted to utilize the tools we already had on hand – Excel, Oxygen, CONTENTdm. We did not want to purchase or design new software- since such an approach would have been counterproductive to our goal of maintaining a low technological bar. So by developing a strategy that involves less than 10 steps, we adjusted data using common spreadsheet formulas and an XML Editor to batch import the data into the digital collection management system.
  • #10: Step 1: We copied the table formatted container list from the online finding aid
  • #11: We then open our plain, old, run-of-the mill spreadsheet. Or is it?
  • #12: We pasted the html table-formatted container list into a blank spreadsheet, which we titled “Raw HTML Copy”. We want to separate out the identifying numbers in Column B – the 01:01: and so forth, from the title and place the data into its own column.
  • #13: We accomplish this by inserting a column, enter our formula =RIGHT(C1, LEN(C1)-7, with 7 representing the number of characters you want removed from the cell.
  • #14: We now have the title isolated into its own cell.
  • #15: Step 3: Insert another column to include our identifiers. Type in the first three identifiers: 1:01, 1:02, 1:03, highlight the first three rows and grab the black square at the bottom and drag down to your last item to autofill the cells with consecutive numbers.
  • #16: The identifiers have now been separated from the title into their own column.
  • #17: Step 4: Copy corresponding columns from the Raw HTML sheet into the EAD sheet. But beware: make sure you select Paste Special when copying instead of just Paste to make sure only the data is exported over and NOT the formulas otherwise your data fields will not export correctly and will display hashtags.
  • #18: Step 5: Insert the collection name, Herald Journal Photograph Collection; Collection Number, P0001 & Collection URL into the first three rows.
  • #19: Through the use of embedded Excel formulas, collection information is then effortlessly exported over to the Dublin Core sheet from the EAD sheet into Source, Physical Collection Name,
  • #20: Physical Collection Number, Box, Item, Call Number, & Collection Inventory URL for each item. Review the Dublin Core sheet for complete exportation and clean up the sheet to remove empty columns. For instance, this collection did not have folder information so the sheet exported zeros for folder information. Those will need to be re
  • #21: Step 6: Insert filenames provided by our Digital Initiatives staff Step 7: Save spreadsheet as a tab delimited file Step 8: Open the file in a text editor such as Notepad, delete any trailing spaces, and save the file again for batch uploading into CONTENTdm. And now Andrea will explain the batch linking digital content.
  • #23: Overview This is a brief outline of the procedures involved in the workflow we have created to batch upload and embed links to digital content in EAD finding aids (this process works best when an entire collection has been digitized). The 3 main processes include first, the exporting of digital collection metadata into a tab delimited file then editing that metadata in order to repurpose it for linking. Second, we then use the mail merge function in Microsoft Word to automatically create an xml format container list that can be copied then pasted directly into the xml document for the EAD finding aid. Finally, you perform quality control on the xml document then upload the content to Archives West.
  • #24: Here is a more detailed look at the process The first step is to export metadata from your digital asset management system – in our case the system is CONTENTdm and the process is pretty simple. In CONTENTdm administration you select the collections tab and then go to the export option from the menu – you make the appropriate selections for the metadata export – then CONTENTdm creates a tab-delimited text file > you right click on the file to “Save Link As” and save it to your computer. This text file can now be opened in Microsoft Excel. You click through the text import wizard until the process is finished. The result should be a spreadsheet that looks something like this with a lot of fields for the collection metadata Which you will then edit to only include information needed to create an EAD container list with the necessary elements for the xml document including component numbers, component levels, and any necessary hierarchical containers for box, folder, or item, and title, format, date, and the ARK URLs for linking the digital content.
  • #25: Once you have finished making the necessary edits to your spreadsheet you can move on to the next step which is to utilize the mail merge function in Microsoft Word to create a new xml container list for EAD with links to digital content embedded. To begin you will need to use a template like the one you see here This template should represent the xml coding needed for a single item in your EAD finding aid and you want to be sure to include the digital access object and xlink tagging (which are necessary for the content linking to operate effectively). The parts of the xml template that are highlighted here in the angle brackets are variable while the rest of the text is constant, or fixed. Mail merge will use each row of data in your spreadsheet to populate these variable fields and duplicate this template for each item in your collection. To perform the mail merge you first go to the mailings tab in Word and click “Start Mail Merge” and then make sure “Normal Word Document” is selected. Second, you click “Select Recipients” and choose “Use Existing List”, a new window opens to select a table > you select your spreadsheet then another new window opens for you to select your spreadsheet again as the data source for the merge. Next, you will assign fields from your spreadsheet to the corresponding EAD elements in the xml template. You begin by highlighting the first EAD element, then you go to “Insert Merge Field” then select the matching field from the drop down list of data source options. You repeat the same process for each of the EAD elements in your template. Once you have finished, you complete the merge by selecting “Finish & Merge,” then you select “Edit Individual Document” then you choose “All” You will now have a new word document that should look like this. You should see xml for individual items in your collection on each page with information inserted from your spreadsheet. You will then want to make any necessary edits to the xml (like removing empty tags or getting rid of all the extra white space).
  • #26: Then for the final phase of the process you copy the entire container list in Word and paste it into the <dsc> section of your master xml file for your collection’s EAD finding aid. You can then perform quality control on the xml, once finished you can upload your new EAD finding aid complete with links to the digital objects
  • #27: Throughout the creation of this workflow we have learned a few things and we can make some suggestions of things to keep in mind for anyone seeking to implement this process: First, there will most likely be a training needs - you will need be prepared to teach and re-teach as necessary, also make sure those involved in the process understand the overall purpose and benefits from the results of their work – for example teach about how users are accessing material and also what the description differences are in each system You will also need to be sure that everyone is aware of and using best practices and standards for your institution to ensure consistency from all parties involved in the process This workflow involves the use of Excel quite a bit – so there needs to be a certain level of fluency with the program – for example: formatting cells in the spreadsheet can be tricky especially when working with dates for your collection You will need to also make sure that there is compliance across multiple standards – for example, DACS allows “circa” dates but ISO standards do not - you will need to keep in mind that there is an overall need for the information to be machine readable as well as human readable Finally – be aware of and consider the future applications for the process (for example we anticipate adopting Archives Space at some point and we will no doubt have to adapt our workflows for that)
  • #28: If you would like to try out the process you can access our detailed workflows as well and the slides from this presentation today at these sites.
  • #29: You are also welcome to contact any of us if you have further questions