SlideShare a Scribd company logo
Creating Structure in Web Archives With Collections:
Different Concepts From Web Archivists
Presented By:
Himarsha R. Jayanetti
Department of Computer Science
Old Dominion University, Norfolk, Virginia
@HimarshaJ @WebSciDL @oducs
TPDL ‘22, The 26th International Conference on Theory and Practice of Digital Libraries, Padua, Italy, 20 - 23 September 2022
Himarsha R. Jayanetti, Shawn M. Jones, Martin Klein, Alex Osbourne, Paul Koerbin, Michael L. Nelson, and Michele C. Weigle
Creating Structure in Web Archives With Collections: Different Concepts From Web Archivists, TPDL ‘22 Padua, Italy. @HimarshaJ @WebSciDL
Web Archives Preserve the Content of Web Pages as They Were at a
Specific Point in Time
Archived web pages, or mementos, are increasingly used by researchers, including journalists, social scientist, and historians.
2
A screenshot of the https://oduwsdl.github.io/ live web page
URI-R: Original
Resource URI-M:
Memento
A screenshot of the https://oduwsdl.github.io/ web page archived on
2020-11-18T23:04:53Z (Memento-Datetime)
https://web.archive.org/web/20201118230453/https://oduwsdl.github.io/
URI-T:
TimeMap
https://web.archive.org/web/
*/https://oduwsdl.github.io/
Creating Structure in Web Archives With Collections: Different Concepts From Web Archivists, TPDL ‘22 Padua, Italy. @HimarshaJ @WebSciDL
The Term “Web Archives” Mean the Same Thing Among
These Eight Web Archive Platforms
3
“Old Dominion University Social Media” Collection
at Archive-It
Collections at the Internet Archive
Webpage Snapshots
Captures
Archived copies
Creating Structure in Web Archives With Collections: Different Concepts From Web Archivists, TPDL ‘22 Padua, Italy. @HimarshaJ @WebSciDL
But There are Different Terms Used for “Memento”
Among Web Archives
4
Captures
Archived Copies
Webpage Snapshots
Memento
“Old Dominion University Social Media” Collection
at Archive-It
Collections at the Internet Archive
Creating Structure in Web Archives With Collections: Different Concepts From Web Archivists, TPDL ‘22 Padua, Italy. @HimarshaJ @WebSciDL
The Term “Collection” Has Slightly Different Meanings
Among Web Archives
5
Captures
Archived Copies
Webpage Snapshots
Memento
“Old Dominion University Social Media” Collection
at Archive-It
Collections at the Internet Archive
Creating Structure in Web Archives With Collections: Different Concepts From Web Archivists, TPDL ‘22 Padua, Italy. @HimarshaJ @WebSciDL
We Differentiated Collections From the Greater Web Archive
(Web Archive as a Whole)
6
Greater Web
Archive: Pandora
Collection: Indigenous
Australians
https://pandora.nla.gov.au/subject/12
https://pandora.nla.gov.au/
Creating Structure in Web Archives With Collections: Different Concepts From Web Archivists, TPDL ‘22 Padua, Italy. @HimarshaJ @WebSciDL
Some Web Archive Collections Contain Sub-Collections
7
7
No Sub-collections
Has Sub-collections
* In conifer: lists acts
as sub-collections.
*
https://archive-it.org/collections/2697
https://webarchive.nla.gov.au/collection/15003
Archive-It
Trove
Creating Structure in Web Archives With Collections: Different Concepts From Web Archivists, TPDL ‘22 Padua, Italy. @HimarshaJ @WebSciDL
Curated By (Attribution):
Single Entity, Different Organizational Collaborators, or the Greater Web Archive
8
8
Greater Web Archive
Single account
Organizational collaborators https://webarchive.nla.gov.au/collection/13842
https://archive-it.org/collections/7635
Creating Structure in Web Archives With Collections: Different Concepts From Web Archivists, TPDL ‘22 Padua, Italy. @HimarshaJ @WebSciDL
Some Web Archive Collections Support Private Collections
9
No Private Collections
Support for Private Collections
https://support.archive-it.org/hc/en-us/articles/208334003-Controlling-access-to-your-web-archives-
https://conifer.rhizome.org/himarshaj
Creating Structure in Web Archives With Collections: Different Concepts From Web Archivists, TPDL ‘22 Padua, Italy. @HimarshaJ @WebSciDL
Some Platforms Embargo Some of Their Resources
10
Do not embargo resources
Embargo resources
https://www.webarchive.org.uk/en/ukwa/collection/3942
https://www.loc.gov/item/lcwaN0006607/
Creating Structure in Web Archives With Collections: Different Concepts From Web Archivists, TPDL ‘22 Padua, Italy. @HimarshaJ @WebSciDL
Navigational Hierarchies: How a Visitor or Crawler Navigates Each
Collection for Information
Type 1 Type 2
11
Type 1 collections are original resource focused. Type 2 collections are archived resource (memento) focused.
Creating Structure in Web Archives With Collections: Different Concepts From Web Archivists, TPDL ‘22 Padua, Italy. @HimarshaJ @WebSciDL
Navigational Hierarchy: Archive-It
Navigational Hierarchy Collection Landing Page
12
Type 1
Creating Structure in Web Archives With Collections: Different Concepts From Web Archivists, TPDL ‘22 Padua, Italy. @HimarshaJ @WebSciDL
Navigational Hierarchy: Library of Congress (LC)
Navigational Hierarchy Collection Items Page
13
Type 1
Creating Structure in Web Archives With Collections: Different Concepts From Web Archivists, TPDL ‘22 Padua, Italy. @HimarshaJ @WebSciDL
Navigational Hierarchy: Croatian Web Archive (HAW)
Navigational Hierarchy Subcategory Landing Page
14
Type 1
Creating Structure in Web Archives With Collections: Different Concepts From Web Archivists, TPDL ‘22 Padua, Italy. @HimarshaJ @WebSciDL
Navigational Hierarchy: Conifer
Navigational Hierarchy Collection Landing Page
15
Type 2
Creating Structure in Web Archives With Collections: Different Concepts From Web Archivists, TPDL ‘22 Padua, Italy. @HimarshaJ @WebSciDL
Navigational Hierarchy: United Kingdom Web Archive (UKWA)
Navigational Hierarchy Collection Landing Page
16
Type 2
Creating Structure in Web Archives With Collections: Different Concepts From Web Archivists, TPDL ‘22 Padua, Italy. @HimarshaJ @WebSciDL
Navigational Hierarchy: National Library of Australia’s (NLA) Trove
Navigational Hierarchy
Collection Landing Page
TEP Page
17
Type 2
Creating Structure in Web Archives With Collections: Different Concepts From Web Archivists, TPDL ‘22 Padua, Italy. @HimarshaJ @WebSciDL
Navigational Hierarchy: National Library of Australia’s (NLA) PANDORA
Navigational Hierarchy Collection Landing Page
18
Type 1
Creating Structure in Web Archives With Collections: Different Concepts From Web Archivists, TPDL ‘22 Padua, Italy. @HimarshaJ @WebSciDL
Navigational Hierarchy: National Library of Australia’s (NLA) PANDORA and Trove
19
Creating Structure in Web Archives With Collections: Different Concepts From Web Archivists, TPDL ‘22 Padua, Italy. @HimarshaJ @WebSciDL
Navigational Hierarchy: Internet Archive (IA)
Navigational Hierarchy
Collection Landing Page
20
Type 2
Internet Archive’s (IA) user account web archives.
https://archive.org/details/@shawnmjones?tab=web-archive
Creating Structure in Web Archives With Collections: Different Concepts From Web Archivists, TPDL ‘22 Padua, Italy. @HimarshaJ @WebSciDL
Key Takeaways
● As web archives grow archivists create collections to make their web archives simpler
to comprehend and handle.
● Similarities among these collection structures:
○ Account-centric & General web archives.
○ Restrict a memento to a single collection or share mementos between
collections.
○ Attribute curation to a single entity or different organizational collaborators.
○ Most offer sub-collections & some offer embargo resources.
● Two types of navigational hierarchies:
○ Type 1: an original resource supports the collection’s theme.
○ Type 2: a memento supports the collection’s theme.
● We explored existing platforms rather than making recommendations on how a web
archive collection should be created.
21
Himarsha R. Jayanetti
hjaya002@odu.edu
@HimarshaJ
Technical Report at arXiv
Creating Structure in Web Archives With Collections: Different Concepts From Web Archivists, TPDL ‘22 Padua, Italy. @HimarshaJ @WebSciDL
Backup slides …
22
Creating Structure in Web Archives With Collections: Different Concepts From Web Archivists, TPDL ‘22 Padua, Italy. @HimarshaJ @WebSciDL
Different Web Archive Platform Have Different Names for Mementos
23
Webpage snapshots
Captures
Archived copies
https://haw.nsk.hr/en/publikacija/4818/
https://webarchive.nla.gov.au/collection/11676
https://www.loc.gov/item/lcwaN0023449/
Creating Structure in Web Archives With Collections: Different Concepts From Web Archivists, TPDL ‘22 Padua, Italy. @HimarshaJ @WebSciDL
Human-Readable TimeMaps (URI-Ts) Are Rendered
as a List or Calendar With Links to Each URI-M
List View
Calendar View
24
Creating Structure in Web Archives With Collections: Different Concepts From Web Archivists, TPDL ‘22 Padua, Italy. @HimarshaJ @WebSciDL
Trove’s Machine-readable TimeMap:
https://webarchive.nla.gov.au/bamboo-service/tep/{TEP_ID}
25
JSON Viewer
https://webarchive.nla.gov.au/bamboo-service/tep/33161
Creating Structure in Web Archives With Collections: Different Concepts From Web Archivists, TPDL ‘22 Padua, Italy. @HimarshaJ @WebSciDL
Details on Different Web Archive Platform Collection Structures
26

More Related Content

PPTX
2015-odu-ece-tools-for-past-web
PPTX
The Memento Protocol and Research Issues With Web Archiving
PPTX
The Many Shapes of Archive-It
PDF
MementoMap: A Web Archive Profiling Framework for Efficient Memento Routing
PPTX
Web archiving challenges and opportunities
PPTX
Storytelling With Web Archives
PDF
Improving Collection Understanding For Web Archives With Storytelling: Shinin...
PPTX
Combining Social Media Storytelling With Web Archives
2015-odu-ece-tools-for-past-web
The Memento Protocol and Research Issues With Web Archiving
The Many Shapes of Archive-It
MementoMap: A Web Archive Profiling Framework for Efficient Memento Routing
Web archiving challenges and opportunities
Storytelling With Web Archives
Improving Collection Understanding For Web Archives With Storytelling: Shinin...
Combining Social Media Storytelling With Web Archives

Similar to Creating Structure in Web Archives With Collections: Different Concepts From Web Archivists (20)

PDF
Web Archiving: A Brief Introduction
PPTX
Improving Understanding of Web Archive Collections Through Storytelling - PhD...
PPTX
Archiving Web-Based #musetech for Institutional Memory
PDF
Web Archiving: A Brief Introduction
PDF
The web is a mess: how I learnt to stop worrying and love web archiving. Kris...
PDF
Introducing Web Archiving and WSDL Research Group
PPTX
Improving Collection Understanding in Web Archives
PPT
Creating and Maintaining Web Archives
PDF
A Framework for Aggregating Private and Public Web Archives
PDF
A Framework for Aggregating Public and Private Web Archives
PPTX
Telling Stories with Web Archives
PDF
Memory-making and the emergent archive poster
PPTX
Perseverance on Persistence
PDF
Perseverance on Persistence by Herbert van de Sompel - EuropeanaTech Conferen...
PDF
Perseverance on persistence by Herbert Van de Sompel - EuropeanaTech Conferen...
PDF
Summarize Your Archival Holdings With MementoMap
PPTX
SAA 2014 session 703
PPTX
Web Archiving for University Records
PPTX
SAFETY NETS: RESCUE AND REVIVAL FOR ENDANGERED BORN-DIGITAL RECORDS- Program ...
PDF
TPDL 2016 Doctoral Consortium - Web Archive Profiling
Web Archiving: A Brief Introduction
Improving Understanding of Web Archive Collections Through Storytelling - PhD...
Archiving Web-Based #musetech for Institutional Memory
Web Archiving: A Brief Introduction
The web is a mess: how I learnt to stop worrying and love web archiving. Kris...
Introducing Web Archiving and WSDL Research Group
Improving Collection Understanding in Web Archives
Creating and Maintaining Web Archives
A Framework for Aggregating Private and Public Web Archives
A Framework for Aggregating Public and Private Web Archives
Telling Stories with Web Archives
Memory-making and the emergent archive poster
Perseverance on Persistence
Perseverance on Persistence by Herbert van de Sompel - EuropeanaTech Conferen...
Perseverance on persistence by Herbert Van de Sompel - EuropeanaTech Conferen...
Summarize Your Archival Holdings With MementoMap
SAA 2014 session 703
Web Archiving for University Records
SAFETY NETS: RESCUE AND REVIVAL FOR ENDANGERED BORN-DIGITAL RECORDS- Program ...
TPDL 2016 Doctoral Consortium - Web Archive Profiling
Ad

More from Himarsha Jayanetti (6)

PPTX
Infrastructure for Tracking Information Flow from Social Media to U.S. TV New...
PPTX
Evaluating Social Media Reach via Mainstream Media Discourse - CIKM '24 - PhD...
PPTX
Evaluating Social Media Reach via Mainstream Media Discourse
PPTX
Exploring Xenophobic Events through GDELT Data Analysis
PPTX
Supporting Account-based Queries for Archived Instagram Posts
PDF
Robots Still Outnumber Humans in Web Archives, But Less Than Before
Infrastructure for Tracking Information Flow from Social Media to U.S. TV New...
Evaluating Social Media Reach via Mainstream Media Discourse - CIKM '24 - PhD...
Evaluating Social Media Reach via Mainstream Media Discourse
Exploring Xenophobic Events through GDELT Data Analysis
Supporting Account-based Queries for Archived Instagram Posts
Robots Still Outnumber Humans in Web Archives, But Less Than Before
Ad

Recently uploaded (20)

PDF
Phytochemical Investigation of Miliusa longipes.pdf
PDF
Placing the Near-Earth Object Impact Probability in Context
PPTX
2. Earth - The Living Planet Module 2ELS
PPTX
cpcsea ppt.pptxssssssssssssssjjdjdndndddd
PPTX
Introduction to Fisheries Biotechnology_Lesson 1.pptx
PDF
Unveiling a 36 billion solar mass black hole at the centre of the Cosmic Hors...
PPTX
The KM-GBF monitoring framework – status & key messages.pptx
PDF
IFIT3 RNA-binding activity primores influenza A viruz infection and translati...
PPTX
INTRODUCTION TO EVS | Concept of sustainability
PDF
CAPERS-LRD-z9:AGas-enshroudedLittleRedDotHostingaBroad-lineActive GalacticNuc...
PPTX
Cell Membrane: Structure, Composition & Functions
PDF
Formation of Supersonic Turbulence in the Primordial Star-forming Cloud
PPTX
2Systematics of Living Organisms t-.pptx
DOCX
Q1_LE_Mathematics 8_Lesson 5_Week 5.docx
PDF
ELS_Q1_Module-11_Formation-of-Rock-Layers_v2.pdf
PDF
HPLC-PPT.docx high performance liquid chromatography
PPTX
Vitamins & Minerals: Complete Guide to Functions, Food Sources, Deficiency Si...
PDF
Cosmic Outliers: Low-spin Halos Explain the Abundance, Compactness, and Redsh...
PDF
Mastering Bioreactors and Media Sterilization: A Complete Guide to Sterile Fe...
PPT
The World of Physical Science, • Labs: Safety Simulation, Measurement Practice
Phytochemical Investigation of Miliusa longipes.pdf
Placing the Near-Earth Object Impact Probability in Context
2. Earth - The Living Planet Module 2ELS
cpcsea ppt.pptxssssssssssssssjjdjdndndddd
Introduction to Fisheries Biotechnology_Lesson 1.pptx
Unveiling a 36 billion solar mass black hole at the centre of the Cosmic Hors...
The KM-GBF monitoring framework – status & key messages.pptx
IFIT3 RNA-binding activity primores influenza A viruz infection and translati...
INTRODUCTION TO EVS | Concept of sustainability
CAPERS-LRD-z9:AGas-enshroudedLittleRedDotHostingaBroad-lineActive GalacticNuc...
Cell Membrane: Structure, Composition & Functions
Formation of Supersonic Turbulence in the Primordial Star-forming Cloud
2Systematics of Living Organisms t-.pptx
Q1_LE_Mathematics 8_Lesson 5_Week 5.docx
ELS_Q1_Module-11_Formation-of-Rock-Layers_v2.pdf
HPLC-PPT.docx high performance liquid chromatography
Vitamins & Minerals: Complete Guide to Functions, Food Sources, Deficiency Si...
Cosmic Outliers: Low-spin Halos Explain the Abundance, Compactness, and Redsh...
Mastering Bioreactors and Media Sterilization: A Complete Guide to Sterile Fe...
The World of Physical Science, • Labs: Safety Simulation, Measurement Practice

Creating Structure in Web Archives With Collections: Different Concepts From Web Archivists

  • 1. Creating Structure in Web Archives With Collections: Different Concepts From Web Archivists Presented By: Himarsha R. Jayanetti Department of Computer Science Old Dominion University, Norfolk, Virginia @HimarshaJ @WebSciDL @oducs TPDL ‘22, The 26th International Conference on Theory and Practice of Digital Libraries, Padua, Italy, 20 - 23 September 2022 Himarsha R. Jayanetti, Shawn M. Jones, Martin Klein, Alex Osbourne, Paul Koerbin, Michael L. Nelson, and Michele C. Weigle
  • 2. Creating Structure in Web Archives With Collections: Different Concepts From Web Archivists, TPDL ‘22 Padua, Italy. @HimarshaJ @WebSciDL Web Archives Preserve the Content of Web Pages as They Were at a Specific Point in Time Archived web pages, or mementos, are increasingly used by researchers, including journalists, social scientist, and historians. 2 A screenshot of the https://oduwsdl.github.io/ live web page URI-R: Original Resource URI-M: Memento A screenshot of the https://oduwsdl.github.io/ web page archived on 2020-11-18T23:04:53Z (Memento-Datetime) https://web.archive.org/web/20201118230453/https://oduwsdl.github.io/ URI-T: TimeMap https://web.archive.org/web/ */https://oduwsdl.github.io/
  • 3. Creating Structure in Web Archives With Collections: Different Concepts From Web Archivists, TPDL ‘22 Padua, Italy. @HimarshaJ @WebSciDL The Term “Web Archives” Mean the Same Thing Among These Eight Web Archive Platforms 3 “Old Dominion University Social Media” Collection at Archive-It Collections at the Internet Archive Webpage Snapshots Captures Archived copies
  • 4. Creating Structure in Web Archives With Collections: Different Concepts From Web Archivists, TPDL ‘22 Padua, Italy. @HimarshaJ @WebSciDL But There are Different Terms Used for “Memento” Among Web Archives 4 Captures Archived Copies Webpage Snapshots Memento “Old Dominion University Social Media” Collection at Archive-It Collections at the Internet Archive
  • 5. Creating Structure in Web Archives With Collections: Different Concepts From Web Archivists, TPDL ‘22 Padua, Italy. @HimarshaJ @WebSciDL The Term “Collection” Has Slightly Different Meanings Among Web Archives 5 Captures Archived Copies Webpage Snapshots Memento “Old Dominion University Social Media” Collection at Archive-It Collections at the Internet Archive
  • 6. Creating Structure in Web Archives With Collections: Different Concepts From Web Archivists, TPDL ‘22 Padua, Italy. @HimarshaJ @WebSciDL We Differentiated Collections From the Greater Web Archive (Web Archive as a Whole) 6 Greater Web Archive: Pandora Collection: Indigenous Australians https://pandora.nla.gov.au/subject/12 https://pandora.nla.gov.au/
  • 7. Creating Structure in Web Archives With Collections: Different Concepts From Web Archivists, TPDL ‘22 Padua, Italy. @HimarshaJ @WebSciDL Some Web Archive Collections Contain Sub-Collections 7 7 No Sub-collections Has Sub-collections * In conifer: lists acts as sub-collections. * https://archive-it.org/collections/2697 https://webarchive.nla.gov.au/collection/15003 Archive-It Trove
  • 8. Creating Structure in Web Archives With Collections: Different Concepts From Web Archivists, TPDL ‘22 Padua, Italy. @HimarshaJ @WebSciDL Curated By (Attribution): Single Entity, Different Organizational Collaborators, or the Greater Web Archive 8 8 Greater Web Archive Single account Organizational collaborators https://webarchive.nla.gov.au/collection/13842 https://archive-it.org/collections/7635
  • 9. Creating Structure in Web Archives With Collections: Different Concepts From Web Archivists, TPDL ‘22 Padua, Italy. @HimarshaJ @WebSciDL Some Web Archive Collections Support Private Collections 9 No Private Collections Support for Private Collections https://support.archive-it.org/hc/en-us/articles/208334003-Controlling-access-to-your-web-archives- https://conifer.rhizome.org/himarshaj
  • 10. Creating Structure in Web Archives With Collections: Different Concepts From Web Archivists, TPDL ‘22 Padua, Italy. @HimarshaJ @WebSciDL Some Platforms Embargo Some of Their Resources 10 Do not embargo resources Embargo resources https://www.webarchive.org.uk/en/ukwa/collection/3942 https://www.loc.gov/item/lcwaN0006607/
  • 11. Creating Structure in Web Archives With Collections: Different Concepts From Web Archivists, TPDL ‘22 Padua, Italy. @HimarshaJ @WebSciDL Navigational Hierarchies: How a Visitor or Crawler Navigates Each Collection for Information Type 1 Type 2 11 Type 1 collections are original resource focused. Type 2 collections are archived resource (memento) focused.
  • 12. Creating Structure in Web Archives With Collections: Different Concepts From Web Archivists, TPDL ‘22 Padua, Italy. @HimarshaJ @WebSciDL Navigational Hierarchy: Archive-It Navigational Hierarchy Collection Landing Page 12 Type 1
  • 13. Creating Structure in Web Archives With Collections: Different Concepts From Web Archivists, TPDL ‘22 Padua, Italy. @HimarshaJ @WebSciDL Navigational Hierarchy: Library of Congress (LC) Navigational Hierarchy Collection Items Page 13 Type 1
  • 14. Creating Structure in Web Archives With Collections: Different Concepts From Web Archivists, TPDL ‘22 Padua, Italy. @HimarshaJ @WebSciDL Navigational Hierarchy: Croatian Web Archive (HAW) Navigational Hierarchy Subcategory Landing Page 14 Type 1
  • 15. Creating Structure in Web Archives With Collections: Different Concepts From Web Archivists, TPDL ‘22 Padua, Italy. @HimarshaJ @WebSciDL Navigational Hierarchy: Conifer Navigational Hierarchy Collection Landing Page 15 Type 2
  • 16. Creating Structure in Web Archives With Collections: Different Concepts From Web Archivists, TPDL ‘22 Padua, Italy. @HimarshaJ @WebSciDL Navigational Hierarchy: United Kingdom Web Archive (UKWA) Navigational Hierarchy Collection Landing Page 16 Type 2
  • 17. Creating Structure in Web Archives With Collections: Different Concepts From Web Archivists, TPDL ‘22 Padua, Italy. @HimarshaJ @WebSciDL Navigational Hierarchy: National Library of Australia’s (NLA) Trove Navigational Hierarchy Collection Landing Page TEP Page 17 Type 2
  • 18. Creating Structure in Web Archives With Collections: Different Concepts From Web Archivists, TPDL ‘22 Padua, Italy. @HimarshaJ @WebSciDL Navigational Hierarchy: National Library of Australia’s (NLA) PANDORA Navigational Hierarchy Collection Landing Page 18 Type 1
  • 19. Creating Structure in Web Archives With Collections: Different Concepts From Web Archivists, TPDL ‘22 Padua, Italy. @HimarshaJ @WebSciDL Navigational Hierarchy: National Library of Australia’s (NLA) PANDORA and Trove 19
  • 20. Creating Structure in Web Archives With Collections: Different Concepts From Web Archivists, TPDL ‘22 Padua, Italy. @HimarshaJ @WebSciDL Navigational Hierarchy: Internet Archive (IA) Navigational Hierarchy Collection Landing Page 20 Type 2 Internet Archive’s (IA) user account web archives. https://archive.org/details/@shawnmjones?tab=web-archive
  • 21. Creating Structure in Web Archives With Collections: Different Concepts From Web Archivists, TPDL ‘22 Padua, Italy. @HimarshaJ @WebSciDL Key Takeaways ● As web archives grow archivists create collections to make their web archives simpler to comprehend and handle. ● Similarities among these collection structures: ○ Account-centric & General web archives. ○ Restrict a memento to a single collection or share mementos between collections. ○ Attribute curation to a single entity or different organizational collaborators. ○ Most offer sub-collections & some offer embargo resources. ● Two types of navigational hierarchies: ○ Type 1: an original resource supports the collection’s theme. ○ Type 2: a memento supports the collection’s theme. ● We explored existing platforms rather than making recommendations on how a web archive collection should be created. 21 Himarsha R. Jayanetti hjaya002@odu.edu @HimarshaJ Technical Report at arXiv
  • 22. Creating Structure in Web Archives With Collections: Different Concepts From Web Archivists, TPDL ‘22 Padua, Italy. @HimarshaJ @WebSciDL Backup slides … 22
  • 23. Creating Structure in Web Archives With Collections: Different Concepts From Web Archivists, TPDL ‘22 Padua, Italy. @HimarshaJ @WebSciDL Different Web Archive Platform Have Different Names for Mementos 23 Webpage snapshots Captures Archived copies https://haw.nsk.hr/en/publikacija/4818/ https://webarchive.nla.gov.au/collection/11676 https://www.loc.gov/item/lcwaN0023449/
  • 24. Creating Structure in Web Archives With Collections: Different Concepts From Web Archivists, TPDL ‘22 Padua, Italy. @HimarshaJ @WebSciDL Human-Readable TimeMaps (URI-Ts) Are Rendered as a List or Calendar With Links to Each URI-M List View Calendar View 24
  • 25. Creating Structure in Web Archives With Collections: Different Concepts From Web Archivists, TPDL ‘22 Padua, Italy. @HimarshaJ @WebSciDL Trove’s Machine-readable TimeMap: https://webarchive.nla.gov.au/bamboo-service/tep/{TEP_ID} 25 JSON Viewer https://webarchive.nla.gov.au/bamboo-service/tep/33161
  • 26. Creating Structure in Web Archives With Collections: Different Concepts From Web Archivists, TPDL ‘22 Padua, Italy. @HimarshaJ @WebSciDL Details on Different Web Archive Platform Collection Structures 26