WAS to Archive-It Migration
Visualization of linking between websites of different languages, Babel 2012 project
Rosalie Lack
rosalie.lack@ucop.edu
Who?
WAS is …
• A service of University of California’s
California Digital Library
• 2004: Funded by National Digital Information
Infrastructure Preservation Program (NDIIPP)
• 2006: Launched with partner institutions
• 2009: Transition to subscription model
• 2015: 21 UC institutions; 12 external
Archive-It
A subscription service from the Internet
Archive, which allows institutions to build,
manage and search their own web archive
Over 300 partner orgs
in the U.S. and worldwide
www.archive-it.org
Why?
Flickr by Daniel Foster
Leon Brooks in Wikimedia Commons
Flickr by James
How?
By Mcconnmama
CUL-hosted Web Archiving Policies
and Practice in the US summit
“… an articulation of a small number of
model programs for web archiving, and
development of ‘best practices’ for
documenting program elements”
May 2012
Attendees: CDL, Columbia, CRL, Cornell, Duke, Georgetown, Frick,
Harvard, Indiana, IA, LC, Michigan, North Texas, NYU, Sloan, Stanford,
UC Irvine, UT Austin, Virginia Tech https://webarch.cul.columbia.edu/
CDL-hosted meeting
“… more robust collaboration was desirable in
order to collectively address these challenges
[research use, intensive resource
requirements, the pace of change, fragmented
collection development, etc.] and went so far
as to brainstorm the benefits and risks of an
all-in, formal association”
June 2014
Attendees: CDL, Columbia, George Washington, Harvard, IA, LC,
North Texas, Stanford http://bit.ly/1N1GgGj
Collections/Access/QA
Opportunities
• Federation/aggregation/collocation
• Collaboration on collection development
• Crowd sourced selection and QA
• Education and advocacy
• Create and endorse policies, best
practices and standards
Supporting research
Opportunities
• Outreach
• Pilot projects
• Computational analysis tools
• Tools, tools, tools
Technology
Opportunities
• Shared infrastructure/operations
• Data capture tools
• Collaborate on API development
• Preservation solutions
• Tools, tools, tools
Steps toward collaboration: Community
Principles for Web Archiving at Scale
“… a lightweight structure by which web
archiving institutions can work collectively in
order to achieve significant functional goals
and operational efficiencies that they are
unlikely to achieve individually”
September 2014
CDL, Columbia, George Washington, Harvard, IA, LC, North Texas,
Stanford http://bit.ly/1NoB2l1
“…rely on external service
providers whenever possible,
and restrict local efforts to
areas in which institutions
can uniquely add value.”
Value-added services locally
or collaboratively developed
Next Steps
• Complete the migration
• Conduct user research into researcher
needs
• Define, build and share APIs to meet
specialized needs
• Explore feasibility of a national
collaborative model for web archiving
• Continue to look for funding opportunities
to help facilitate this effort (IMLS 2016)
Flickr by stu_spivack
Questions?
Rosalie Lack
rosalie.lack@ucop.edu
Potential architecture

SAA 2015 Web Archiving Roundtable

Editor's Notes

  • #7: Photo credit: Daniel Foster https://www.flickr.com/photos/danielfoster/7587227518/
  • #8: Photo credit: Leon Brooks [Public domain], via Wikimedia Commons https://commons.wikimedia.org/wiki/File%3ASeedlings.jpg
  • #9: Photo credit: Flickr by James https://www.flickr.com/photos/89422442@N03/ https://www.flickr.com/photos/89422442@N03/9071938319/in/photolist-ePE4cD-7Zrf2z-6oHEHH-ePRkSJ-haocsM-uxYQfA-u62iNY-tqApzb-tqAn1d-tqL5dz-tqAjR3-u62jiA-u62pNU-unB3XZ-tqL5ZK-qMQmxA-24qDcG-q8oq5j-qMYdqz-685V4w-5CFCkA-7AG9Z3-8Q7FBD-8QiTTY-2dKnx-aE4MUA-f81vjy-q1VpHF-8pmXpK-whjS9g-aeHa7H-85Je4q-56aAfr-q58g5W-8xye4p-7gy32Y-5B6z88-uxYPaj-nKMHEd-bW1hqc-aPrwEH-deTybZ-8WBUHi-7gu6JX-akTSeo-hzQ13r-aaXQXz-bhBbe8-f9f2Ww-dPv6be/
  • #11: Photo credit: Mcconnmama https://pixabay.com/get/ac9ef78ee9edbf81e461/1439408348/boy-358300_1280.jpg?direct
  • #21: Photo credit: stu_spivack https://www.flickr.com/photos/stuart_spivack/ https://www.flickr.com/photos/stuart_spivack/183088194/in/photostream/