The Internet Archive Presented by Alex Craig
Brewster Kahle
  • Graduated from MIT in 1982 with a degree in Computer Science and Engineering
  • Studied artificial intelligence
  • Helped found Thinking Machines Corporation, a manufacturer of supercomputers using parallel processing
WAIS
  • Wide Area Information Server
  • Developed 1988
  • Early internet search software
  • Offered searching of the contents and databases of computer servers on the internet
  • Sold to AOL in 1995
Alexa Internet and the Internet Archive
  • In 1996, following the sale of WAIS to AOL, Kahle and partner Bruce Gilliat (of WAIS) founded both the non-profit Internet Archive and the for-profit Alexa Internet simultaneously
Alexa Internet
  • For-profit Internet toolbar
  • Tracks user browsing information to aid future internet searches
  • The toolbar makes an archive of each website as it is “crawled”, which is then donated to the Internet Archive
  • Sold to Amazon in 1999
The Internet Archive: The Wayback Machine
  • Archives “snapshots” of the Internet to create an “internet library”
  • Originally received copies mainly from the Alexa Internet service; now includes other sources of “donations”
  • Allows users to see archived versions of websites as they appeared in the past
  • Because the average lifetime of a website is 100 days, the snapshot is retaken every two months
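To make "seeing a website as it appeared in the past" concrete, here is a minimal sketch of how a request for an archived page can be addressed. The URL pattern and the availability endpoint are not taken from the slides; they are assumptions based on archive.org's publicly documented conventions, and the helper names are hypothetical. The sketch only builds addresses and makes no network calls.

```python
from datetime import datetime
from urllib.parse import urlencode

# Assumed public conventions of archive.org (not stated in the slides).
WAYBACK_PREFIX = "https://web.archive.org/web"
AVAILABILITY_ENDPOINT = "https://archive.org/wayback/available"


def wayback_timestamp(moment: datetime) -> str:
    """Format a datetime as the 14-digit YYYYMMDDhhmmss stamp used in Wayback URLs."""
    return moment.strftime("%Y%m%d%H%M%S")


def snapshot_url(page_url: str, moment: datetime) -> str:
    """Build the address of the capture of page_url requested for the given moment."""
    return f"{WAYBACK_PREFIX}/{wayback_timestamp(moment)}/{page_url}"


def availability_query(page_url: str, moment: datetime) -> str:
    """Build a query asking which capture lies closest to the given moment."""
    params = urlencode({"url": page_url, "timestamp": wayback_timestamp(moment)})
    return f"{AVAILABILITY_ENDPOINT}?{params}"


if __name__ == "__main__":
    when = datetime(2006, 1, 1)
    print(snapshot_url("example.com", when))
    print(availability_query("example.com", when))
```

Because snapshots are taken only periodically, a capture rarely exists for the exact moment requested; the service is expected to resolve the request to the nearest capture it holds.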
The Wayback Machine
  • Today, a single copy of everything that's on the Net (equal to 15,000 copies of Encyclopedia Britannica) is added every 2 months
  • The National Science Foundation, Library of Congress, Markle Foundation, Compaq, and Alexa all donate money, software, and equipment to keep the Internet Archive up and running
Internet Archive: Other Tools
  • Open Library: searchable database for books
  • Archive-It: fee-based subscription service that allows members to permanently archive their data
  • Media Collections: Moving Image, Audio, Live Music, and Text
Internet Archive as Library
  • Made an official library of the state of California in 2007
  • The Archive is now mirrored at the Bibliotheca Alexandrina in Egypt; this is the only external backup of the archive
Why is the Internet Archive Important?
  • Preservation of the past
  • “Digitized information, especially on the Internet, has such rapid turnover these days that total loss is the norm. Civilization is developing severe amnesia as a result… The Internet Archive is the beginning of a cure - the beginning of complete, detailed, accessible, searchable memory for society, and not just scholars this time, but everyone.” - Stewart Brand (founder of The Long Now Foundation)
Mission Statement
  • “Libraries exist to preserve society's cultural artifacts and to provide access to them. If libraries are to continue to foster education and scholarship in this era of digital technology, it's essential for them to extend those functions into the digital world… without cultural artifacts, civilization has no memory and no mechanism to learn from its successes and failures. And paradoxically, with the explosion of the Internet, we live in what Danny Hillis has referred to as our ‘digital dark age’. The Internet Archive is working to prevent the Internet - a new medium with major historical significance - and other "born-digital" materials from disappearing into the past… we are working to preserve a record for generations to come.” (from archive.org)
Controversies
  • Suzanne Shell (2005): demanded $100,000 from the Archive for archiving her website, profane-justice.org
  • The parties later settled, with the Archive offering this statement: “Internet Archive has no interest in including materials in the Wayback Machine of persons who do not wish to have their Web content archived”
Controversies
  • Healthcare Advocates, Inc. (2005): sued the Internet Archive, alleging violations of the Digital Millennium Copyright Act
  • Settled out of court
DMCA Exemption
  • A 2006 Library of Congress rulemaking grants an exemption to "computer programs and video games distributed in formats that have become obsolete and that require the original media or hardware as a condition of access, when circumvention is accomplished for the purpose of preservation or archival reproduction of published digital works by a library or archive".
"The Net is the No. 1 resource for people…this is how students learn, it's how business is done. If we don't have a memory, we're living in an Orwellian world of our own making." - Brewster Kahle


Internet Archive 2
