Internet Archive

From District 49 Community Wiki
Jump to: navigation, search

Internet Archive is a non-profit digital library with the stated mission of "universal access to all knowledge."[1][2] It provides permanent storage of and free public access to collections of digitized materials, including websites, music, moving images, and nearly three million public-domain books. As of October 2012, its collection topped 10 petabytes.[3][4] In addition to its archiving function, the Archive is an activist organization, advocating for a free and open Internet.

The Internet Archive allows the public to upload and download digital material to its data cluster, but the bulk of its data is collected automatically by its web crawlers, which work to preserve as much of the public web as possible. Its web archive, The Wayback Machine, contains over 150+ billion web captures.[5][6] The Archive also oversees one of the world's largest book digitization projects.

Founded by Brewster Kahle in 1996, the Archive is a 501(c)(3) non-profit operating in the United States. It has an annual budget of $10 million, derived from a variety of sources: revenue from its Web crawling services, various partnerships, grants, donations, and the Kahle-Austin Foundation.[7] Its headquarters are in San Francisco, Calif., where about 30 of its 200 employees work. Most of its staff work in its book-scanning centers. The Archive has data centers in San Francisco, Redwood City, and Richmond, all in California. Its collection is mirrored for stability and endurance at the Bibliotheca Alexandrina in Egypt.[8]

The Archive is a member of the International Internet Preservation Consortium.[9] This non-profit digital library was officially designated as a library by the State of California in 2007.[10]


  1. "Internet Archive Frequently Asked Questions". Internet Archive. Retrieved April 13, 2013. 
  2. "Internet Archive: Universal Access to all Knowledge". Internet Archive. Retrieved April 13, 2013. 
  3. "10,000,000,000,000,000 bytes archived!". Internet Archive Blogs. October 26, 2012. "On Thursday, 25 October, hundreds of Internet Archive supporters, volunteers, and staff celebrated addition of the 10,000,000,000,000,000th byte to the Archive’s massive collections." 
  4. Brown, A. (2006). Archiving websites: A practical guide for information management professionals. London: Facet Publishing. p. 9.
  5. "Internet Archive: Projects". Internet Archive. Retrieved March 6, 2013. 
  6. Grotke, A. (December 2011). "Web Archiving at the Library of Congress". Computers In Libraries, v.31 n.10, p. 15-19. Information Today.
  7. Womack, David (Spring 2003). "Who Owns History?". Cabinet Magazine (10). 
  8. "Donation to the new Library of Alexandria in Egypt"; Alexandria, Egypt; April 20, 2002. Bibliotheca Alexandrina. Internet Archive.
  9. Template:Wayback International Internet Preservation Consortium.
  10. "Internet Archive officially a library", May 2, 2007. Internet Archive