How The Internet Archive Fights for Digital Preservation
How The Internet Archive Fights for Digital Preservation
The Internet Archive, founded in 1996 by Brewster Kahle, is the world's largest digital library, preserving billions of web pages, books, music, and videos.
What It Preserves
- 835+ billion web pages archived (Wayback Machine)
- 44 million+ books and texts
- 14 million+ audio recordings
- 10 million+ videos
- 1.4 million+ software programs
- 4 million+ TV news broadcasts
How It Works
Wayback Machine: Web crawlers archive web pages at regular intervals. Users can view how any website looked at any point in time since 1996.
Book digitization: 28 scanning centers worldwide. 3,000+ books scanned daily at Internet Archive headquarters in San Francisco.
Audio/Video: Live music archives, radio broadcasts, films, and educational content.
Legal Battles
Hachette v. Internet Archive (2023-2025): Major publishers sued over the National Emergency Library (lending digital copies during COVID). Court ruled against the Archive, ordered to stop lending copyrighted books.
Controlled Digital Lending: The Archive's model of lending one digital copy per physical copy owned. Legal status uncertain after the Hachette ruling.
US vs. Internet Archive: 2024 DOJ investigation into lending practices.
The Mission
"Universal Access to All Knowledge" — preserving humanity's cultural and intellectual heritage for future generations.
Funding
- Primarily donation-funded (small individual donations + foundation grants)
- Annual budget: ~$70 million
- 200+ employees
- Runs on open-source software
Why It Matters
- 40% of web pages disappear within a year
- Link rot: 70% of URLs in academic papers are broken within 20 years
- Cultural memory: Without preservation, digital culture is ephemeral
- Democratized access: Free, no-ads, no-paywall access to knowledge
Current Challenges
- Legal pressure from publishers and content owners
- Technical challenges of preserving dynamic web content
- Funding sustainability
- AI companies scraping Archive data without attribution
How to Help
- Donate at archive.org
- Volunteer for book scanning
- Upload content to the Archive
- Advocate for digital preservation legislation