VM Brasseur gave
the most revealing and informative talk I've attended thus far.
A brief tour:
- Anonymise IP addresses
- Run the Wayback Machine
- Archive for paying customers
- Donated crawled content
- Deep crawl on popular sites
- Broad shallow crawl of known domains
- Targeted crawls.
- On demand crawling from the webpage - so you can archive before it's pulled
from the net.
- Scan 1000 books per day
- 30 scanning centres around the world.
- Runs the Open Library for both books, video and
- TV is now a citable resource
- Live music archive
- Includes every live show of the Grateful Dead.
- Video game archive of classic games playable in the browser.
- Will save anything digital, for free, publicly.
Open Library Api
- Returns JSON (XML also available)
- Evergreen and Koha are FOSS library applications that use the Open Library API.
- "Do We Want It?" API to see what books the library needs.
- There is an advanced search API too.
- Drop in replacement for the Amazon S3 API.
- NASA use the API. All the audio, video and photos are available on the
- ia-wrapper is a Perl client to access the API.
- There is a Python client:
This is the talk that has blown my mind so far. This ought to have been a
keynote. So very glad I made it. I highly recommend watching the video when it
comes out. This was an amazing talk.