Posted by Craige McWhirter on Tue Feb 2 15:01:17 2016
Last edited Tue Feb 2 05:07:04 2016
VM Brasseur gave the most revealing and informative talk I've attended thus far.
A brief tour of what the Internet Archive does:
Anonymise IP addresses
Run the Wayback Machine
Archive for paying customers
Donated crawled content
Deep crawl on popular sites
Broad shallow crawl of known domains
Targeted crawls.
On-demand crawling from the web page, so you can archive a page before it's
pulled from the net.
Scan 1000 books per day
30 scanning centres around the world.
Runs the Open Library for books, video and TV.
TV is now a citable resource
Live music archive
Includes every live show of the Grateful Dead.
Video game archive of classic games playable in the browser.
Will save anything digital, for free, publicly.
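The Wayback Machine and on-demand crawling items above can be driven from a script. This is a minimal sketch, assuming the public availability endpoint (`https://archive.org/wayback/available`) and the "Save Page Now" URL form (`https://web.archive.org/save/<url>`) behave as I understand them. Neither was shown in the talk, and the helper names are my own.

```python
import json
import urllib.parse
import urllib.request

# Assumed endpoints; they are not taken from the talk itself.
AVAILABILITY_ENDPOINT = "https://archive.org/wayback/available"
SAVE_PREFIX = "https://web.archive.org/save/"


def availability_url(page_url):
    """Build the query URL asking whether page_url has a snapshot."""
    return AVAILABILITY_ENDPOINT + "?" + urllib.parse.urlencode({"url": page_url})


def save_url(page_url):
    """Build the URL that asks the Wayback Machine to crawl page_url now."""
    return SAVE_PREFIX + page_url


def closest_snapshot(response):
    """Return the closest snapshot URL from a decoded response, or None."""
    closest = response.get("archived_snapshots", {}).get("closest")
    if closest and closest.get("available"):
        return closest["url"]
    return None


def lookup(page_url):
    """Network call: fetch the availability response and extract the snapshot."""
    with urllib.request.urlopen(availability_url(page_url), timeout=30) as resp:
        return closest_snapshot(json.load(resp))
```

Only `lookup` touches the network. The other helpers just build URLs or read a decoded response, so they can be tried offline.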
Open Library API
Returns JSON (XML also available)
Evergreen and Koha are FOSS library applications that use the Open Library API.
"Do We Want It?" API to see what books the library needs.
There is an advanced search API too.
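To give a feel for the JSON side of the Open Library API, here is a small sketch of a book lookup by ISBN. It assumes the Books API endpoint at `https://openlibrary.org/api/books` with the `bibkeys`, `format` and `jscmd` parameters, which is how I understand the public API to work rather than something shown in the talk. The function names are hypothetical.

```python
import json
import urllib.parse
import urllib.request

# Assumed endpoint for the Open Library Books API.
BOOKS_ENDPOINT = "https://openlibrary.org/api/books"


def books_url(isbn):
    """Build a Books API URL asking for JSON data about one ISBN."""
    params = {"bibkeys": "ISBN:" + isbn, "format": "json", "jscmd": "data"}
    return BOOKS_ENDPOINT + "?" + urllib.parse.urlencode(params)


def book_title(response, isbn):
    """Pull the title out of a decoded response, or None if the ISBN is absent."""
    record = response.get("ISBN:" + isbn)
    return record.get("title") if record else None


def fetch_title(isbn):
    """Network call: look up an ISBN and return its title."""
    with urllib.request.urlopen(books_url(isbn), timeout=30) as resp:
        return book_title(json.load(resp), isbn)
```

Calling `fetch_title("0451526538")` should return a title if the service knows that ISBN. The response is keyed by the same `ISBN:...` string that was sent in `bibkeys`.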
IAS3 API
Drop in replacement for the Amazon S3 API.
NASA use the API. All the audio, video and photos are available on the
Internet Archive.
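Because IAS3 mimics the S3 API, an upload is essentially an HTTP PUT to a bucket-style URL. The sketch below only builds the request and does not send it. It assumes the `https://s3.us.archive.org` endpoint, the `LOW accesskey:secret` authorization form and the `x-amz-auto-make-bucket` / `x-archive-meta-*` headers as I recall them from the Internet Archive's own documentation. The item name, keys and function name are placeholders.

```python
import urllib.request

# Assumed IAS3 endpoint; check the Internet Archive documentation before use.
IAS3_ENDPOINT = "https://s3.us.archive.org"


def upload_request(identifier, filename, data, access_key, secret_key, title):
    """Build (but do not send) a PUT request that uploads data to an item."""
    headers = {
        # Assumed authorization scheme: "LOW <access key>:<secret key>".
        "authorization": "LOW {}:{}".format(access_key, secret_key),
        # Ask the service to create the item if it does not exist yet.
        "x-amz-auto-make-bucket": "1",
        # Item metadata travels as x-archive-meta-* headers.
        "x-archive-meta-title": title,
    }
    url = "{}/{}/{}".format(IAS3_ENDPOINT, identifier, filename)
    return urllib.request.Request(url, data=data, headers=headers, method="PUT")
```

Passing the result to `urllib.request.urlopen` would perform the upload, given real keys in place of the placeholders.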
ia-wrapper is a Perl client to access the API.
There is a Python client: internetarchive.
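For a taste of the Python client, here is a minimal sketch that looks up an item and prints a one-line summary. It assumes the package is installed (`pip install internetarchive`) and relies on its `get_item` function and the item's `metadata` mapping. The `summarise` and `describe_item` helpers are my own and not part of the library.

```python
def summarise(metadata):
    """Format a one-line summary from an item's metadata mapping."""
    title = metadata.get("title", "(untitled)")
    mediatype = metadata.get("mediatype", "unknown")
    return "{} [{}]".format(title, mediatype)


def describe_item(identifier):
    """Network call: look up an item on archive.org and summarise it."""
    # Imported lazily so the pure helper above works without the package.
    from internetarchive import get_item

    item = get_item(identifier)
    return summarise(item.metadata)
```

`describe_item` takes an archive.org item identifier. `summarise` can be tried on any plain dictionary without touching the network.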
This is the talk that has blown my mind so far. This ought to have been a
keynote. So very glad I made it. I highly recommend watching the video when it
comes out. This was an amazing talk.
Tags: LCA, LCA2016, Open Knowledge