Categories
Technology

Finding bad edits in the Open Library catalogue: ideas

Note: this post may be updated to accomodate other ideas or replace ideas. The Open Library catalogue can be edited by anyone. That became a problem when a wave of spam bots had found it last year. Now only users capable of deciphering the captcha can edit. There has always been some spam. I have […]

Categories
Technology

More Open Library

After some initial playing, I have started work on a VacuumBot for Open Library. It is supposed to be a general bot that can clean up some of the mess I found in the datadump of January 2012. By now I have compiled several lists of dirty data and key counts in this Gist. Among […]

Categories
Technology

Playing with the Open Library

Lately I’ve been playing (it’s not ‘working’ yet) with Open Library and its data. I’m even on the discuss and tech mailing lists, forked the GitHub repository and did a pull request. Why? Just like I use Discogs.com to keep track of my CDs, I thought I could use an online editable catalog for my […]