After some initial playing around, I have started work on a VacuumBot for Open Library. It is meant to be a general-purpose bot that can clean up some of the mess I found in the data dump of January 2012. By now I have compiled several lists of dirty data and key counts in this Gist. Among […]