
To download the 527 PDF files, use the file 2009_books.txt.
It lists their bookids (useful in particular for submissions) and the corresponding URLs for downloading each PDF file from the Internet Archive (for simplicity, you may fetch them with a tool such as wget).
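
As an alternative to wget, the download can be sketched in Python. The layout of 2009_books.txt is an assumption here: one book per line, with the bookid as the first field and the URL as a later field, separated by whitespace, commas, or semicolons. The function names and the output directory "pdf" are hypothetical; adjust the parsing if the file is laid out differently.

```python
import re
import urllib.request
from pathlib import Path


def parse_book_list(lines):
    """Yield (bookid, url) pairs from the lines of the list file.

    Assumed layout (not confirmed by the README): bookid first, then a
    field starting with "http". Lines without a URL are skipped.
    """
    for line in lines:
        tokens = [t for t in re.split(r"[\s,;]+", line.strip()) if t]
        urls = [t for t in tokens if t.startswith("http")]
        if not tokens or not urls:
            continue
        yield tokens[0], urls[0]


def download_all(list_path="2009_books.txt", out_dir="pdf"):
    """Download every listed PDF into out_dir, named after its bookid."""
    out = Path(out_dir)
    out.mkdir(exist_ok=True)
    with open(list_path, encoding="utf-8") as handle:
        for bookid, url in parse_book_list(handle):
            target = out / f"{bookid}.pdf"
            if target.exists():
                continue  # already fetched; lets an interrupted run resume
            urllib.request.urlretrieve(url, str(target))
```

Calling download_all() from the directory that contains 2009_books.txt fetches the files one at a time and skips any that are already present.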

To download the OCRed files, simply replace ".pdf" with "_djvu.xml" in each URL.
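
The substitution can be sketched as a small Python helper. The function name ocr_url is hypothetical, and the URL in the usage line is a placeholder rather than a real entry from the list; the helper assumes each URL ends in ".pdf" and only rewrites that ending.

```python
def ocr_url(pdf_url):
    """Return the URL of the OCR file that corresponds to a PDF URL.

    Only the trailing ".pdf" is replaced, so a ".pdf" appearing
    elsewhere in the URL is left untouched.
    """
    suffix = ".pdf"
    if not pdf_url.endswith(suffix):
        raise ValueError(f"not a PDF URL: {pdf_url}")
    return pdf_url[: -len(suffix)] + "_djvu.xml"


# Placeholder URL for illustration only.
print(ocr_url("http://example.org/somebook/somebook.pdf"))
```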
