Get the URLs of all 4chan.org textboards (dis.) and scrap and archive them. Use archive.org
Name:
Anonymous2014-04-19 21:44
I have a fair amount of data that was collected last august. We can split up the boards and collect them using a well tested script. Preferrably using the json api to reduce the burden on their servers. Also, do it slowly.
I have copies of threads from newpol, newnew, lang, lounge, and sjis in html