Get the URLs of all 4chan.org textboards (dis.) and scrap and archive them. Use archive.org
Name:
Anonymous2014-04-20 7:26
/lounge/ has been archived. Scraping went much more smoothly. There was only one instance of malformed html. /prog/ was infested with it. Either someone pwned world4ch and was inserting raw data into the database, or someone with access to it was messing around. There are poster dates from 1969. Stuff like that is hard to explain otherwise.