Return Styles: Pseud0ch, Terminal, Valhalla, NES, Geocities, Blue Moon. Entire thread

*takes a moment of silence to remember the old /prog/*

Name: Anonymous 2013-09-03 1:32

[rem][/rem]

Name: Anonymous 2013-09-04 9:11

>>40
Always, Anonymous-san. Always.

Name: Birds &‮ seeB ‭ 2013-09-04 10:40

>>37,39,40
No, read the history:
https://en.wikipedia.org/wiki/.xz
http://tukaani.org/xz/format.html
It comes from 7zip, scaled down only for compression.

Name: Anonymous 2013-09-05 15:21

>>26
/sports/ has some very autistic dedicated users.

Name: Anonymous 2013-09-06 3:42

I'm done scraping lounge. That took a while. I don't know why though, it was only 591MB of data uncompressed. Maybe they added a throttle or something.

Which one should I scrape now?

Name: Anonymous 2013-09-06 3:58

>>44
your mom

Name: Anonymous 2013-09-06 4:30

>>44
I got that too while scraping /sports/; it wrecked the whole scrape. I upped the delay to five seconds and everything goes through. I think it was cloudflare shenanigans.

Name: Anonymous 2013-09-06 4:48

Just curious, are you guys storing the boards as unmodified html? Or are you doing any non-destructive processing before archiving?

Name: Anonymous 2013-09-06 5:02

Reposting this script, because it was posted in a deleted thread. It just saves all threads under a board as files in a directory.

#!/bin/sh

script=`basename $0`
status=0
board=$1
num_threads=$2
delay=$3

if [ -z "${board}" ]
then
echo "${script}: Usage: ${script} board [ num-threads [ delay ] ]"
exit 1
fi

if [ -z ${delay} ]
then
# 2 seconds is generous enough
delay=2
fi

mkdir ${board}

wget https://dis.4chan.org/${board}/subject.txt
if [ $? -ne 0 ]
then
echo "Couldn't get the subject page."
exit 1
fi

if [ -z "${num_threads}" ]
then
sed 's/[^<]*<>[^<]*<>[^<]*<>\([^<]*\)<>.*/\1/' < subject.txt > threads.tt
else
# head gets the top num_threads threads
# sed extracts the thread number
head -n ${num_threads} subject.txt \
| sed 's/[^<]*<>[^<]*<>[^<]*<>\([^<]*\)<>.*/\1/' \
> threads.tt
fi

# check the validity of the thread number. It should be all digits.
# this also protects against shell injection.
grep '^[0-9]\+$' threads.tt > wellformed-threads.tt
grep -v '^[0-9]\+$' threads.tt > non-wellformed-threads.tt

for thread in `cat wellformed-threads.tt`
do
echo "Downloading thread ${board}/${thread}:"
wget https://dis.4chan.org/read/${board}/${thread} -O ${board}/${thread}.html
if [ $? -ne 0 ]
then
echo "Error downloading thread ${board}/${thread}" | tee -a errors.tt
fi
sleep ${delay}
done

if [ -s non-wellformed-threads.tt ]
then
echo "These threads could not be downloaded because the subject was messed up."
cat non-wellformed-threads.tt
echo
status=1
fi

if [ -s errors.tt ]
then
echo "These threads could not be downloaded."
cat errors.tt
echo
status=1
fi

exit ${status}

Name: Anonymous 2013-09-06 5:18

I'm scraping /sjis/ now. It looks smaller, so it shouldn't take long.

Name: Anonymous 2013-09-06 6:09

ok /sjis/ is done. Now moving on to /lang/

Name: Anonymous 2013-09-07 0:48

If you guys have the time, please remember to make really bad shitpost on /pro/g//, and keep anime threads bumped.

Name: Anonymous 2013-09-07 2:18

>>52
Cute! Please keep posting!

Name: Anonymous 2013-09-08 14:27

>>51
Why? It's not like we hold a grudge against our own past. It could also backfire, you could become a non-ironical shitposter, always post there and never come back, which is something I (we?) wouldn't want to happen.

Name: Anonymous 2013-09-15 20:03

Name: Anonymous 2013-09-15 20:45

>>55
Who the fuck keeps making those shitposts? Is it just one /g/ro?

Name: Anonymous 2013-09-15 21:32

Name: Anonymous 2013-09-15 21:37

>>57
The old /prog/'s corpse has been shot by j00t once again.

Name: Anonymous 2013-09-15 22:54

>>57,58
Why would they do that?

Name: Anonymous 2013-09-15 23:50

>>1
Solemn, eyes front
The Last Post echoing in my ears
One last time
*grabs dick*

Name: Anonymous 2013-09-15 23:56

This confirms the status of the old /prog/:
http://dis.4chan.org/read/prog/1379279958/23

It links to an image. Try not to laugh.

Name: Anonymous 2013-09-16 0:06

>>61
Maybe /prog/'s EXPERTISE rubbed onto them (in a safe and consensual manner).

Name: Anonymous 2013-09-16 0:13

>>62
[m]Your instructor is Hal FUCKING Abelson[/m]
Maybe not.

Name: Anonymous 2013-09-16 0:15

>>63
Oh, right, I missed that. They've been drawn to the Anticudder.

Name: Anonymous 2013-09-16 2:32

>>61
http://dis.4chan.org/read/prog/1379279958/29
seems to be in the right path.

Name: Anonymous 2013-09-17 19:39

Name: Anonymous 2013-09-18 19:57

>>66
That's pretty sad. I haven't been back to world4ch since Admin made this board, and if that's what it's like, I'm not missing much.

Name: Anonymous 2013-09-18 22:04

>>67
world4ch is pretty scary right now. Someone (or maybe a couple of people) are seriously just shitposting and manually spamming the board, constantly, and have been doing so for weeks now.

What kind of insanity do you have to fall into where that is what becomes of your life? Terrifying.

Newer Posts
Don't change these.
Name: Email:
Entire Thread Thread List