
A project

Name: Anonymous 2014-04-19 21:23

Get the URLs of all 4chan.org textboards (dis.) and scrape and archive them. Use archive.org.

Name: Anonymous 2014-07-15 0:25

To multiquote, [br] was used.

Name: Anonymous 2014-07-15 4:38

>>81
I'm so fucking confused. What does [br] do? Just inline another <br/>? I wish I had a list of bbcode inputs and html outputs to refer to. This project became a nightmare as soon as I entered the actual frontend of shiichan. Oh, and I've implemented the over-1000 thread thing. If you look at the source of the threads with over 1000 posts, the post form is still there and you can unhide it by editing the html with firebug. I've seen over-1000 necros from as late as 2013, so there was some way to bypass the limit in shiichan. My current implementation hides the post form after 1000 posts and actually stops accepting posts at 1111 posts, so you can get quints with a script if you want to. Maybe I'll introduce some randomness to get the same mysteriousness as shiichan.
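
The two thresholds described above could be sketched roughly like this. The numbers 1000 and 1111 come from the post; the function and constant names are hypothetical, and whether the 1111th post itself is accepted is an assumption.

```python
# Thresholds taken from the post above; all names here are hypothetical.
FORM_HIDDEN_AFTER = 1000  # the post form is hidden once a thread has this many posts
HARD_LIMIT = 1111         # no posts are accepted once a thread has this many posts


def form_visible(post_count: int) -> bool:
    """True while the reply form should still be rendered."""
    return post_count < FORM_HIDDEN_AFTER


def accepts_post(post_count: int) -> bool:
    """True while the server should still accept a new reply.

    Assumption: the 1111th post itself is accepted, so the check is made
    against the number of posts already in the thread.
    """
    return post_count < HARD_LIMIT
```

Keeping the two checks separate is what leaves the window between 1000 and 1111 open to anyone who submits the form by hand or by script.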

Name: Anonymous 2014-07-15 9:17

>>81
No, it is just several >

>>82
> What does [br] do?
It used to insert a line BReak. That was really retarded.

Name: Anonymous 2014-07-15 20:42

>>82
I don't follow the thread, I just look at the latest replies, so I hope I'm not off-topic here. I think you are discussing old /prog/'s bbcode, right? In which case, to multiquote, the technique used was this:
> foo[br]bar

which would appear as
foo
bar

By the way, to escape bbcode, there's [#][/#] (which I just used twice to show to you once).
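
A minimal sketch of how a renderer might handle that multiquote trick, assuming a line starting with > becomes one quote block and each [br] inside it becomes a line break. The <blockquote> output and the function name are assumptions; the thread does not record the HTML that shiichan actually emitted.

```python
from html import escape


def render_line(line: str) -> str:
    """Render one line of post text, treating [br] as a line break.

    Assumed markup: quoted lines are wrapped in <blockquote>, and [br]
    becomes <br/>. Everything else is HTML-escaped.
    """
    is_quote = line.startswith(">")
    body = line[1:].lstrip() if is_quote else line
    # Escape each piece separately so the inserted <br/> tags survive.
    html = "<br/>".join(escape(part) for part in body.split("[br]"))
    return "<blockquote>" + html + "</blockquote>" if is_quote else html
```

With this sketch, "> foo[br]bar" ends up as a single quote block containing foo and bar on separate lines, which matches the behaviour described above.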

Name: Anonymous 2014-07-15 21:47

>>84
> I don't follow the thread, I just look at the latest replies, so I hope I'm not off-topic here.
The thread's at a mere eighty replies; your time can't be that precious.

Name: Anonymous 2014-07-16 7:24

>>84
Thanks for that explanation of br.

Name: test 2014-07-16 10:59

[spoiler]test[/spoiler]

Name: Anonymous 2014-07-16 13:02

I've noticed some weird spans in the html source. They don't appear to do anything. Does anyone remember the bbcode tags for them?

<span class="abbc">
<span class="math">

There's <span class="code"> too but I think this was [code ] before 2006.

Name: Anonymous 2014-07-16 15:06

bbcode works now. backlinks and hyperlinks are next.

Name: Anonymous 2014-07-16 15:13

>>88
Is the [math] span from /sci/? That's for TeX markup, though I don't recall it ever being used on /prog/.

ABBC is the name of the BBCode library.

Name: Anonymous 2014-07-17 17:27

I need help, /prog/. How can I write a regular expression in Python that will find all matches for N regular expressions and perform a customized replacement for each given expression? I can't run the substitutions one after the other, because the replaced strings would be matched again by the later ones.

tl;dr: I need to lex in Python. How can I do that using only the standard library?
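
One common standard-library approach, sketched here under assumptions: join the N patterns into a single alternation with one named group per rule, then let re.sub call a function that dispatches on match.lastgroup. The input is scanned once, so replacement output is never matched again. The rule set, the <span class="spoiler"> markup, and all names below are hypothetical and not taken from the project.

```python
import re
from html import escape

# Hypothetical rules, tried in this order at each position.
# The patterns contain no inner groups, so lastgroup is always the rule name.
TOKEN_RULES = [
    ("spoiler", r"\[spoiler\].*?\[/spoiler\]"),
    ("url", r"https?://[^\s<>\[\]]+"),
    ("special", r'[&<>"]'),
]


def render_spoiler(text: str) -> str:
    inner = text[len("[spoiler]"):-len("[/spoiler]")]
    return '<span class="spoiler">' + escape(inner) + "</span>"


def render_url(text: str) -> str:
    safe = escape(text)
    return '<a href="' + safe + '">' + safe + "</a>"


HANDLERS = {"spoiler": render_spoiler, "url": render_url, "special": escape}

# One combined pattern: (?P<spoiler>...)|(?P<url>...)|(?P<special>...)
MASTER = re.compile(
    "|".join("(?P<%s>%s)" % (name, pattern) for name, pattern in TOKEN_RULES),
    re.DOTALL,
)


def render(text: str) -> str:
    """Replace every token in a single left-to-right pass."""
    def dispatch(match):
        # lastgroup names the rule that produced this match.
        return HANDLERS[match.lastgroup](match.group())
    return MASTER.sub(dispatch, text)
```

Because stray HTML characters are a rule of their own, anything not claimed by a markup rule is still escaped in the same pass. Nested tags are not handled by this sketch; a spoiler body is escaped as plain text.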

Name: Anonymous 2014-07-17 22:46

>>91
why do python programmers think everything about the world changes when using python?

I can never tell if the posts about "<extremely common thing> in python" are trying to appeal to people who think anything written in python is somehow better or if they're just proud of themselves for getting anything done in that shitheap of a language.

tl;dr
- lexing with regex = two problems, like everywhere else (except perl 6)
- stop being a baby and lex how you would lex in any other language

Name: Anonymous 2014-07-17 23:45

[math] was a tag that just put the text in a <div>. If jsMath hadn't been totally broken because of an undefined variable (if I recall correctly, it loaded before the page body, thus had nothing to work with) it would have done interesting things. It worked fine in my unreleased experimental SuperW4ch userscript, where I loaded the imagereddit jsmath script instead.

Name: Anonymous 2014-07-18 0:23

>>92
If I were writing in C, I would lex using a state machine generated with lex or flex. But I'm using Python instead of C. I could write the state machine myself in Python, but it would be slow as fuck to do that kind of processing at the script level. The only thing I can think of would be to use multiple regular expressions, perform multiple passes, and craft the expressions so they somehow don't substitute within each other, but that's difficult and opens doors for HTML injection if things go wrong. It's frustrating to have a library that can do regex but can't do this conveniently.

Name: Anonymous 2014-07-18 0:31

Someone got the kopipe for parsing HTML with regex?

Name: Anonymous 2014-07-18 0:42

>>94
so emit the fsm in c and use ffi

Name: Anonymous 2014-07-18 6:11

>>96
I took the easy route and decided to only detect hyperlinks beginning with a protocol. The range expressions are functioning as closely as I could emulate. I'm going to take a break from development for a while. If you have any changes you'd like to put in, post a diff here or somewhere and I'll merge it in when I see it. You are also welcome to pull the code off the site and host your own. I believe the site is feature complete except for the subject page and json feed. I have no plans to implement a moderation interface, and no IP addresses or identities are recorded in the database. IP addresses still appear in logs though, so be aware of that. I'm not looking, but someone else might.

Name: TPOBCI 2014-07-19 17:38

The Pleasure Of Being Cummed Inside.

Name: TPOBCI 2014-07-19 17:49

The Pressure Of Being Cummed Inside.

Name: Anonymous 2014-07-20 10:40

The Pleasure of Getting Dubs.

Name: Anonymous 2014-07-24 7:14

I don't know why you're revamping Shiichan. You should just improve Kareha or Tablecat BBS.

Name: Anonymous 2015-07-23 21:24

fyi heliohost took away the sqlite3 module so w5ch.heliohost.org is broken until that dependency is removed. The admin user name is ``w5ch''. You can type that in to reactivate the account if it has been suspended from me not logging in once a month and looking at heliohost's advertisements in the admin interface.

Name: Anonymous 2015-07-24 9:40

>>34
420.db.xz
SMOKE WEED EVERYDAY

Name: Anonymous 2015-07-26 19:33

It appears that world4ch has been completely removed now. dis.4chan.org no longer resolves.

Thanks for making these archives.

Name: some ass 2015-10-05 17:20

>>62
bring this back please,

are you on the IRC?

Name: Anonymous 2015-10-05 18:47

There was a thread on /vip/ I was going to bump today. OP said he would check back in 3 years to see if his thread was on top. Now he's just going to see that the board is gone ;_;

Name: Anonymous 2015-10-05 19:19

>>106
damn that is a sad story for OP

Name: Anonymous 2015-10-06 6:56

>>106
That's horrible.

Name: Anonymous 2015-11-12 16:23

I'm having trouble setting this up. It worked for the default SJIS db but not the progrider one.

Name: Anonymous 2015-11-12 16:28

individual threads work, e.g. http://localhost:5000/thread/1280369732, but the titles don't

Name: Anonymous 2015-11-12 21:11

↖ check ‘em

Name: Anonymous 2015-11-14 21:35

>>109,110
What are you doing?

Name: Anonymous 2015-11-14 22:44

>>112
the archive of dis4chan

Name: Anonymous 2015-11-14 22:53

>>113
Like viewing it or something? The archives are just sqlite databases.
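
Since the archives are plain SQLite files, a first look at one needs nothing more than Python's sqlite3 module. This sketch makes no assumption about the archive's schema (the thread does not describe it); it only lists whatever tables exist and how many rows each holds. The function name is hypothetical.

```python
import sqlite3
from contextlib import closing


def list_tables(db_path: str) -> list:
    """Return (table name, row count) pairs for every table in the file."""
    # closing() makes sure the connection is closed when the block exits.
    with closing(sqlite3.connect(db_path)) as conn:
        names = [row[0] for row in conn.execute(
            "SELECT name FROM sqlite_master WHERE type = 'table' ORDER BY name"
        )]
        result = []
        for name in names:
            # Table names cannot be bound as parameters, so quote them instead.
            quoted = '"' + name.replace('"', '""') + '"'
            count = conn.execute("SELECT COUNT(*) FROM " + quoted).fetchone()[0]
            result.append((name, count))
        return result
```

Running it against a downloaded archive file shows which tables a viewer would need to read before any thread or title pages can be built from it.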

Name: Anonymous 2015-11-15 0:43

>>114
yes viewing it with the bibanon viewer

Name: Anonymous 2015-11-15 15:45

>>114
How do you pronounce 'sqlite'?

1. Ess-que-ell-ite
2. See-quel-lite
3. See-quel-ite
4. Squdder (faggot)

Name: Anonymous 2015-11-15 15:48

>>116
sk-light

Name: Anonymous 2015-11-15 18:00

>>116
5. Ess-queue-lite

Name: Anonymous 2015-11-15 20:48

>>9
You forgot ``Flag on the moon... How did it get there?''

Name: Anonymous 2015-11-16 0:13

does anyone have a copy of the poems?

they were hard to get cause they went away fast..
