Return Styles: Pseud0ch, Terminal, Valhalla, NES, Geocities, Blue Moon.

Pages: 1-4041-

"API" bullshit.

Name: Cudder !cXCudderUE 2015-07-13 9:39

To web service providers:

Don't make me register for an "API key" or whatever you call it just so you can get my personal info and log, throttle, or charge for my requests for a subset of the data that I can already get freely via your app or website. Don't spread the misconception that your "developer API" is the only way to get at your data.

To "developers":

Don't beg and whine for an API when you can just use your fucking browser or their app to see the data already. Learn what View Source can do for you. Don't be mislead by web service providers who peddle their stupid API bullshit just so they can exert more control over you.

Name: Anonymous 2015-07-13 10:38

Enjoy data bloat when you parse 400% more data than necessary.

Name: Anonymous 2015-07-13 15:20

How else would you limit API usage, Cudder-kun?

Name: Anonymous 2015-07-13 16:12

>>2
you say bloat, I say BIG DATA.

Name: Anonymous 2015-07-13 17:08

This thread sucks harder than Cudder in the port-a-john pulled up to the summer Intel employee barbecue

just so you can get my personal info
Because lists of the email addresses of autists are in high demand
log, throttle
you are retarded if you believe that sane people won't try to cover their ass in these extremely common and simple ways
charge
you are similarly/further retarded if you believe that companies/organizations providing/developing an API will not try to squeeze every cent out of its use
for a subset of the data that I can already get freely via your app or website
Of course! everyone should just write html scrapers--using what's already there, no matter how many ebin hacks are necessary, is always better!

don't beg and whine for an API when you can just use your fucking browser
Learn what View Source
Holy shit IHBT. Where's your HTML parser? You got one in ASM? I have some markup for it to look at whenever you feel like publishing what I'm sure is a monstrous, broken aberration against all that is good. Since you won't ever do that though, in the meantime, you can take this "every integration is a hostile integration" attitude and fucking choke to death on it. RRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRREEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEe

Name: Anonymous 2015-07-13 18:03

I parse data from HTML. Further, I use regex to do it. People at my office give me odd looks for doing it, and when I ask them why, they just tell me something about satan or whatever. I think they got that meme from stack overflow.

Name: Anonymous 2015-07-13 18:06

>>6
regex
But Anon, HTML is not a regular language.

Name: Anonymous 2015-07-13 18:23

>>7
And most regex engines are more than simple regular expression parsers. Your point being?

Name: Anonymous 2015-07-13 18:24

>>7
so you can't match nested tags but you can write one off expressions to get the info you want out of some html files

Name: Anonymous 2015-07-13 18:51

>>8
most regex engines are more than simple regular expression parsers
Maybe the cancerous ones
Your point being?
I'll spell it out for you, pinhead: Square peg, round hole

>>9
You should realize by now that absolutely nothing is one off. Have fun dicking with your RE every time there is a subtle markup change or inconsistency.

Name: Anonymous 2015-07-13 19:11

dubs

Name: Anonymous 2015-07-13 20:18

>>10
Maybe the cancerous ones
Regex engines that implement lookahead/lookbehind are able to parse more than just regular languages. Pretty much every decent engine out there supports them.

I'll spell it out for you, pinhead: Square peg with rounded corners, square hole.

Name: >>12 2015-07-13 20:23

Either way, using regular expressions for parsing HTML is retarded. Not impossible but not a good idea.

HTML is verbose XML shit anyway.

Name: Anonymous 2015-07-13 20:28

>>7,13
>>6-kun parses data from HTML. He's not parsing HTML with regex. Learn the difference.

Name: Anonymous 2015-07-13 20:38

oh my god I agree with cudder

Name: Anonymous 2015-07-13 21:06

>>12
lookahead/lookbehind
Cancer! Why even use RE at that point? Might as well write a whole goddamned lexer, at least that has some chance of being legible. Christ I hate REfags.

>>14
Kill urself my man.

Name: Anonymous 2015-07-13 21:18

>>16
Why ``write a whole goddamned lexer'' when you could learn a little syntax?

Name: Anonymous 2015-07-13 21:30

>>17
I will not be trolled.

Name: Anonymous 2015-07-13 22:05

you guys are pathetic arguing about reddit-tier bullshit like appropriate use of regex

I miss old /prog/ when we were above that shit

Name: Anonymous 2015-07-13 22:15

>>19
Be the change you want to see in the world.

Name: Anonymous 2015-07-13 22:16

``I miss old x'' is a tired meme.

Name: Anonymous 2015-07-13 22:32

This may surprise you, but I invented the ``I miss old x'' meme

Name: Anonymous 2015-07-13 23:05

This may surprise you, but I invented the ``This may surprise you, but I invented the ``x'' meme'' meme''

Name: Anonymous 2015-07-13 23:51

I stopped using the YouTube API when they dropped V2 because of this.

The idea of using API keys for client-side access is stupid anyway. Someone should write an API key scraper so we can put an end to that bullshit.

Name: Anonymous 2015-07-14 0:51

>>24
Host it

Name: Anonymous 2015-07-14 1:41

>>19
I don't think we were ever above it, but enough of us didn't take it seriously at least. We have a new vocal 4channer so he could explain the new tone.

>>10,12,13
Propose an alternative that's better then. Keep in mind the alternative needs to have the same performance. Only the data you want is extracted. Everything else is passed over. The criteria are performance and ease of use.

Name: Anonymous 2015-07-14 2:00

>>26
Get an API key.

Name: Anonymous 2015-07-14 5:59

By downloading entire HTML files for every API request, you'll reach your IP's quota much faster.

TOS, nigger. Abide by it or be sued.

Name: Anonymous 2015-07-14 6:32

>>27
get my personal info and log, throttle, or charge for my requests for a subset of the data that I can already get freely via your app or website.
Nope, that's more of a bother.

Name: Anonymous 2015-07-14 6:34

why are you all talking about scraping html? most websites display their content via javascript now, so you can just use their own javascript api.

Name: Anonymous 2015-07-14 6:36

>>28
That's what proxies are for, nigger word user.

Name: Anonymous 2015-07-14 13:53

>>1
Have fun rewriting your scrapers every time the front-end engineer decides he wants to restructure the HTML. At least (good) APIs are stable.

Name: Anonymous 2015-07-14 14:23

dubs

Name: Anonymous 2015-07-14 14:33

>>19
at least I'm not bumping this stupid thread like a nigger.

Name: Anonymous 2015-07-14 14:33

>>32
That's why you hire lazy people that won't go around changing shit on you.

Name: Anonymous 2015-07-14 15:50

>>28
You can always cache.

Name: Anonymous 2015-07-14 16:34

>>36
Only if the API has an affordance for replacement. Most don't.

Name: Anonymous 2015-07-14 22:25

>>1
Learn what View Source can do for you.
Tcpdump too.
Appers are so focused on apps that they write buggy serverside code without sanitization or limits.

Name: Anonymous 2015-07-14 22:50

>>35
So your solution to the changing html problem is to hope that the web developer doesn't change the structure? That's a fantastic and stable plan yo!

Name: Anonymous 2015-07-14 23:56

>>39
It works for me.

Name: Anonymous 2015-07-15 1:09

>>32,39
Those changes are easy to handle. Once you've written a scraper that extracts what you want, updating it for a new site structure is just a matter of changing a few paths and expressions. Scraping becomes difficult when the site appers make things deliberately hard for your bot. Like randomizing the html structure of the page or captchas. The other time when its hard is when the appers can't produce valid html, or the data isn't organized in any reliable way.

I pictured >>39 as a child in a plastic high set at a table in a fast food restaurant throwing opened packets of mustard and ketchup at me as he said that.

Name: Anonymous 2015-07-15 9:00

Smoke weed everyday

Name: Anonymous 2015-07-15 18:19

>>30
The API is rarely documented and usually hidden in megabytes of Javashit cruft. I once tried to reverse https://volafile.io's API, it was awful.

Name: Anonymous 2015-07-15 18:21

>>43
Just stay determined. Reverse engineering is tedious but it has a reward to it. Like checking these sweet dubs.

Name: Cudder !cXCudderUE 2015-07-16 1:15

>>43
Protip: look at the requests being made, it's usually obvious from that.

Name: Anonymous 2015-07-16 15:07

Protip: Cudder is all talk and no action

Name: Anonymous 2015-07-16 15:18

Posting in a shit thread! :^)

Name: Anonymous 2015-07-16 15:26

        Shitty thread Wasshoi!!
     \\ Shitty thread Wasshoi! //
 +   + \\Shitty thread Wasshoi!/+
        ∬ ∬    ∬ ∬    ∬ ∬  +
   +     人      人      人     +
         (__)    (__)    (__)
  +    (__)   (__)   (__)     +
.   +   ( __ )  ( __ )  ( __ )  +
      ( ´∀`∩ (´∀`∩) ( ´∀`)
 +  (( (つ   ノ (つ  丿 (つ  つ ))  +
       ヽ  ( ノ  ( ヽノ   ) ) )
       (_)し'  し(_)  (_)_)

Name: Anonymous 2015-07-16 15:40

>>47
:^)

Name: Anonymous 2015-07-17 1:57

JACKSON 50 GET

Name: Anonymous 2015-07-17 18:04

Does this thread come with an API??

Name: Cudder !cXCudderUE 2015-07-18 15:32

>>51
Look at the URLs, the HTML, and use your brain.

Name: Anonymous 2015-07-18 22:53

>>52
...while minimizing brain use.

Name: Anonymous 2017-12-19 8:59

If I wrote a web browser it would have built in and cached API for all websites it visits.

Name: Anonymous 2017-12-19 11:14

>>54
Cudder would write a web browser but Cudder is all talk and no action!

Name: Anonymous 2017-12-26 21:16

so you can get my personal info and log, throttle, or charge for my requests
and you think they aren't doing that when you GET their webpages? lol. if they give you an API, you can still hide behind a VPN and give them a fake name/pay with a prepaid debit card.
Learn what View Source can do for you.
most websites are migrating to being fully Javascript rendered e.g., React. to use that you'll need browser automation. so in addition to whatever parsing code you need, you'll have to pull down most of WebKit (or whatever) just to extract a little bit of data on a web page. G-d help you if they randomly change one of the CSS selectors.

it's just an extremely brittle design. most people need on-demand, reliable data and don't mind paying for access because they're not mentally ill trannies in some Eastern European hovel.

Don't change these.
Name: Email:
Entire Thread Thread List