CAPTCHAs have failed for 20 years

(browserbase.com)

58 points | by harsehaj 3 hours ago

23 comments

netik 1 hour ago
So this is a basically a shill advertisement ending in "Your AI Agents can avoid captchas if you pay us."
The last example is a false narrative, that captchas will only happen if the "browser looks suspicious". Systems like Altcha put an end to this argument. They don't care if the browser looks suspicious, only that the browser can perform a proof-of-work to get past a captcha designed to slow down the request rate.
When applied consistently, it will effectively block and slow down AI crawlers, which is what this company wants to promote.
[-]
- chrismorgan 56 minutes ago
  Proof-of-work is bad rate limiting: https://news.ycombinator.com/item?id=44093918. The playing field is wildly unbalanced. Even naive attackers tend to have a lot more computing power available than a lot of your normal users, and where it’s SHA-256 (which is almost the worst choice imaginable for a proof of work scheme, yet which every single service that I know of has used), an intelligent attacker goes from being hundreds of times as powerful to millions of times as powerful.
- peeet 48 minutes ago
  More advanced and targeted bots can "bypass" Proof of work as well though, e.g. using something like https://github.com/toman-tom/Incapsula-PoW
- gruez 1 hour ago
  >Systems like Altcha put an end to this argument. They don't care if the browser looks suspicious, only that the browser can perform a proof-of-work to get past a captcha designed to slow down the request rate.
  That doesn't really work out in reality because bots are happy to wait 5 seconds or even 5 minutes for a PoW challenge to complete. Humans on the other hand will not, especially if they're on a mobile device with limited compute and energy.
ramify123 12 minutes ago
Here is some fun captcha instead of that https://feralui.vercel.app/#/captcha
CM30 1 hour ago
The issue is that anything that becomes a standard here automatically becomes a target. If the same sort of captcha protects everything from Gmail to Twitter to Cloudflare and Facebook, then bot creators and spammers have a huge incentive to bypass it no matter what. And if we've learnt anything about spam, it's that pretty much every system we can think of can be bypassed or automated away.
The solution is really a ton of different captcha like systems and anti spam solutions, all unpopular enough that an attacker may not even bother targeting them. If an attacker needs to target a few thousand different captcha style setups to get their spam through, then many of them won't bother.
It's like centralised vs decentralised communication systems. If everything is centralised, a bad actor (like a government, corporation, criminal group, etc) can go after one target to control the narrative. If it's decentralised, then suddenly they have to go after dozens or hundreds of different targets, many of which won't cooperate with them.
epgui 2 hours ago
I thought half the point of captchas was to train vision models?
[-]
- ben_w 1 hour ago
  This is in the article.
  Indeed, half the point for reCAPTCHA: That how Google could justify supplying reCAPTCHA for free, but not why people wanted to use them.
  [-]
  - chinathrow 59 minutes ago
    > That how Google could justify supplying reCAPTCHA for free, but not why people wanted to use them
    This and Pokemon Go for collecting videos: are there other examples of users doing the free work for $large_co?
    [-]
    - ben_w 12 minutes ago
      https://en.wikipedia.org/wiki/Self-checkout
curtisboortz 1 hour ago
The Chrome extension angle is interesting here. We ship an extension that interacts with Gmail and have seen how much variance there is in what Google considers "bot-like" behavior from extensions vs. the browser tab. The line between "automated" and "assisted" is not well defined at the API level, which ends up being a similar underlying problem: distinguishing intent rather than pattern.
hombre_fatal 2 hours ago
As TFA points out, a major change is that bot traffic now comes from honest users via their LLM sessions, so you don't even necessarily want to block automated bots anymore.
The game is shifting to a better ideal: how do you design a service knowing that any user/request might be automated?
Especially in place of the historical, easy solution/hack where you have some sort of gate that, once passed, puts the user in some trusted low-scrutiny tier, like a forum's registration page.
It's a similar question to designing a system so that it's resilient to account take-overs. (i.e. The user was a trusted human until now, and now it's a spammer)
Example: on a forum, run new posts through an LLM to classify it as spam which is a magic solution we always wish we had (remember akismet?) but was too rudimentary.
[-]
- wildzzz 1 hour ago
  You use API tokens for things intended to be machine to machine communication and captchas for things intended to be filled out by humans. Not every site or service wants automated input, even if it's being directed by a human. I dont want forums like HN just filled with a bunch of agents talking to eachother, where's the human connection?
giancarlostoro 1 hour ago
I remember at one point in my teens, someone had made a web app that would snag the captcha and show you only the captcha, and you would just endlessly solve captchas, while the application tried different passwords on a backend, and logging any successful logins.
[-]
- yieldcrv 1 hour ago
  Some of the first bitcoin faucets in 2011, 2012 were bots doing that
  Users thought the captcha was antispam prevention for them to receive bitcoin
  It was really just the bot forwarding a captcha to continue its spam once solved, posting the user in bitcoin
  [-]
  - giancarlostoro 29 minutes ago
    LOL I don't remember doing captcha, but I remember receiving bitcoin from a faucet, thought it was strange.
ra0x3 2 hours ago
TLDR: They're promoting a product they're working on with Cloudfare under the guise of it being an "open standard" [1]. Of course, in the docs, Step 1 is "Sign in with your Cloudfare account". Comes across a bit land-grabby.
[1] https://www.browserbase.com/blog/cloudflare-browserbase-pion...
joehabeebs 1 hour ago
The most recent variations that force you to click the boxes containing a certain artifact are incredibly frustrating and fail half the time. The large influx of AI-SEO optimized content being created makes me question CAPTCHAs efficacy today
matteo8p 1 hour ago
Really nice read Harsehaj!
I haven't looked deeply into Web Bot Auth, but is identification tied to the agent (one identity per agent) or is it tied to the underlying person using the agent (the user)?
Hope that question makes sense, lmk if you need clarification
[-]
- peytoncasper 1 hour ago
  Hey Matt,
  I would say everyone is leaning towards organization/individual right now but I would image that flips as the number of agents grow
thenthenthen 2 hours ago
Omg. I am on various VPN’s and now and again Google Auth (for youtube) throws me a captcha. They are mostly unreadable, but there is an audio option… which is just insane and does not make any sense, anyone had that? It sounds like a recording of 300 people speaking at the same time in a call center while on various dosages of LSD
[-]
- nosioptar 2 hours ago
  I've actually been in a call center with 300 intoxicated folk all talking at once. Its easier to understand than the recaptcha audio.
  (Only a couple folks on hallucinogenics, most on various downers.)
- moralestapia 2 hours ago
  I've got captchas that made me play a small game and I score like 3 points to go ahead, lol. For real.
- willmadden 2 hours ago
  They give you that (or hieroglyphics) if you are using certain VPNs and don't leave a specific browser fingerprint.
  [-]
  - prmoustache 1 hour ago
    There is a point where not leaving fingerprints becomes a fingerprint in itself.
SirMaster 1 hour ago
What about those ones where you need to slide some piece of a puzzle in. I don't see those mentioned at all. Are they effective?
GL26 2 hours ago
Question that I've been wondering, can't attackers record human sessions and use it to attack a website to bypass cloudflare ?
[-]
- bluGill 2 hours ago
  They can. They have already figured out a lot of what cloudflare is looking for and have figured out how to bypass it. (according to the article) Which is why protection is trying something else. I suppose this is why every website wants me to login with my google account (which I never use)
ezst 1 hour ago
They have served to train multiple generations of ANN and ML algorithms, in that, I think they've been a resounding success!
randrus 2 hours ago
Always reminds me of the forces that shape the mechanisms around the exchange of genetic information that powers evolution.
See: Red Queen by Matt Ridley.
visiondude 1 hour ago
although not perfect for other reasons, a captcha made using phone motion and device attestation like prsn.you is a more challenging bypass for today’s agent environments
akimbostrawman 1 hour ago
Failed? They have very successfully pushed people towards chromium browser and traceable residential IPs while also training AI.
throw7 2 hours ago
Just today a website presented me a qrcode captcha. I threw up.
echoangle 2 hours ago
Oh my good I hate AI articles. Why do we have to make an interactive visualization for every single sentence? Thanks for showing me how distorted text is made in steps.
And being a cat and mouse game doesn’t mean the defenders failed.
[-]
- qweqwe14 2 hours ago
  > And being a cat and mouse game doesn’t mean the defenders failed.
  It does though, in the end attackers always win. If something is a "cat and mouse game" then it's unwinnable by design from the defender side.
  Sure, you can keep playing it if you feel like it, but at some point the attacker will be indistinguishable from a legitimate user and you will lose that fight.
  [-]
  - echoangle 2 hours ago
    By that logic, every security task is doomed to fail. Spam detection and antivirus are cat and mouse games too. I wouldn’t say they fail just because they have to adapt over time.
kgwxd 2 hours ago
They're great for keeping humans out. Tried to setup Discord on a new phone yesterday. CAPTCHAs over and over again, just trying to log in. I uninstalled instead.
cute_boi 1 hour ago
It has failed because of these company like browserbase and hackers who hack smart device and TV's for residential proxy.
jmclnx 2 hours ago
They have been around that long ? Does not seem so but the timing could be correct probably because the sites I went to had no need for CAPTCHAs until AI came around.
[-]
- Zak 2 hours ago
  The name wasn't invented until 2003, but yes.
  Guestbooks, contact forms, signup pages, and the like started receiving automated abuse approximately five minutes after they were invented. It didn't take long after that for people to start including a question they expected to be easy for a person and hard to automate with a script.
  What's relatively new is CAPTCHAs merely to browse a site. There are few faster ways to get me to close your site, and maybe send you an unfriendly email.
  [-]
  - nosioptar 2 hours ago
    My first guestbook asked Hagar or Roth. Answering correctly got your message added to the book. Answering Hagar got you sent to an infinite redirect loop for being either a bot or a moron.
- code_duck 2 hours ago
  So in the past few years? Oh dear, no. Captchas have been in common use for much longer than that. reCAPTCHA has been around almost 20 years.
- JohnFen 2 hours ago
  They were introduced in 1997, although I personally didn't start seeing them until a couple of years later.
zuzululu 2 hours ago
so whats the solution then? get people to turn on their camera and hold up 15 fingers ?
[-]
- fusslo 1 hour ago
  it sounds like the article & company are building identity based on fingerprinting/cross-domain behavior. Inferring at multiple levels, including cloudflare's
  It's just more identity verification afaict
- ranger_danger 2 hours ago
  PACT: https://news.ycombinator.com/item?id=48647360
- throwawayffffas 2 hours ago
  The solution is login and paywalls.
  [-]
  - kgwxd 2 hours ago
    That's crazy. People aren't going to pay to be tracked and have ads shoved in their faces! The economy would collapse!