So Cloudflare is creating the infrastructure to actually support the microtransactions discussed in the early days of the web?
Cloudflare creates AI crawler tollbooth to pay publishers
Cloudflare has started blocking AI web crawlers by default in a bid to become the internet's gatekeeper. The term "gatekeeper" has been applied in a pejorative sense to platform companies like Apple and Google that use their contractual and technical control over operating systems to extract monopoly rents from developers …
COMMENTS
-
-
-
Wednesday 2nd July 2025 13:40 GMT heyrick
Google is actually one of the few bots that seems to play reasonably nice.
There are many (hello perplexity, and some random SEO crap) that hammer a site, and then there are a number that don't disclose their identity and use regular user agent strings (but no human can fire off several dozen requests per second).
-
-
-
This post has been deleted by its author
-
-
-
Wednesday 2nd July 2025 08:59 GMT MOH
That "deal" has been broken for years, with Google increasingly placing larger snippets at the top of the SERPs or in sidebars to discourage users from clicking through.
The whole "AI overview" thing is just taking that practice further, but it's not new.
Given how bad search has become, we must be reaching the point where it's worth considering whether everyone should be blocking both AI and search index bots by default.
There's minimal benefit to sites with real content, at best you'll end up listed on page 3 behind dozens of identically inaccurate pages of AI-harvested dreck.
-
Wednesday 2nd July 2025 09:40 GMT tfewster
Oh, for the earlier days of the internet:
- When HP's website contained comprehensive, searchable info, downloads and user forums (before they were taken over by Compaq).
- When AOL kept hoi polloi penned up so they didn't pollute the WWW.
- When searching for "tfewster" returned hundreds of results from posts on tech forums - no, wait, that was enshittification.
-
-
-
-
Wednesday 2nd July 2025 13:51 GMT kmorwath
Re: Cloudflare-in-the-Middle
Not only that - Cloudflare is attempting to become the hub of all internet traffic - putting it in a very privileged position.
See how it offers cunningly to terminate TLS connections for site owner that are too lazy to setup a Let's Encrypt ACME client to obtain certificates, or can't. So it can see the traffic.
Does this offer means Cloudfllare needs to know which content are accessed?
We migh have escaped the "Microsoft Network" - and maybe a Google one - but are we going to be bound to the Cloudflarenet soon?
-
This post has been deleted by its author
-
-
This post has been deleted by its author
-
Wednesday 2nd July 2025 11:10 GMT Anonymous Coward
An earlier name for Gatekeeper was ...
Janitor whose role largely became the gatherer and disposer of trash and other garbage.
The trash and garbage is a pretty apt description of today's internet.
I notice that now when accessing many more sites but especially the "context rich" sites CloudFlare is interposing its gooseberry self more frequently.
After some wiggly-rotatey animation a tick box often appears before the destination site appears. (15s to a minute.)
I assume the animation hides some sort of client or browser detection and the tick box is used to detect human presence somehow.
What I am not clear about is whether the connection is ultimately redirected to the requested site with same magic key or nonce; or whether CloudFlare acts as web proxy (application level gateway) necessarily relaying all the traffic between the client and the content provider.
-
-
Thursday 3rd July 2025 14:19 GMT rgrgrgrg
No it's not a toll gate, which would be on the overall internet channel.
It's an entry gate, chosen by the website owner, Cloudflare are just the provider.
The can still choose to not use Cloudflare, or use them without the fee for AI bots, or indeed use a different CDN which may have a similar entry gate.
What is needed as a standard for this - oh yes, that already exists, we have the HTTP 402 response.
-
-
Saturday 5th July 2025 04:57 GMT Anonymous Coward
Add another option
"The payment service will let publishers block AI crawlers, allow specific ones, charge for access, or grant free access."
Add another option - provide the AI crawler with an intentionally-mangled version of the site. Most info would be altered to be wrong; simply putting "not" into most sentences in strategic places would do it, or adding incorrect adjectives, or swapping nouns with randomly-chosen ones. You could even have an AI generate (only once!) the mangled page.
In other words, if AIs are coming to eat your lunch, poison what they get!