According to Dark Visitors founder Gavin King, most of the major AI agents still abide by robots.txt. âThatâs been pretty consistent,â he says. But not all website owners have the time or knowledge to constantly update their robots.txt files. And even when they do, some bots will skirt the fileâs directives: âThey try to disguise the traffic.â
Prince says Cloudflareâs bot-blocking wonât be a command that this kind of bad actor can ignore. âRobots.txt is like putting up a âno trespassingâ sign,â he says. âThis is like having a physical wall patrolled by armed guards.â Just as it flags other types of suspicious web behavior, like price-scraping bots used for illegal price monitoring, the company has created processes to spot even the most carefully concealed AI crawlers.
Cloudflare is also announcing a forthcoming marketplace for customers to negotiate scraping terms of use with AI companies, whether it involves payment for using content or bartering for credits to use AI services in exchange for scraping. âWe don’t really care what the transaction is, but we do think that there needs to be some way of delivering value back to original content creators,â Prince says. âThe compensation doesn’t have to be dollars. The compensation can be credit or recognition. It can be lots of different things.â
Thereâs no set date to launch that market, but even if it rolls out this year it will be joining an increasingly crowded field of projects intended to facilitate licensing agreements and other permissions arrangements between AI companies, publishers, platforms, and other websites.
What do the AI companies make of this? âWeâve talked to most of them, and their reactions have ranged from âthis makes sense and weâre openâ to âgo to hell,ââ says Prince. (He wouldnât name names, though.)
The project has been fairly quick-turnaround. Prince cites a conversation with Atlantic CEO (and former WIRED editor in chief) Nick Thompson as inspiration for the project; Thompson had discussed how many different publishers had encountered surreptitious web scrapers. âI love that heâs doing it,â Thompson says. If even big-name media organizations struggled to deal with the influx of scrapers, Prince reasoned, independent bloggers and website owners would have even more difficulty.
Cloudflare has been a leading web security firm for years, and it provides a large portion of the infrastructure holding up the web. It has historically remained as neutral as possible about the content of the websites its services; on the rare occasions it made exceptions to that rule, Prince has emphasized that he doesnât want Cloudflare to be the arbiter of whatâs allowed online.
Here, he sees Cloudflare as uniquely positioned to take a stand. âThe path we’re on isn’t sustainable,â Prince says. âHopefully we can be a part of making sure that humans get paid for their work.â
