Cloudflare is luring web-scraping bots into an ‘AI Labyrinth’

2 2 minutes read

Cloudflare, one of the largest internet infrastructure companies in the world, has announced AI Labyrint, a new tool for fighting robots rolling on the Internet that get rid of sites for artificial intelligence training data without permission. The company says in a blog publication that when you discover “inappropriate robot behavior”, it urges the free tool, on the path of connections with trap pages of artificial intelligence “slowing, mixing, and wasting resources” who behave in a bad intention.

Web sites have long used the Robots.TXT system, which is a text file that gives or rejects the permission of the sections, but artificial intelligence companies, even well -known companies such as Antarbur and the bewilderment of artificial intelligence, have been accused of ignoring them. Cloudflare writes that she sees more than 50 billion requests on the Internet a day, and although it contains tools to discover and ban that harmful, this often causes the attackers to switch tactics in the “armament race that never ends”.

Cloudflare says instead of preventing robots, as it is called Ai Labyrint by making it processing data that has nothing to do with the actual data of the website. The company says it is also working as a “generation of the next generation”, a drawing in the crawl of artificial intelligence who keep the following links to fake pages in a deeper way, while a regular person will not do that. This makes the easiest fingerprints harmful to the Cloudflare menu from bad actors in addition to determining “new patterns and signatures of robot”, it will not be discovered in another way. According to this post, these links should not be visible to human visitors.

You can read more about how Ai Labyrint has works on the Cloudflare Blog, but here are more details from the post:

We have found that creating a variety of topics first, then creating content for each topic, produced more varied and persuasive results. It is important for us that we do not generate inaccurate content that contributes to the spread of wrong information on the Internet, and therefore the content that we create is real and related to scientific facts, and not only related or ownership of the site.

Site officials can choose to use the maze of artificial intelligence by moving to the BOT Department in the Cloudflare Dashboard for their site and replacing them. The company says that “this is only the first repetition of the use of obstetric intelligence to thwart robots.” It plans to create “full networks of URLs associated” with the ends of the ends that end will face a difficult time in fake registration. like Art Technica Notes, Amnesty International’s maze looks similar to Nepenthes, a tool designed for a particle crawl for “months” in hell of scrap data created from artificial intelligence.

2025-03-22 18:17:00

2 2 minutes read