Skip to main content

Home/ Digit_al Society/ Group items tagged robots.txt

Rss Feed Group items tagged

dr tech

Google will let publishers hide their content from its insatiable AI - 0 views

  •  
    "Google has announced a new control in its robots.txt indexing file that would let publishers decide whether their content will "help improve Bard and Vertex AI generative APIs, including future generations of models that power those products." The control is a crawler called Google-Extended, and publishers can add it to the file in their site's documentation to tell Google not to use it for those two APIs. In its announcement, the company's vice president of "Trust" Danielle Romain said it's "heard from web publishers that they want greater choice and control over how their content is used for emerging generative AI use cases.""
dr tech

With the rise of AI, web crawlers are suddenly controversial - The Verge - 0 views

  •  
    "In the last year or so, though, the rise of AI has upended that equation. For many publishers and platforms, having their data crawled for training data felt less like trading and more like stealing. "What we found pretty quickly with the AI companies," Stubblebine says, "is not only was it not an exchange of value, we're getting nothing in return. Literally zero." When Stubblebine announced last fall that Medium would be blocking AI crawlers, he wrote that "AI companies have leached value from writers in order to spam Internet readers." "
1 - 2 of 2
Showing 20 items per page