avatar
Ben Schwarz @benschwarz.bsky.social

For sure — afaik all of the major models are trained on stolen content However a distinction in this case: Perplexity ignore/do not read robots.txt, a common mechanism site owners use to set bot rules. When security systems detect & block, Perplexity mutates user-agent to hide & continue

aug 7, 2025, 9:27 am • 1 0

Replies

avatar
Maxime @maxiim3.bsky.social

One could say that at least Perplexity was one of the first to explicitly source the data and sites being parsed. Though I am not really aware of those mechanisms applied to Ai Agents. I will investigate a bit deeper. Thanks.

aug 7, 2025, 9:56 am • 1 0 • view