For sure — afaik all of the major models are trained on stolen content However a distinction in this case: Perplexity ignore/do not read robots.txt, a common mechanism site owners use to set bot rules. When security systems detect & block, Perplexity mutates user-agent to hide & continue