Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
fluidcruft
19 days ago
|
parent
|
context
|
favorite
| on:
Perplexity is using stealth, undeclared crawlers t...
Many people don't want their data used for free/any training. AI developers have been so repeatedly unethical that the well-earned Baysian prior is high probability that you cannot trust AI developers to not cross the training/inference streams.
JimDabell
19 days ago
[–]
> Many people don't want their data used for free/any training.
That is true. But robots.txt is not designed to give them the ability to prevent this.
gunalx
18 days ago
|
parent
[–]
It is in the name, rules for the robots. Any scraping ai or not, and even mass recrsive or single page, should abide by the rules.
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: