Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Many people don't want their data used for free/any training. AI developers have been so repeatedly unethical that the well-earned Baysian prior is high probability that you cannot trust AI developers to not cross the training/inference streams.


> Many people don't want their data used for free/any training.

That is true. But robots.txt is not designed to give them the ability to prevent this.


It is in the name, rules for the robots. Any scraping ai or not, and even mass recrsive or single page, should abide by the rules.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: