ai-training-opt-out
ai-training-opt-out copied to clipboard
Known tags and settings suggested to opt out of having your content used for AI training.
AI Training Opt Out
Known tags and settings suggested to opt out of having your content used for AI training.
Contents
- robots.txt A copy-and-paste collection of tags to add to your own robots.txt. (You can automate generation of this file with darkvisitors.com)
- meta-tags.html A copy-and-paste collection of tags to add to your own
<head> - headers.txt HTTP headers you can add to your responses. This is more more involved and installation is outside the scope of this document.
- ai.txt An alternative to robots.txt created by Spawning, the company behind haveibeentrained.com.
- ip-ranges.txt Known IP ranges for AI crawlers. These will change over time, so links to the canonical source is included.
- tdmrep.json A Web protocol, capable of expressing the reservation of rights relative to text & data mining (TDM)
Other Opt-Outs
- OpenAI (Includes ChaGPT and DALL·E): You can opt-out of having your input and output to their services used to train by emailing your organization ID to [email protected]. Note: This doesn't include any data they scraped to train their model.
- StabilityAI: Stable Diffusion 3 will honor opt-out requests on haveibeentrained.com.
- AWS: "AWS may be using your data to train its AI models, and you may have unwittingly consented to it. Prepare to jump through a series of complex hoops to stop it." -- How to Stop Feeding AWS’s AI With Your Data
- Substack "If you do NOT want your publication to be used to train AI, open your publication, go to Settings > Publication details and switch it on."
- Wordpress and Tumblr are both opt-out for your post content.
- The Stack Find your repo(s) on Am I in The Stack? and then click Opt-Out at the bottom to open a request.
References
- How to Block ChatGPT From Using Your Website Content
- All Deviations Are Opted Out of AI Datasets
- OpenAI Terms of Use
- Stability AI plans to let artists opt out of Stable Diffusion 3 image training
- Stop AI Data Mining in its Tracks with AI.txt
- Sites scramble to block ChatGPT web crawler after instructions emerge
- An update on web publisher controls -- Google's VP of Trust
- Dark Visitors: A List of Known AI Agents on the Internet
- TDM Reservation Protocol (TDMRep)