No AI training opt-out declaration

This check looks for a declaration in your robots.txt — or a meta tag in your HTML — indicating your preference about whether AI companies can use your content to train their models. Without a declared preference, each AI crawler makes its own decision about whether to use your content for training. Regulations and industry norms around AI training data are evolving, and having an explicit declaration on record protects your content rights and demonstrates awareness of the issue. The most reliable current approach is to add rules to your robots.txt blocking specific AI crawlers by name (such as GPTBot, ClaudeBot, or CCBot). Some sites also add a meta tag such as meta name="robots" content="noai, noimageai" — however this is an informal convention not yet supported by all crawlers. Check each major AI crawler's own documentation for their current opt-out mechanism, as conventions are still evolving.

Why this matters

Without a declared preference, each AI crawler makes its own decision about whether to use your content for training. Regulations and industry norms around AI training data are evolving, and having an explicit declaration on record protects your content rights and demonstrates awareness of the issue.

How to fix it

The most reliable current approach is to add rules to your robots.txt blocking specific AI crawlers by name (such as GPTBot, ClaudeBot, or CCBot). Some sites also add a meta tag such as meta name="robots" content="noai, noimageai" — however this is an informal convention not yet supported by all crawlers. Check each major AI crawler's own documentation for their current opt-out mechanism, as conventions are still evolving.