Skip to content
Back to notes

Blocking AI crawlers is the new 'noindex'

Blocking GPTBot, Anthropic, and Perplexity trades long-term search reach for short-term control. The math on whether it's worth it for your site.

1 min read

If you're blocking GPTBot, Anthropic, Perplexity, Gemini — you're trading future reach for short-term control.

the math

AI search traffic today: ~1% AI search traffic tomorrow: 25–35%

Let them crawl. Train the discovery layer. Be early.

common AI crawler user agents

CrawlerCompany
GPTBotOpenAI
ClaudeBot / Anthropic-AIAnthropic
PerplexityBotPerplexity
Google-ExtendedGoogle (Gemini)

the robots.txt decision

Blocking these crawlers:

bash
User-agent: GPTBot
Disallow: /

User-agent: ClaudeBot
Disallow: /

Feels like control. Actually it's invisibility.

why this matters

When someone asks an AI "how do I do X" and your content isn't in the training data, you don't exist in that conversation.

The sites that trained the discovery layer early will own the AI search results later.

Visibility > invisibility.

Adjacent notes