Question 1

Should I block AI crawlers in robots.txt?

Accepted Answer

Distinguish purposes. Search bots (OAI-SearchBot, PerplexityBot, Claude-SearchBot) fetch pages to cite them in AI answers — blocking those makes you invisible exactly where buyers increasingly ask. Training bots (GPTBot, CCBot, Bytespider) only feed model training; blocking them is a legitimate choice that doesn't hurt AI visibility.

Question 2

What's the difference between GPTBot and OAI-SearchBot?

Accepted Answer

Both belong to OpenAI. GPTBot collects content for model training. OAI-SearchBot powers ChatGPT's web search — it decides whether your pages can appear as cited sources in ChatGPT answers. Many sites block GPTBot for policy reasons but should keep OAI-SearchBot allowed.

Question 3

Does blocking an AI bot in robots.txt actually work?

Accepted Answer

Major vendors (OpenAI, Anthropic, Google, Perplexity for its crawler) document robots.txt compliance and publish IP ranges for verification. Some bots, notably Bytespider, are widely reported to ignore robots.txt — blocking those requires firewall rules instead.

robots.txt AI Crawler Checker

Search bots vs. training bots

AI crawler FAQ