Skip to content
visible4ai
EEAT CheckerLearnAI File GeneratorPricing
en|
…
Loading
visible4ai
AI File GeneratorLearnContactPrivacyTermsRefundsImprint
visible4ai - Find out if AI recommends your website — then fix it | Product Hunt

© 2026 visible4ai. A product of Siempi. GEO analysis and real citation visibility.

We measure if you get cited in LLMs and help you improve.

Go back
4 min

AI bot access

If a bot is blocked in robots.txt, that platform cannot crawl and cite your content. Ensure these crawlers are allowed for AI visibility:

Bot / User-AgentPlatform
GPTBot, ChatGPT-UserOpenAI (ChatGPT)
PerplexityBotPerplexity
ClaudeBot, anthropic-aiClaude
Google-ExtendedGemini / AI Overviews
BingbotCopilot

Some crawlers are used only for training (e.g. CCBot). Blocking training-only crawlers does not affect live search/citation; blocking the bots above does.

Example robots.txt rules

robots.txt is a small text file at the root of your site (visible4ai.com/robots.txt). Crawlers read it before fetching any page. Here are the two common cases:

Allow all major AI crawlers

This is the safe default if your goal is to be found in AI products. Explicit Allow rules make your intent obvious to anyone reading the file later.

robots.txt
# robots.txt — allow major AI crawlers
# (Allow: / is also the default when no rules exist;
#  spelling it out makes intent explicit.)

User-agent: GPTBot
Allow: /

User-agent: PerplexityBot
Allow: /

User-agent: ClaudeBot
Allow: /

User-agent: Google-Extended
Allow: /

User-agent: bingbot
Allow: /

Block one bot, keep the rest

If you want to stay visible everywhere except one platform (commonly because that platform also uses your content for training), block that specific user-agent and leave a wildcard Allow for everyone else.

robots.txt
# robots.txt — opt one bot out, allow the rest
# Useful if you want to stay citable but exclude a specific
# crawler from training your own content.

User-agent: GPTBot
Disallow: /

User-agent: *
Allow: /

robots.txt is a request, not enforcement. Well-behaved crawlers (the ones listed above) honour it; abusive scrapers ignore it. If you need real blocking, use server-level rules (firewall, rate limit, or 403 by user-agent).

The 10 AI Discovery FilesWhat works by provider