Crawl intelligence

robots.txt & llms.txt (AI Crawlers)

Control search engine crawl budgets with a smart robots.txt generator, then publish an llms.txt policy so AI systems like GPTBot and ClaudeBot can discover your best content.

11 min read All plans robots.txt + llms.txt

Before you start

Crawl Intelligence is a two-tab workspace under Tools & Analytics. The robots.txt tab manages search engine crawl rules; the llms.txt tab publishes an AI content access policy at /llms.txt.

Where to find it Open SEO Rank Genius → Tools & Analytics → Crawl Intelligence, then switch between the robots.txt and llms.txt (AI Crawlers) tabs.

You will need:

  • WordPress Administrator access
  • No conflicting physical robots.txt file in your site root (plugin serves it dynamically)
  • Permalinks enabled (pretty URLs) for /robots.txt and /llms.txt to resolve
Staging sites SEO Rank Genius auto-blocks all crawlers on detected staging environments. Do not override this unless you intentionally want staging indexed.

Open Crawl Intelligence

Navigate to:

SEO Rank Genius → Tools & Analytics → Crawl Intelligence

The section has two tabs in the Tools sidebar:

robots.txtSmart or Advanced mode, section checkboxes, Disallow/Allow KPIs, live preview.
llms.txt (AI Crawlers)AI bot toggles, pillar page KPIs, authority-driven content policy, Markdown preview.

robots.txt Smart mode

Smart mode generates a recommended robots.txt with one click. It blocks low-value paths (admin, feeds, search results) while allowing your sitemap and important content.

/wp-admin/admin.php?page=seo-link-genius_tools&tab=robots-txt
SEO Rank Genius â€ē Tools & Analytics â€ē Crawl Intelligence
robots.txt llms.txt (AI Crawlers)
</>

robots.txt

Publish a search-friendly robots.txt at yoursite.com/robots.txt

Enable
Disallow Rules
18
Allow Rules
4
Sitemaps
2
Crawl Savings
34%
Smart ModeRecommended one-click rules
AdvancedManual custom rules

Block Sections

Block /wp-admin/
Block search results
Block feed URLs
Include sitemap reference

robots.txt tab — Smart/Advanced mode selector, Disallow/Allow KPIs, and section checkboxes.

Toggle Enable in the header to publish the file. Smart mode is the right default for most WordPress sites.

robots.txt Advanced

Switch to Advanced mode for full manual control: add custom Allow/Disallow rules, set crawl-delay directives, and manage user-agent-specific blocks.

Smart Mode
Advanced

Custom Rules

Disallow /private-area/ Add
DISALLOW /wp-json/
ALLOW /wp-content/uploads/

Live Preview

User-agent: * Disallow: /wp-admin/ Allow: /wp-admin/admin-ajax.php Sitemap: https://example.com/sitemap.xml

Advanced mode — custom Allow/Disallow rules with live code preview.

Verify after saving Visit yoursite.com/robots.txt in a private browser window and confirm the preview matches the live file. Clear any CDN or full-page cache first.

llms.txt policy

The llms.txt tab publishes a machine-readable content policy at /llms.txt. AI systems use it to understand which pages represent your authority content and how they may use your material.

/wp-admin/admin.php?page=seo-link-genius_tools&tab=llms-txt
SEO Rank Genius â€ē Tools & Analytics â€ē Crawl Intelligence
robots.txt llms.txt (AI Crawlers)
🤖

llms.txt – AI Content Access Policy

Help AI systems discover your best content

Enable
Pillar Pages
12
Total Posts
248
AI Crawlers
3/4

Markdown Preview

# Example Site > SEO guides and WordPress tutorials## Pillar Content – [WordPress SEO Guide](/wordpress-seo/) – [Schema Markup Basics](/schema-markup/)

llms.txt tab — pillar pages KPI, crawler count, and Markdown preview at /llms.txt.

Enable llms.txt v2 PRO for authority-scored, llmstxt.org-compliant output that prioritizes pillar-cluster content structure.

AI crawler access

Individual toggles control which AI bots may access your content. Each bot has a dedicated user-agent recognized by major AI platforms.

AI CrawlerPlatformAccess
GPTBotOpenAI / ChatGPT
ClaudeBotAnthropic / Claude
PerplexityBotPerplexity AI
Google-ExtendedGoogle Gemini
Priority pillar pages — list authority content first in llms.txt

AI crawler rows — per-bot access toggles and priority pillar pages option.

GPTBotOpenAI crawler for ChatGPT training and browsing features.
ClaudeBotAnthropic crawler for Claude AI discovery.
PerplexityBotPerplexity search index crawler.
Google-ExtendedGoogle bot for Gemini and AI Overviews.

Troubleshooting

/robots.txt shows WordPress default or 404

Enable the robots.txt toggle in Crawl Intelligence and save. Flush permalinks under Settings → Permalinks (click Save without changes). Remove any physical robots.txt file from your server root.

Preview differs from live file

Purge CDN and page cache. Another SEO plugin may also be serving robots.txt — disable its robots feature to avoid conflicts.

/llms.txt returns 404

Enable llms.txt in the tab header, save settings, and flush permalinks. Confirm pretty permalinks are active (not Plain).

AI crawler still blocked after enabling

Check robots.txt for a blanket Disallow: / on the bot user-agent. Smart mode handles AI bots separately — re-save in Smart mode or add an explicit Allow rule in Advanced.

Staging environment blocks everything

This is intentional. The staging detection banner appears at the top of the robots.txt tab. Only override on production domains.