Free Tool · UK 2026
How do AI agents see your site?
Eight checks across the signals ChatGPT, Perplexity, Claude Web, and Google AI Overviews use to decide whether your domain is citation-worthy in 2026. Free, no sign-up required to see the score.
What this audit actually checks
- llms.txt manifest: an emerging convention that gives AI crawlers a curated, machine-readable site index. Weight: 15%.
- llms-full.txt corpus: single-file full-text dump for retrieval-augmented citation. Weight: 8%.
- AI crawler allow-list in robots.txt: explicit Allow rules for GPTBot, ClaudeBot, PerplexityBot, Google-Extended, and Applebot-Extended. Many sites block these by default. Weight: 12%.
- Schema.org structured data: JSON-LD presence, depth, and graph coverage (Organization, BlogPosting, FAQPage, BreadcrumbList, etc.). Weight: 18%.
- Open Graph + Twitter Card metadata: social-preview metadata completeness for X, LinkedIn, Slack, and Discord previewers. Weight: 10%.
- Speakable schema: SpeakableSpecification on long-form content. Powers Google Assistant + AI Overview voice extraction. Weight: 8%.
- Heading hierarchy: single h1, structured h2/h3, no level-skips. AI agents use heading structure for content extraction. Weight: 7%.
- XML sitemap: sitemap presence, URL count, lastmod tags. Weight: 7%.
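To make the robots.txt check concrete, here is a minimal Python sketch of how you might test crawler access yourself. It is an illustration, not the audit's own implementation. The function name `ai_crawler_access`, the `SAMPLE` file, and the example.com URL are all hypothetical. Note that it reports effective access, so a crawler with no rule of its own falls back to the `*` group and may still show as allowed even without an explicit Allow line.

```python
from urllib.robotparser import RobotFileParser

# User-agent tokens named in the checklist above.
AI_CRAWLERS = [
    "GPTBot",
    "ClaudeBot",
    "PerplexityBot",
    "Google-Extended",
    "Applebot-Extended",
]


def ai_crawler_access(robots_txt: str, url: str = "https://example.com/") -> dict[str, bool]:
    """Return, per AI crawler token, whether robots.txt permits fetching `url`.

    Hypothetical helper: parses the robots.txt text locally, no network calls.
    """
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return {agent: parser.can_fetch(agent, url) for agent in AI_CRAWLERS}


# Illustrative robots.txt, not taken from any real site.
SAMPLE = """\
User-agent: GPTBot
Allow: /

User-agent: ClaudeBot
Disallow: /

User-agent: *
Disallow: /private/
"""

if __name__ == "__main__":
    for agent, allowed in ai_crawler_access(SAMPLE).items():
        print(f"{agent}: {'allowed' if allowed else 'blocked'}")
```

With this sample, GPTBot is allowed by its own group, ClaudeBot is blocked by its own group, and the remaining tokens inherit the `*` group, which only blocks /private/.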
Why most sites score under 60 in 2026
Most enterprise sites were built before AI agents were a citation surface. They still optimise for Google blue-link SERPs: meta titles, schema, sitemap. AI agents look for additional signals: a curated llms.txt manifest, an explicit AI-crawler allow-list, Speakable spec on long-form content, and clean structured data on every page rather than only the homepage. The gap between "blue-link ready" and "AI-citation ready" is where most visibility is lost in 2026.
What you do with the score
Each check returns a 0-10 sub-score and a specific remediation hint. Run the audit on your own domain and the top 3-5 competitors in your category; the deltas usually tell you exactly which signals are driving citation share. Most fixes are engineering tasks: a one-time generator script for llms.txt, a robots.txt update, JSON-LD blocks added to page templates. We can help if you want a fixed-scope quote.
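If you want to reason about how sub-scores and weights might roll up into a single number, the Python sketch below shows one way to do it. This page does not state the actual formula, so treat the whole thing as an assumption: the listed weights total 85%, and the sketch rescales by that total so that a perfect run lands on 100. The key names and the functions `overall_score` and `score_deltas` are hypothetical.

```python
# Weights as listed in the checklist, in percent. They total 85, so this
# sketch rescales by the listed total. That rescaling is an assumption.
WEIGHTS = {
    "llms_txt": 15,
    "llms_full_txt": 8,
    "ai_crawler_allow_list": 12,
    "schema_org": 18,
    "open_graph_twitter": 10,
    "speakable": 8,
    "heading_hierarchy": 7,
    "xml_sitemap": 7,
}


def overall_score(sub_scores: dict[str, float]) -> float:
    """Combine 0-10 sub-scores into a 0-100 score (weighted, then rescaled).

    Checks missing from `sub_scores` count as 0.
    """
    total_weight = sum(WEIGHTS.values())
    weighted = sum(WEIGHTS[name] * sub_scores.get(name, 0.0) for name in WEIGHTS)
    return round(10 * weighted / total_weight, 1)


def score_deltas(yours: dict[str, float], theirs: dict[str, float]) -> dict[str, float]:
    """Per-check gap (competitor minus you); positive means they lead."""
    return {name: theirs.get(name, 0.0) - yours.get(name, 0.0) for name in WEIGHTS}


if __name__ == "__main__":
    mine = {name: 5 for name in WEIGHTS}
    rival = dict(mine, llms_txt=10, speakable=9)
    print(overall_score(mine), overall_score(rival))
    print({k: v for k, v in score_deltas(mine, rival).items() if v})
```

Sorting the deltas by weight times gap is one simple way to decide which fix to schedule first.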