www.theguardian.com

Report from 7/2/2026, 8:52:55 PM https://www.theguardian.com/
69
Overall score
weighted CAV (0–100)
FAIL
0–4950–8990–100

Metrics

100%
CRR Content Recovery Good
0.35
SSD Semantic Signal Density Poor
3,160 tok
TC Token Cost Good
58 ms
TTFUT Time to First Useful Token N/A

Token Cost breakdown

Where the page's tokens go (≈25,323 across regions). Most tokens are real content — the agent isn't paying much for chrome.

Content
85.9% · 21,753
Chrome (nav / header / footer)
13.4% · 3,397
Boilerplate (cookie / ad)
0% · 0
Other
0.7% · 173

Final screenshot

Final screenshot of https://www.theguardian.com/

Diagnostics

high SSD Low signal-to-noise for agents

signal 0.70 · JSON-LD 0/1 · missing: structured-data

Fix: Wrap the real content in <main>/<article>, cut repeated nav/boilerplate, and keep the primary content dense and early in the DOM.

Rendered profile: headless

Agent Discoverability 45/100 · Agent Blocked

Access & discovery checks — separate from the gated CAV metrics above. Click an issue for business impact, what we measured, and how to fix. · Take the Agent Readiness course →

Agent files & endpoints

llms.txt Absent at /llms.txt and /.well-known/llms.txt Learn →
robots.txt (AI bots) Blocks: * (all) Learn →
sitemap.xml No /sitemap.xml Learn →
JSON-LD structured data No JSON-LD found Learn →
~ agents.json Absent (emerging standard) Learn →
~ WebMCP endpoint Absent (emerging standard) Learn →
~ OpenAPI / API docs No OpenAPI/Swagger found Learn →

Issues (8)

robots.txt allows AI bots high impact Blocks: * (all)

Business impact If robots.txt blocks AI crawlers you are invisible to ChatGPT, Claude and Perplexity — they skip you and recommend a competitor instead.

What we measured We read /robots.txt and test it against 16 AI user-agents (GPTBot, ClaudeBot, PerplexityBot, …) for a Disallow that blocks them.

How to fix Allow major AI bots to public content; restrict only private paths (/admin, /api).

Learn how to implement →

User-agent: GPTBot
Allow: /
Disallow: /admin/

Spec: https://platform.openai.com/docs/gptbot

No CAPTCHA wall high impact Detected: recaptcha

Business impact CAPTCHAs stop bots — including the AI agents your customers send to shop or book. Content behind a challenge is unreachable.

What we measured We fingerprint reCAPTCHA, hCaptcha and Cloudflare Turnstile in the page.

How to fix Reserve CAPTCHA for login/checkout flows, never public content pages.

Spec: https://developers.cloudflare.com/turnstile/

llms.txt present high impact Absent at /llms.txt and /.well-known/llms.txt

Business impact llms.txt is the robots.txt for AI: it tells agents what your site is, what matters, and where to find it. Without it AI guesses — and guessing means inaccurate recommendations and lost visibility.

What we measured We fetch /llms.txt and /.well-known/llms.txt and validate the spec (H1 title + a one-line blockquote summary). We also note /llms-full.txt (your full content as Markdown).

How to fix Create /llms.txt with a short summary + key pages; optionally /llms-full.txt with full content in Markdown.

Learn how to implement →

# Your Site
> One-line description for AI agents.

## Key pages
- /products — catalog
- /pricing — plans
- /docs — documentation

Spec: https://llmstxt.org

Structured data (JSON-LD) medium impact No JSON-LD found

Business impact Schema.org JSON-LD tells agents what a page IS (product, article, business) with typed fields (price, rating, hours). Without it agents extract less reliably.

What we measured We parse <script type=application/ld+json>, validate it, and check for populated @type fields.

How to fix Add JSON-LD: Organization/LocalBusiness on the homepage, Product on product pages, Article on posts.

Learn how to implement →

<script type="application/ld+json">{"@context":"https://schema.org","@type":"Organization","name":"Your Co","url":"https://example.com"}</script>

Spec: https://schema.org/

XML sitemap present medium impact No /sitemap.xml

Business impact A sitemap is your table of contents for AI crawlers. Without it agents follow homepage links and miss deep pages (products, docs, pricing) — shrinking what they can recommend.

What we measured We fetch /sitemap.xml (and /sitemap_index.xml), confirm valid XML with <loc> entries, and check <lastmod> freshness.

How to fix Generate an XML sitemap of all public pages with current lastmod dates and reference it in robots.txt.

Learn how to implement →

# robots.txt
Sitemap: https://example.com/sitemap.xml

Spec: https://www.sitemaps.org/

~ agents.json discovery low impact Absent (emerging standard)

Business impact agents.json describes what your site can DO for agents (services, endpoints, capabilities) — an emerging discovery standard. Early adopters get native agent integration.

What we measured We check /agents.json and /.well-known/agents.json for a valid configuration.

How to fix Publish /agents.json describing your site's capabilities and actions.

Learn how to implement →

Spec: https://agents-json.org

~ WebMCP endpoint low impact Absent (emerging standard)

Business impact WebMCP lets agents call actions on your site directly (book, buy, query) instead of scraping the DOM. Early adopters get native AI-agent interoperability.

What we measured We check /.well-known/webmcp and /webmcp.json for a valid actions array.

How to fix Add a WebMCP endpoint exposing your key actions to agents.

Learn how to implement →

Spec: https://webmcp.org

~ API documentation low impact No OpenAPI/Swagger found

Business impact Programmatic agents prefer a typed API. An OpenAPI/Swagger spec lets them integrate without scraping.

What we measured We probe /openapi.json, /swagger.json, /api-docs and /.well-known/openapi.json.

How to fix Publish an OpenAPI spec at a well-known path.

Learn how to implement →

Spec: https://www.openapis.org/

Passed audits (4)

✓ No content-blocking cookie wall✓ Machine-readable prices✓ No login wall on public content✓ Server response (TTFB)
Full profile — how to improve · unused JS · network · timing

How to improve

highCut third-party weighthighest leverage

guim.co.uk — 678 KiB, 78 requests, 20 ms main-thread

EvidenceThird-party is 67.2% of total transfer (78 requests); guim.co.uk is the largest.
ImpactFewer bytes + main-thread stalls before your content settles for an agent.
Effort1–3 h
FixLazy-load, self-host, or remove guim.co.uk; defer non-critical third-party scripts.
mediumSplit the heaviest unused bundle

Notice.97af9.js — 93 KiB transferred, 71.9% unused

EvidenceCDP JS coverage: 67 KiB of this bundle is never executed on load.
Impact~-67 KiB transfer; lower Token Cost + faster agent parse.
Effort30–90 min
FixDynamic-import() the below-the-fold parts of Notice.97af9.js, or drop unused dependencies.

Third-party impact

78 third-party requests · 678 KiB (67.2% of transfer) · 20 ms main-thread — code an agent must also fetch/run before your content settles. Fewer, lazier third-parties = faster, cheaper agent reads.

Third-party domainReqsTransferMain-thread
guim.co.uk78678 KiB20 ms

Wasted JavaScript (by bundle)

Transfer-accurate — each bundle's transfer size × its unused %, ranked by wasted bytes (the biggest code-splitting wins). Unused JS also inflates Token Cost (TC).

BundleTransferUnusedWasted
https://sourcepoint.theguardian.com/Notice.97af9.js93 KiB71.9%67 KiB
https://assets.guim.co.uk/assets/6768.client.web.0735efb2fd224c936cf9.js 3P47 KiB80.9%38 KiB
https://sourcepoint.theguardian.com/unified/wrapperMessagingWithoutDetection.js43 KiB53%23 KiB
https://assets.guim.co.uk/assets/index.client.web.f325ca7f544b3b289247.js 3P47 KiB39.6%19 KiB
https://assets.guim.co.uk/assets/3141.client.web.ac54800689b432f5534c.js 3P26 KiB69%18 KiB
https://sourcepoint.theguardian.com/unified/4.40.1/gdpr-tcf.27718c8cb9d29947d2c1.bundle.js29 KiB55.6%16 KiB
https://assets.guim.co.uk/commercial/f2437f3ee970ddd33dfc/graun.standalone.commercial.js 3P17 KiB65.9%11 KiB
https://assets.guim.co.uk/assets/1772.client.web.a186f24ac91544734c64.js 3P11 KiB79.5%9 KiB
https://assets.guim.co.uk/assets/7638.client.web.9d541c811d88f25af734.js 3P13 KiB58.4%8 KiB
https://assets.guim.co.uk/assets/7007.client.web.394554cec0bbd188fd41.js 3P7 KiB89.1%6 KiB
https://assets.guim.co.uk/assets/9883.client.web.8665e0b40aa5591bd877.js 3P7 KiB80.6%6 KiB
https://assets.guim.co.uk/assets/1263.client.web.8787aa23171ec4509ced.js 3P7 KiB82.3%5 KiB
https://assets.guim.co.uk/assets/ScrollableHighlights-island.client.web.b994022df633ec6f53c0.js 3P9 KiB51.2%5 KiB
https://assets.guim.co.uk/assets/EditionSwitcherBanner-island.client.web.e486b879549f8b3ac84b.js 3P5 KiB87.1%4 KiB
https://assets.guim.co.uk/assets/StickyBottomBanner-island.client.web.803408caa0dcc66d9dc5.js 3P6 KiB61.4%4 KiB

Network

102Requests
1010 KiBTransferred
47Scripts
67.2%3rd-party
1Long tasks
Script (47)
458 KiB
Font (8)
185 KiB
Image (20)
179 KiB
Document (2)
144 KiB
Other (7)
22 KiB
XHR (3)
11 KiB
Stylesheet (2)
7 KiB
Fetch (12)
2 KiB
Manifest (1)
1 KiB

Heaviest requests

URLTypeStatusTransfer
https://www.theguardian.com/internationalDocument200142 KiB
https://sourcepoint.theguardian.com/Notice.97af9.jsScript20093 KiB
https://assets.guim.co.uk/assets/index.client.web.f325ca7f544b3b289247.jsScript20047 KiB
https://assets.guim.co.uk/assets/6768.client.web.0735efb2fd224c936cf9.jsScript20047 KiB
https://sourcepoint.theguardian.com/unified/wrapperMessagingWithoutDetection.jsScript20043 KiB
https://interactive.guim.co.uk/fonts/guss-webfonts/GuardianTextSansWeb/GuardianTextSansWeb-Bold.woff2Font20039 KiB
https://interactive.guim.co.uk/fonts/guss-webfonts/GuardianTextSansWeb/GuardianTextSansWeb-Regular.woff2Font20035 KiB
https://sourcepoint.theguardian.com/unified/4.40.1/gdpr-tcf.27718c8cb9d29947d2c1.bundle.jsScript20029 KiB
https://i.guim.co.uk/img/media/c04d724d4dd46313ab6806976c950eee2799c072/1039_0_5843_4675/master/5843.jpg?width=620&dpr=1&s=none&crop=5%3A4Image20029 KiB
https://interactive.guim.co.uk/fonts/garnett/GTGuardianTitlepiece-Bold.woff2Font20026 KiB
https://assets.guim.co.uk/assets/3141.client.web.ac54800689b432f5534c.jsScript20026 KiB
https://i.guim.co.uk/img/media/81fa43c5d6ce2b397830713cf44e7d6282bf1d77/317_0_1350_1080/1350.jpg?width=620&dpr=1&s=none&crop=5%3A4Image20025 KiB
https://i.guim.co.uk/img/media/5539ea739fb958f1e154181a8b0dd3fbbfde36d7/163_0_1666_1333/master/1666.jpg?width=460&dpr=1&s=none&crop=5%3A4Image20023 KiB
https://assets.guim.co.uk/assets/frameworks.client.web.43c344cdcf55a381da73.jsOther20020 KiB
https://i.guim.co.uk/img/media/66962b99fe9905775a724f88778672fa5eaebdc4/185_0_4164_3333/master/4164.jpg?width=460&dpr=1&s=none&crop=5%3A4Image20019 KiB
https://assets.guim.co.uk/static/frontend/fonts/guardian-textsans/noalts-not-hinted/GuardianTextSans-Bold.woff2Font20018 KiB
https://assets.guim.co.uk/commercial/f2437f3ee970ddd33dfc/graun.standalone.commercial.jsScript20017 KiB
https://assets.guim.co.uk/static/frontend/fonts/guardian-headline/noalts-not-hinted/GHGuardianHeadline-Medium.woff2Font20017 KiB
https://assets.guim.co.uk/static/frontend/fonts/guardian-textegyptian/noalts-not-hinted/GuardianTextEgyptian-Regular.woff2Font20017 KiB
https://assets.guim.co.uk/static/frontend/fonts/guardian-headline/noalts-not-hinted/GHGuardianHeadline-Bold.woff2Font20017 KiB
https://assets.guim.co.uk/static/frontend/fonts/guardian-textsans/noalts-not-hinted/GuardianTextSans-Regular.woff2Font20016 KiB
https://i.guim.co.uk/img/media/5cb1e4aa1fc07a824e514c83ce5fc32c468bc225/274_0_2738_2190/master/2738.jpg?width=220&dpr=1&s=none&crop=5%3A4Image20014 KiB
https://assets.guim.co.uk/assets/7638.client.web.9d541c811d88f25af734.jsScript20013 KiB
https://assets.guim.co.uk/assets/1772.client.web.a186f24ac91544734c64.jsScript20011 KiB
https://i.guim.co.uk/img/media/2d1463bd0646bf674ebe3830667974781a4cc649/317_0_7202_5760/master/7202.jpg?width=220&dpr=1&s=none&crop=5%3A4Image20011 KiB
https://i.guim.co.uk/img/media/531c8a8c7c56b2b56438b7e866f8a1052de523e9/0_101_1596_1276/master/1596.jpg?width=220&dpr=1&s=none&crop=5%3A4Image20010 KiB
https://assets.guim.co.uk/assets/Titlepiece-island.client.web.24c6e15e165b074b7cc6.jsScript20010 KiB
https://sourcepoint.theguardian.com/wrapper/v2/messages?hasCsp=true&env=prod&body=%7B%22accountId%22%3A1257%2C%22campaignEnv%22%3A%22prod%22%2C%22campaigns%22%3A%7B%22gdpr%22%3A%7B%22consentStatus%22%3A%7B%7D%2C%22targetingParams%22%3A%7B%22framework%22%3A%22tcfv2%22%2C%22excludePage%22%3Afalse%2C%22isCorP%22%3Afalse%2C%22isUserSignedIn%22%3Afalse%2C%22corPCurrency%22%3A%22GBP%22%7D%7D%7D%2C%22clientMMSOrigin%22%3A%22https%3A%2F%2Fsourcepoint.theguardian.com%22%2C%22hasCSP%22%3Atrue%2C%22includeData%22%3A%7B%22localState%22%3A%7B%22type%22%3A%22string%22%7D%2C%22actions%22%3A%7B%22type%22%3A%22RecordString%22%7D%2C%22cookies%22%3A%7B%22type%22%3A%22RecordString%22%7D%7D%2C%22propertyHref%22%3A%22https%3A%2F%2Ftest.theguardian.com%2Finternational%22%2C%22propertyId%22%3A7417%7D&localState=null&metadata=%7B%22gdpr%22%3A%7B%22applies%22%3Atrue%7D%7D&nonKeyedLocalState=null&ch=514977235514977235fda1&scriptVersion=4.40.1&scriptType=unifiedXHR20010 KiB
https://assets.guim.co.uk/assets/ScrollableHighlights-island.client.web.b994022df633ec6f53c0.jsScript2009 KiB
https://i.guim.co.uk/img/media/6e7b37b3b665d03e4bbb7dc3214238b3619a6711/443_0_2218_1774/master/2218.jpg?width=220&dpr=1&s=none&crop=5%3A4Image2008 KiB

Long tasks (>50 ms)

StartDuration
119 ms75 ms
Analyzing…
running mobile + desktop · ~30s