www.expedia.com

Report from 7/2/2026, 2:18:28 PM https://www.expedia.com
Agent blocked: bot-block (raw HTTP 429, only 42 tokens recovered) — scores reflect the block page, not your content.
71
Overall score
weighted CAV (0–100)
BLOCKED
0–4950–8990–100

Metrics

72%
CRR Content Recovery Poor
0.50
SSD Semantic Signal Density Needs work
42 tok
TC Token Cost Good
133 ms
TTFUT Time to First Useful Token N/A

Final screenshot

Final screenshot of https://www.expedia.com

Diagnostics

high CRR Agents are blocked before they see the page

A non-browser agent got a block/challenge (bot-block (raw HTTP 429, only 42 tokens recovered)). Every score below is measured against that wall, not your content.

Fix: Allowlist legitimate agent user-agents / IP ranges in your WAF or bot-management rules, and serve real content (not a challenge) to them.

high CRR Content is hidden behind JavaScript

28% of content requires JS · 72% of rendered content recovered (rest is placeholder/wrong)

Fix: Server-render or statically generate the main content so a non-JS agent still receives it; make client rendering a progressive enhancement, not the source of truth.

medium SSD Low signal-to-noise for agents

signal 1.00 · JSON-LD 0/1 · missing: structured-data

Fix: Wrap the real content in <main>/<article>, cut repeated nav/boilerplate, and keep the primary content dense and early in the DOM.

Rendered profile: headless

Agent Discoverability 57/100 · Needs Work

Access & discovery checks — separate from the gated CAV metrics above. Click an issue for business impact, what we measured, and how to fix. · Take the Agent Readiness course →

Agent files & endpoints

llms.txt Found at /llms.txt Learn →
robots.txt (AI bots) Blocks: * (all) Learn →
sitemap.xml No /sitemap.xml Learn →
JSON-LD structured data No JSON-LD found Learn →
~ agents.json Absent (emerging standard) Learn →
~ WebMCP endpoint Absent (emerging standard) Learn →
~ OpenAPI / API docs No OpenAPI/Swagger found Learn →

Issues (7)

robots.txt allows AI bots high impact Blocks: * (all)

Business impact If robots.txt blocks AI crawlers you are invisible to ChatGPT, Claude and Perplexity — they skip you and recommend a competitor instead.

What we measured We read /robots.txt and test it against 16 AI user-agents (GPTBot, ClaudeBot, PerplexityBot, …) for a Disallow that blocks them.

How to fix Allow major AI bots to public content; restrict only private paths (/admin, /api).

Learn how to implement →

User-agent: GPTBot
Allow: /
Disallow: /admin/

Spec: https://platform.openai.com/docs/gptbot

No CAPTCHA wall high impact Detected: recaptcha

Business impact CAPTCHAs stop bots — including the AI agents your customers send to shop or book. Content behind a challenge is unreachable.

What we measured We fingerprint reCAPTCHA, hCaptcha and Cloudflare Turnstile in the page.

How to fix Reserve CAPTCHA for login/checkout flows, never public content pages.

Spec: https://developers.cloudflare.com/turnstile/

Structured data (JSON-LD) medium impact No JSON-LD found

Business impact Schema.org JSON-LD tells agents what a page IS (product, article, business) with typed fields (price, rating, hours). Without it agents extract less reliably.

What we measured We parse <script type=application/ld+json>, validate it, and check for populated @type fields.

How to fix Add JSON-LD: Organization/LocalBusiness on the homepage, Product on product pages, Article on posts.

Learn how to implement →

<script type="application/ld+json">{"@context":"https://schema.org","@type":"Organization","name":"Your Co","url":"https://example.com"}</script>

Spec: https://schema.org/

XML sitemap present medium impact No /sitemap.xml

Business impact A sitemap is your table of contents for AI crawlers. Without it agents follow homepage links and miss deep pages (products, docs, pricing) — shrinking what they can recommend.

What we measured We fetch /sitemap.xml (and /sitemap_index.xml), confirm valid XML with <loc> entries, and check <lastmod> freshness.

How to fix Generate an XML sitemap of all public pages with current lastmod dates and reference it in robots.txt.

Learn how to implement →

# robots.txt
Sitemap: https://example.com/sitemap.xml

Spec: https://www.sitemaps.org/

~ agents.json discovery low impact Absent (emerging standard)

Business impact agents.json describes what your site can DO for agents (services, endpoints, capabilities) — an emerging discovery standard. Early adopters get native agent integration.

What we measured We check /agents.json and /.well-known/agents.json for a valid configuration.

How to fix Publish /agents.json describing your site's capabilities and actions.

Learn how to implement →

Spec: https://agents-json.org

~ WebMCP endpoint low impact Absent (emerging standard)

Business impact WebMCP lets agents call actions on your site directly (book, buy, query) instead of scraping the DOM. Early adopters get native AI-agent interoperability.

What we measured We check /.well-known/webmcp and /webmcp.json for a valid actions array.

How to fix Add a WebMCP endpoint exposing your key actions to agents.

Learn how to implement →

Spec: https://webmcp.org

~ API documentation low impact No OpenAPI/Swagger found

Business impact Programmatic agents prefer a typed API. An OpenAPI/Swagger spec lets them integrate without scraping.

What we measured We probe /openapi.json, /swagger.json, /api-docs and /.well-known/openapi.json.

How to fix Publish an OpenAPI spec at a well-known path.

Learn how to implement →

Spec: https://www.openapis.org/

Passed audits (5)

✓ No content-blocking cookie wall✓ Machine-readable prices✓ llms.txt present + valid✓ No login wall on public content✓ Server response (TTFB)
Full profile — how to improve · unused JS · network · timing

A deeper scan (a second render, ~30–60s): network waterfall, unused JavaScript, long tasks, and prioritized fixes. Runs only when you ask; the result is cached so it never re-runs.

Analyzing…
running mobile + desktop · ~30s