www.yelp.com

Report from 7/2/2026, 2:15:30 PM https://www.yelp.com
Agent blocked: bot-block (raw HTTP 403, only 0 tokens recovered) — scores reflect the block page, not your content.
21
Overall score
weighted CAV (0–100)
BLOCKED
0–4950–8990–100

Metrics

0%
CRR Content Recovery Poor
0.00
SSD Semantic Signal Density Poor
0 tok
TC Token Cost Good
TTFUT Time to First Useful Token N/A

Final screenshot

Final screenshot of https://www.yelp.com

Diagnostics

high CRR Agents are blocked before they see the page

A non-browser agent got a block/challenge (bot-block (raw HTTP 403, only 0 tokens recovered)). Every score below is measured against that wall, not your content.

Fix: Allowlist legitimate agent user-agents / IP ranges in your WAF or bot-management rules, and serve real content (not a challenge) to them.

high CRR Content is hidden behind JavaScript

rendered page extracted no content (error state)

Fix: Server-render or statically generate the main content so a non-JS agent still receives it; make client rendering a progressive enhancement, not the source of truth.

high SSD Low signal-to-noise for agents

signal 0.00 · JSON-LD 0/1 · missing: structured-data

Fix: Wrap the real content in <main>/<article>, cut repeated nav/boilerplate, and keep the primary content dense and early in the DOM.

Rendered profile: headless

Agent Discoverability 27/100 · Agent Blocked

Access & discovery checks — separate from the gated CAV metrics above. Click an issue for business impact, what we measured, and how to fix. · Take the Agent Readiness course →

Agent files & endpoints

llms.txt Absent at /llms.txt and /.well-known/llms.txt Learn →
robots.txt (AI bots) Blocks: * (all) Learn →
sitemap.xml No /sitemap.xml Learn →
JSON-LD structured data No JSON-LD found Learn →
~ agents.json Absent (emerging standard) Learn →
~ WebMCP endpoint Absent (emerging standard) Learn →
~ OpenAPI / API docs No OpenAPI/Swagger found Learn →

Issues (10)

robots.txt allows AI bots high impact Blocks: * (all)

Business impact If robots.txt blocks AI crawlers you are invisible to ChatGPT, Claude and Perplexity — they skip you and recommend a competitor instead.

What we measured We read /robots.txt and test it against 16 AI user-agents (GPTBot, ClaudeBot, PerplexityBot, …) for a Disallow that blocks them.

How to fix Allow major AI bots to public content; restrict only private paths (/admin, /api).

Learn how to implement →

User-agent: GPTBot
Allow: /
Disallow: /admin/

Spec: https://platform.openai.com/docs/gptbot

No CAPTCHA wall high impact Detected: captcha-delivery

Business impact CAPTCHAs stop bots — including the AI agents your customers send to shop or book. Content behind a challenge is unreachable.

What we measured We fingerprint reCAPTCHA, hCaptcha and Cloudflare Turnstile in the page.

How to fix Reserve CAPTCHA for login/checkout flows, never public content pages.

Spec: https://developers.cloudflare.com/turnstile/

~ Machine-readable prices medium impact No prices found (fine for non-commercial sites)

Business impact When a user asks an agent 'how much does X cost', it needs readable pricing. Hidden, JS-only or 'request a quote' pricing means the agent can't answer — and recommends someone who can.

What we measured We scan the HTML for price patterns and Schema.org Offer/PriceSpecification markup.

How to fix Expose prices (or a starting-from range) in HTML and add Schema.org Offer.

<script type="application/ld+json">{"@type":"Offer","price":"29.90","priceCurrency":"EUR"}</script>

Spec: https://schema.org/Offer

llms.txt present high impact Absent at /llms.txt and /.well-known/llms.txt

Business impact llms.txt is the robots.txt for AI: it tells agents what your site is, what matters, and where to find it. Without it AI guesses — and guessing means inaccurate recommendations and lost visibility.

What we measured We fetch /llms.txt and /.well-known/llms.txt and validate the spec (H1 title + a one-line blockquote summary). We also note /llms-full.txt (your full content as Markdown).

How to fix Create /llms.txt with a short summary + key pages; optionally /llms-full.txt with full content in Markdown.

Learn how to implement →

# Your Site
> One-line description for AI agents.

## Key pages
- /products — catalog
- /pricing — plans
- /docs — documentation

Spec: https://llmstxt.org

No login wall on public content high impact HTTP 403

Business impact Public content behind authentication is unreachable to agents (they have no account). Gate only genuinely private material.

What we measured We follow redirects and detect 401/403 and login-form signals on content pages.

How to fix Separate public marketing/docs from authenticated areas; keep public pages reachable without login.

Spec: https://llmstxt.org

Structured data (JSON-LD) medium impact No JSON-LD found

Business impact Schema.org JSON-LD tells agents what a page IS (product, article, business) with typed fields (price, rating, hours). Without it agents extract less reliably.

What we measured We parse <script type=application/ld+json>, validate it, and check for populated @type fields.

How to fix Add JSON-LD: Organization/LocalBusiness on the homepage, Product on product pages, Article on posts.

Learn how to implement →

<script type="application/ld+json">{"@context":"https://schema.org","@type":"Organization","name":"Your Co","url":"https://example.com"}</script>

Spec: https://schema.org/

XML sitemap present medium impact No /sitemap.xml

Business impact A sitemap is your table of contents for AI crawlers. Without it agents follow homepage links and miss deep pages (products, docs, pricing) — shrinking what they can recommend.

What we measured We fetch /sitemap.xml (and /sitemap_index.xml), confirm valid XML with <loc> entries, and check <lastmod> freshness.

How to fix Generate an XML sitemap of all public pages with current lastmod dates and reference it in robots.txt.

Learn how to implement →

# robots.txt
Sitemap: https://example.com/sitemap.xml

Spec: https://www.sitemaps.org/

~ agents.json discovery low impact Absent (emerging standard)

Business impact agents.json describes what your site can DO for agents (services, endpoints, capabilities) — an emerging discovery standard. Early adopters get native agent integration.

What we measured We check /agents.json and /.well-known/agents.json for a valid configuration.

How to fix Publish /agents.json describing your site's capabilities and actions.

Learn how to implement →

Spec: https://agents-json.org

~ WebMCP endpoint low impact Absent (emerging standard)

Business impact WebMCP lets agents call actions on your site directly (book, buy, query) instead of scraping the DOM. Early adopters get native AI-agent interoperability.

What we measured We check /.well-known/webmcp and /webmcp.json for a valid actions array.

How to fix Add a WebMCP endpoint exposing your key actions to agents.

Learn how to implement →

Spec: https://webmcp.org

~ API documentation low impact No OpenAPI/Swagger found

Business impact Programmatic agents prefer a typed API. An OpenAPI/Swagger spec lets them integrate without scraping.

What we measured We probe /openapi.json, /swagger.json, /api-docs and /.well-known/openapi.json.

How to fix Publish an OpenAPI spec at a well-known path.

Learn how to implement →

Spec: https://www.openapis.org/

Passed audits (2)

✓ No content-blocking cookie wall✓ Server response (TTFB)
Full profile — how to improve · unused JS · network · timing

How to improve

goodWell optimized

No major issues found in this profile — content is recoverable, JS is lean, and the page is responsive for both humans and agents.

JavaScript coverage

8 KiB unused of 14 KiB (54% unused JS) — unused JS inflates Token Cost (TC) and slows the agent's parse.

ScriptSizeUnused
https://ct.captcha-delivery.com/c.js14 KiB8 KiB (56%)
https://www.yelp.com/0 KiB0 KiB (0%)

Network

4Requests
31 KiBTransferred
1Scripts
0%3rd-party
0Long tasks
Other (1)
15 KiB
Script (1)
14 KiB
Document (2)
1 KiB

Heaviest requests

URLTypeStatusTransfer
https://www.yelp.com/favicon.icoOther20015 KiB
https://ct.captcha-delivery.com/c.jsScript20014 KiB
https://www.yelp.com/Document4031 KiB
https://geo.captcha-delivery.com/captcha/?initialCid=AHrlqAAAAAMApgM7TUR--EgArNomug%3D%3D&hash=3BD2468BAE4D73BEA0B5DE8314D745&cid=PJV8W74_kOXxvqiNUEYX9WCLkD~~dTjG4jzvoutAGws1_7laNQZXvurDVQthHCORBUdRdheAy72nXIsDxw9xyEIctJdk3mzk1SC9yASXbkj_AlrBMc6pHb7K6bifEv_4&t=fe&referer=https%3A%2F%2Fwww.yelp.com%2F&s=52679&e=8c363957488c266ccecf4a260780819f5d9043fa47174c1d6e8e4274684b4795cdb2c339418051e3d68fe94ab12d4d72&dm=cdDocument2000 KiB
Analyzing…
running mobile + desktop · ~30s