Content Checks
C1 Markdown content negotiation, C2 llms.txt, C3 structured HTML.
Agents don't render HTML the way a browser does. They strip it down, look for headings and meta, and try to extract clean prose. Help them by exposing content in formats they don't have to fight.
C1 — Markdown content negotiation (weight 8)
AIScan re-requests your homepage with Accept: text/markdown. If the server replies with atext/markdown content-type and the body looks like markdown (headings, lists, links), the check passes.
How to fix
Two approaches:
- Content negotiation — at the edge or in your framework, when
Acceptincludestext/markdown, render the same page as Markdown instead of HTML. - Sidecar URLs — also serve
/path/index.mdalongside/path. Easier for static sites.
C2 — /llms.txt (weight 6)
Per llmstxt.org, /llms.txt is a markdown file that gives agents a curated index of your site. AIScan checks that the file exists, returns 200, and contains an H1, at least one ## section, and markdown links.
How to fix
Drop a file at the root of your site that looks like:
# Acme Inc.
> A short summary of what Acme does — one or two sentences.
## Docs
- [Getting started](https://acme.com/docs/getting-started.md)
- [API reference](https://acme.com/docs/api.md)
## Examples
- [Example app](https://acme.com/examples/app.md)C3 — Structured HTML (weight 4)
Four basic signals on your homepage: a single <h1>, a non-empty <title>, a <meta name="description">, and at least one application/ld+jsonblock. One point each.
How to fix
For JSON-LD, start with an Organization or WebSite schema sitewide, then add page-type-specific schemas (Article, Product, FAQPage) on the pages that match. Validators: schema.org validator.