Schema Markup for AI: Structured Data That Helps LLMs Understand You
65% of pages cited by Google AI Mode include structured data. For ChatGPT, that number is 71%. Schema markup has gone from an SEO nice-to-have to a prerequisite for AI visibility.
The reason is straightforward. When an LLM retrieves your page, it processes raw text. Schema markup adds a layer of explicit, machine-readable context that tells the model exactly what your content represents: who wrote it, when it was published, what entity it describes, how products are rated. Without that context, the model has to guess. Models that guess cite less.
This guide covers the schema types that matter for LLM visibility, how to implement them with JSON-LD, and how to prioritize which pages to mark up first.
Why schema markup matters more for AI than it did for traditional search
In traditional SEO, schema markup earned you rich snippets. Stars on a review, a FAQ accordion, a recipe card. Useful, but optional. Pages ranked fine without it.
AI search works differently. When ChatGPT, Perplexity, or Google AI Overviews generate a response, they pull from multiple sources and synthesize an answer. The retrieval step uses signals similar to traditional search. But the synthesis step is where schema makes the difference.
Schema gives AI systems three things plain HTML doesn't:
- Entity clarity. Schema tells the model that "Acme Corp" is an Organization, not just a string of text. It defines relationships between entities (this Person is the author of this Article, this Product has this AggregateRating).
- Fact extraction. Structured fields like
datePublished,price,ratingValue, andaddressare unambiguous. The model doesn't need to parse a sentence to find the publication date. It reads the field directly. - Confidence signals. When a model can verify facts through structured data rather than inferring them from prose, it cites with more confidence. A controlled experiment by Aiso found a 30% improvement in accuracy, completeness, and presentation quality when ChatGPT processed pages with schema versus identical pages without it.
Schema-compliant pages get cited 3.1x more frequently in Google AI Overviews than pages without structured data. That gap will widen as AI systems handle a larger share of search queries.
The schema types that actually help LLMs
Not all 800+ schema.org types matter for AI visibility. Five types carry most of the weight.
Article and BlogPosting
Article schema tells AI systems the basics: who wrote this, when it was published, when it was last updated, and what topic it covers. This is the minimum you should have on every content page.
{
"@context": "https://schema.org",
"@type": "Article",
"headline": "Schema Markup for AI: Structured Data That Helps LLMs Understand You",
"author": {
"@type": "Organization",
"name": "Serps.io",
"url": "https://serps.io"
},
"publisher": {
"@type": "Organization",
"name": "Serps.io"
},
"datePublished": "2026-03-25",
"dateModified": "2026-03-25",
"description": "How to implement schema markup that helps LLMs find and cite your content."
}
The dateModified field is particularly important. AI systems have a strong recency bias, with 95% of ChatGPT citations coming from content updated within 10 months. A current dateModified value signals that your content is maintained and trustworthy.
Organization
Organization schema anchors your brand identity. It tells LLMs who you are, where you're located, and how to describe you. Without it, AI systems have to piece together your identity from scattered mentions across the web.
{
"@context": "https://schema.org",
"@type": "Organization",
"name": "Serps.io",
"url": "https://serps.io",
"logo": "https://serps.io/logo.png",
"description": "AI search analytics and SERP tracking platform.",
"sameAs": [
"https://twitter.com/serpsio",
"https://linkedin.com/company/serpsio"
]
}
The sameAs property is where you connect your brand across platforms. LLMs use these links to build a more complete picture of your entity, which directly affects how they describe you in responses. If you're working on brand mentions for AI visibility, Organization schema is the foundation.
FAQPage
FAQ schema maps directly to how users query AI systems. When someone asks ChatGPT a question and your FAQ schema contains that exact question with a clear answer, the model has a pre-packaged, citable response.
{
"@context": "https://schema.org",
"@type": "FAQPage",
"mainEntity": [
{
"@type": "Question",
"name": "Does schema markup help with AI search?",
"acceptedAnswer": {
"@type": "Answer",
"text": "Yes. Pages with structured data are cited 3.1x more frequently in AI Overviews. Schema markup provides explicit, machine-readable context that helps LLMs extract facts and cite sources with higher confidence."
}
},
{
"@type": "Question",
"name": "What schema types matter most for LLMs?",
"acceptedAnswer": {
"@type": "Answer",
"text": "Article, Organization, FAQPage, Person, and Product schema have the most impact on LLM visibility. Article and Organization are foundational. FAQPage directly maps to conversational AI queries."
}
}
]
}
Pages with FAQ or HowTo schema are 78% more likely to be cited by AI systems. Keep each answer to 40-60 words, self-contained, and factually complete. For more on why self-contained answers matter, see our guide on structuring content for AI citations.
Person
Person schema establishes author identity and expertise, both critical E-E-A-T signals that AI systems use to decide what to cite. Attach Person schema to every author page and reference it from your Article schema.
{
"@context": "https://schema.org",
"@type": "Person",
"name": "Jane Smith",
"jobTitle": "Head of SEO",
"worksFor": {
"@type": "Organization",
"name": "Serps.io"
},
"url": "https://serps.io/team/jane-smith",
"sameAs": [
"https://linkedin.com/in/janesmith",
"https://twitter.com/janesmith"
]
}
Connecting Person schema to Article schema via the author property creates an explicit author-content relationship that LLMs can follow. This is stronger than just putting an author name in a byline.
Product and AggregateRating
For product pages, Product schema with AggregateRating is what gets you into AI-generated shopping recommendations. Products with comprehensive schema markup appear 3-5x more frequently in AI-generated product comparisons.
{
"@context": "https://schema.org",
"@type": "Product",
"name": "Serps.io Pro Plan",
"description": "AI search analytics with daily SERP tracking and citation monitoring.",
"offers": {
"@type": "Offer",
"price": "99",
"priceCurrency": "USD"
},
"aggregateRating": {
"@type": "AggregateRating",
"ratingValue": "4.8",
"reviewCount": "142"
}
}
The Aiso experiment confirmed this: user ratings and compliance certifications only appeared in ChatGPT responses when schema markup was present. Without it, ChatGPT ignored those data points entirely, even though they were visible on the page.
How to implement schema markup with JSON-LD
JSON-LD is the format all major AI systems read. Google, Bing, Perplexity, and ChatGPT all process JSON-LD. Over 53% of all websites already use it. If you're using a different format (Microdata, RDFa), switch to JSON-LD.
Where to place it
Add JSON-LD in a <script type="application/ld+json"> tag in the <head> of each page. In Next.js, you can add it directly in your page component or layout:
export default function ArticlePage({ article }) {
const jsonLd = {
"@context": "https://schema.org",
"@type": "Article",
"headline": article.title,
"author": {
"@type": "Organization",
"name": "Serps.io",
"url": "https://serps.io",
},
"datePublished": article.publishedAt,
"dateModified": article.updatedAt,
"description": article.excerpt,
}
return (
<>
<script
type="application/ld+json"
dangerouslySetInnerHTML={{ __html: JSON.stringify(jsonLd) }}
/>
{/* page content */}
</>
)
}
One type per page, or multiple?
You can stack multiple schema types on a single page. An article page might include Article, Organization, and FAQPage schema. A product page might include Product, AggregateRating, and Organization.
The key is that every schema type you add should reflect content that's actually on the page. Adding FAQPage schema to a page with no visible FAQ section violates Google's structured data guidelines and can result in manual actions.
Validation
Test every page with Google's Rich Results Test and the Schema.org Validator. Fix errors before deploying. Common mistakes that break schema for AI systems:
- Missing required fields (Article without
headline, Product withoutname) - Mismatched content (schema says one price, page says another)
- Invalid date formats (use ISO 8601:
2026-03-25) - Nesting errors (Organization inside Article should use
@type, not a plain string)
Which pages to mark up first
You can't mark up every page at once. Here's how to prioritize based on AI citation potential.
Tier 1: pages already ranking in the top 10
76% of AI Overview sources come from the organic top 10. If a page already ranks well in traditional search, adding schema to it has the highest probability of generating AI citations. Start here.
For these pages, add Article (or Product) schema, Organization schema, and FAQ schema if the page answers specific questions.
Tier 2: high-value content pages
Blog posts, guides, and resource pages that demonstrate topical authority. These pages may not rank #1 for competitive terms, but they contain the kind of factual, well-sourced content that LLMs prefer to cite.
Add Article schema with complete author information and FAQ schema for any question-answer content.
Tier 3: brand and product pages
Your homepage, about page, pricing page, and key product pages. Organization schema here ensures LLMs have accurate brand information. Product schema on pricing pages gets you into AI-generated comparisons.
Tier 4: everything else
Category pages, tag pages, archive pages. These are lower priority because they typically contain thin content that LLMs are less likely to cite. Add basic WebPage or CollectionPage schema to complete your site's structured data coverage.
Measuring whether schema helps your AI visibility
Adding schema without measuring the impact is guessing. Here's what to track.
Before adding schema, document your baseline:
- Which pages appear in AI Overviews (check Google Search Console's search appearance filters)
- Citation frequency in ChatGPT and Perplexity for your target queries
- Rich result appearances in traditional search
After adding schema, check the same metrics at 30, 60, and 90 days. Schema changes don't take effect instantly. Google needs to recrawl and reprocess your pages. LLMs need to re-index your content.
Track at the page level, not the site level. A site-wide average will dilute the signal. You want to see whether specific pages that received schema markup are getting cited more than they were before.
If you're tracking AI search visibility at scale, tools like Serps.io monitor AI Overview appearances and citation patterns across your keyword set, so you can tie schema changes directly to visibility changes.
Common mistakes that waste your time
Marking up content that isn't on the page. If your FAQ schema contains questions and answers that don't appear in the visible page content, Google can penalize you and AI systems won't trust the markup. Schema must reflect what users actually see.
Ignoring dateModified. A page with Article schema but a stale dateModified value signals to AI systems that the content is outdated. Update this field every time you revise the content, and actually revise the content when you update the date.
Using schema generators without reviewing the output. Automated tools often produce valid but incomplete markup. They'll generate an Article with a headline and author but skip dateModified, description, and publisher. Review and complete every schema block manually.
Adding schema but neglecting content quality. Schema markup makes your content easier for AI systems to read. It doesn't make bad content worth citing. If the content itself doesn't follow AI citation best practices (answer-first structure, inline citations, factual density), schema alone won't save it.
Where this is heading
Microsoft confirmed in March 2025 that its LLMs use structured data to interpret web content. Google's AI Overviews pull disproportionately from pages with schema. ChatGPT's web browsing capabilities increasingly rely on structured signals to verify and cite information.
The pattern is clear: as AI systems handle more search queries, structured data becomes the primary way to communicate with them. The pages that speak the language AI systems understand (explicit entities, clean relationships, verified facts in machine-readable format) are the pages that get cited.
Start with Article and Organization schema on your highest-ranking pages. Add FAQ schema where you answer specific questions. Validate everything. Measure the results. For the broader strategy behind getting cited by AI systems, see our guides on answer engine optimization, generative engine optimization, and the differences between SEO, GEO, and AEO.