31. Rendered Text Content
Metadata and Contextual Signals
title tags, meta descriptions, and schema
Semantic Structure
the meaning, sentiment and tone of content.
Content Hierarchy
headings, markup, structure, clarity.
Language Patterns (EEAT?)
expertise, clarity & comprehensiveness
Interpretation:
entities, relationships, and factual density
Vector embedding & representation
Full DOM crawling
Hidden SEO hacks & Black Hat
(e.g. cloaking, keyword stuffing)
Link equity/authority
in the traditional PageRank sense
Robots.txt restrictions
unless respected by the platform…
Doesn’t ingest
Does ingest
33. Jarno van Driel’s view…
"In the age of LLMs the role of markup
remains the same; It's still about enabling,
well-documented features in a cost-effective
way. Nothing more, and nothing less."
35. Google’s view on 21st May…
Make sure structured data matches the visible content
Structured data is useful for sharing information about your
content in a machine-readable way that our systems consider
and makes pages eligible for
certain search features and rich results.
39. Schema adds structured meaning;
Markdown adds structured format
-
they work better together
40. What exactly is llms.txt?
● Guide/Overview of a site for LLMs
● Improves AI understanding & retrieval
● Complements structured data (not
replaces it)
● Can futureproof visibility beyond search
engines
● llms-full.txt extends directives and content
● Live example: yoast.com/llms.txt
# Title
> Optional description goes here
Optional details go here
## Section name
- [Link title](https://link_url):
Optional link details
## Optional
- [Link title](https://link_url)
41. schema.org llms.txt
Audience Search Engines LLMs / AI Assistants
Purpose Defines what content means Defines how content is structured
Strengths
Semantic depth, entities and
relationship rich results
Lightweight, human-readable, AI-
ingestible
Limits
Needs dev work,
can’t guide LLMs alone
Lacks semantic meaning,
can’t power rich results
45. Easy to produce
Easy to upload
Isn’t time consuming or resource
intensive
Costs nothing
Doesn’t need [much] maintenance
Several early adopters
Offers competitive advantage
Supports standardisation
Still early days
Lack of mass adoption
Subjective opinion
Can be abused/manipulated
Some SEOs say it’s BS
LLMs won’t need it
Arguments
against
Arguments for
50. What is the MCP?
“MCP is an open protocol that standardizes
how applications provide context to LLMs.
Think of MCP like a USB-C port for AI
applications… provid[ing] a standardized way
to connect AI models to different data sources
and tools.”
55. What is NLWeb?
"NLWeb leverages semi-structured formats like
Schema.org, RSS and other data that websites
already publish, combining them with LLM-powered
tools to create natural language interfaces usable
by both humans and AI agents."
61. Things to do when you’re back at work
● Audit your current schema: is it valid, useful, and up to date?
Don’t neglect it.
● Add and maintain an llms.txt file
● Optimise with entities, relationships and vector embeddings in
mind.
● Ensure content has clear contextual and semantic structure
● Explore MCP and how you could utilise it.
● Make some of this easier with Yoast SEO!
#6:Marked the introduction of semantic search in SERPs
https://developers.google.com/search/blog/2012/10/rich-snippets-guidelines
https://searchengineland.com/google-search-now-supports-microformats-and-adds-rich-snippets-to-search-results-19055
#12:https://developers.google.com/search/docs/appearance/structured-data/search-gallery
Brought order and interoperability to the web’s meaning layer
#13:Structured data seeps to a lot of Google assets - it’s everywhere.
Now they’re getting into emails with contextual ads
#18:Requies SEOs to consider how our content can contribute to these ongoing dialogues
rather than just answering isolated queries
#20:Structured data supports discoverability in voice, video, documents, and beyond
You’re not optimising for algorithms anymore - you’re optimising for understanding
https://ipullrank.com/vector-embeddings-is-all-you-need
#21:Structured data supports discoverability in voice, video, documents, and beyond
You’re not optimising for algorithms anymore - you’re optimising for understanding
https://ipullrank.com/vector-embeddings-is-all-you-need
#22:https://blog.google/products/search/ai-mode-multimodal-search/
However, AI Overviews and zero-click features are reducing visibility
#30:Don’t assume they’ll interpret everything unassisted
Brands (and individuals) may develop their own “LLM memory banks”
Structured data may be how your content lives inside someone else’s LLM
Connecting with the consumer via these agents
Agents right now are lazy so any saving or efficiency boost will be favoured
#34:https://invisiblegraph.com/semantic-stories/does-structured-data-markup-influence-eeat/
https://x.com/JarnoVanDriel/status/1857038935290970608/photo/1
SEOs must feed the models — don’t assume they’ll interpret everything unassisted
#35:Top ways to ensure your content performs well in Google's AI experiences on Search
#37:Don’t assume they’ll interpret everything unassisted
Brands (and individuals) may develop their own “LLM memory banks”
Structured data may be how your content lives inside someone else’s LLM
#38:Don’t assume they’ll interpret everything unassisted
Brands (and individuals) may develop their own “LLM memory banks”
Structured data may be how your content lives inside someone else’s LLM
#40:Some may say “you’re showing a thief around your house”.
No. This is more like an estate agent info pack.
#46:Ignore it, fine. But what if you shouldn’t?
It’s like buying a book without the synopsis in the back, knowing that’s where one would naturally look.
That way you have it
#49:Traditional methods, requiring individualized integrations and more manual oversight
https://norahsakal.com/blog/mcp-vs-api-model-context-protocol-explained/
#51:Unified interface for AI agents to dynamically interact with external data/tools
https://www.linkedin.com/posts/whitebeth_modelcontextprotocol-ai-aiagents-activity-7329132096680009728-iUTI/
https://norahsakal.com/blog/mcp-vs-api-model-context-protocol-explained/
#52:Unified interface for AI agents to dynamically interact with external data/tools
https://modelcontextprotocol.io/introduction
#53:2012 - same as Knowledge graph
November 25, 2024
(12 year gap, this is 8 weeks)
https://www.webdesignmuseum.org/gallery/google-2000
#54:Open project conceived and developed by R.V. Guha, creator of widely used web standards such as RSS, RDF and Schema.org
#56:Every NLWeb instance is also an 𝐌𝐂𝐏 𝐬𝐞𝐫𝐯𝐞𝐫, which allows websites to make their content discoverable and accessible to agents via one core method, 𝘢𝘴𝘬, which is used to query a website with a question in natural language.
#57:2012-today (13 years)
This is just under 6 months
May 19th 2025,
#59:The year ahead
https://commons.wikimedia.org/wiki/File:Tesseract-1K.gif
#60:Ignore it, fine. But what if you shouldn’t?
That way you have it
#61:You’re not optimising for algorithms — you’re optimising for understanding
Optimise content for discoverability, adding LLM parsing into consideration