Programmatic SEO (pSEO) has always been misunderstood. At its core, it was never about "automating content." It was about systematically generating pages from structured data at scale. What is changing now is not the concept itself, but the environment it operates in. Search systems are evolving. Retrieval systems are evolving. And with that, the criteria for what actually gets surfaced, reused, and trusted is evolving too.

The Original Promise of pSEO

Programmatic SEO emerged as a way to scale content production using templates and datasets. It allowed publishers to capture long-tail queries efficiently and build coverage across large combinations of variables, such as locations, products, attributes, and comparisons.

In a traditional search environment, this worked well when queries were keyword-driven and matching was heavily lexical. Repetition and exact phrasing played a larger role in determining relevance. In that context, pSEO functioned as a force multiplier.

Check out the SEO Founders’ User Guide, a resource drawn from the collective experience of the SEO sector. Cut through the chaff and discover your green pastures.

Where pSEO Started to Break Down

As systems like Google improved their understanding of intent and meaning, the limitations of scaled content became more visible. The issue was never scale itself. It was this: scale without incremental value produces noise, not signal.

Data proves this point. A recent 16-month study tracking 2,000 fully AI-generated articles across 20 new domains found that while 71% were quickly indexed and gained early visibility, only 3% of those pages maintained top 100 rankings after three months. Many programmatic pages repeated the same underlying information with minimal variation. They failed to introduce new insight per page and relied on template swapping rather than differentiated substance. As a result, systems became better at detecting redundancy, thin variations, and a lack of informational contribution.

Scale without incremental value produces noise, not signal. Modern systems are far better at distinguishing between the two.

The Shift We Are Now In

We are now moving beyond traditional search into systems that do not just rank pages, but interpret, synthesise, and reuse information directly. This introduces a fundamental shift in how pSEO should be understood. Visibility will shift from ranking #1 to being included in generated answers with traceable influence.

This is not theoretical. Recent network traffic analysis of Google's AI Overviews reveals it uses a completely separate retrieval system from organic search, extracting individual passages and scoring them for relevance in under 200 milliseconds. This means a page ranking #15 organically can earn a citation while the #1 result is ignored. Furthermore, forecasts predict that daily usage of generative AI within search will be 300% more common in 2026.

In this environment, the goal is no longer just to create pages that can rank. It is to create information that can be retrieved, interpreted, and reused within generated outputs.

From Semantic Relevance to Outcome Effectiveness

Historically, optimisation focused on semantic relevance: keyword matching, topical alignment, and on-page signals. That layer still exists, but it is no longer sufficient on its own. Systems are increasingly optimising for outcome effectiveness.

In other words: Does this information help resolve the user's task? Does it contribute to a useful, coherent answer? Does it improve the quality of the final output? This changes how programmatic content is evaluated. It is no longer enough for pages to be "relevant." They must be useful in context of an outcome.

Structure Over Markup, Signals Over Tags

In traditional SEO, schema markup was often treated as a tactical lever. In practise, its value was always indirect. Schema fades as a tactic; structure remains as a requirement. What matters is not the presence of markup alone, but the underlying clarity of entities, consistency of relationships, and organisation of information.

Because modern AI systems use passage-level extraction to build answers, content structured with distinct, self-contained answer blocks inherently has a higher citation surface area. Systems are increasingly capable of inferring structure from content itself. Which means poorly structured content cannot be rescued by markup alone, and well-structured content often does not need heavy reliance on markup. Structure becomes a prerequisite for interpretation, not a shortcut.

Authority Is No Longer Static

Another shift is the way authority is evaluated. In older models, authority was often treated as relatively stable: accumulated links, historical signals, and established domain reputation. In newer systems, authority is becoming more dynamic.

As systems improve, readers are increasingly practising lateral reading to verify sources. Systems mirror this behaviour. Authority becomes time sensitive, not static. This means fresh contributions can influence perception, outdated signals can decay in relevance, and consistency over time matters more than isolated spikes. Authority is no longer just something you "have." It is something that is continuously inferred.

Originality and First-Hand Data

As systems improve, they become better at distinguishing between repackaged information and original contributions. This leads to a clear trend: originality and first-hand data become disproportionately valuable.

Despite the influx of automated content, analysis of 42,000 search results shows Position 1 is still dominated by content featuring original research and human insight. Programmatic SEO, when done well, often sits on top of structured datasets. The differentiator becomes whether that data is unique, whether it provides new insight, and whether it cannot be easily reconstructed elsewhere. Pages that merely rephrase existing information at scale will lose relative value. Pages that introduce proprietary data, unique comparisons, or first-hand analysis will gain increasing advantage.

Identifying True Contribution

As retrieval and synthesis systems improve, they will become more capable of answering a key question: who actually contributed something original? This is not just about content quality in isolation. It is about traceability, consistency across sources, and alignment with other signals in the ecosystem.

Systems will increasingly evaluate not just what is said, but how often and where similar contributions appear. This is where entity-level understanding becomes important. Not pages in isolation, but the entities behind them.

The EEAT Question, Reframed

The debate around EEAT is often framed incorrectly. The question is not: Does EEAT exist? The more relevant question is: What is the minimum set of signals required for a system to confidently reuse your information?

Because that is what modern systems are optimising for: confidence in reuse, reliability of synthesis, and stability of outputs. EEAT, in this context, is not a checklist. It is an emergent property of consistency, credibility, corroboration, and originality over time.

What This Means for Programmatic SEO

Programmatic SEO is not obsolete. But its constraints are clearer than ever. To remain effective, it must evolve from mass page generation into structured, meaningful, differentiated information generation at scale.

That means each page must contribute something non-trivial, reflect a clear intent or use case, and exist for a reason beyond templated variation.

The Direction of Travel

All of these changes point in the same direction:

Visibility will shift from ranking positions to inclusion in generated outputs.

Systems will prioritise outcome effectiveness over pure semantic matching.

Structure will matter more than markup.

Authority will become dynamic and time-sensitive.

Originality and first-hand data will carry increasing weight.

Systems will get better at identifying who actually contributed meaningful information.

Final Thought

Programmatic SEO was built for a world of indexed pages and ranked results. We are moving toward a world of retrieved fragments, synthesised answers, and traceable influence within generated outputs.

The core principle remains the same: scale is only valuable when it produces signal. The difference now is that systems are far better at distinguishing between the two. And that changes everything about how pSEO must be approached going forward.

E-E-A-T in the Age of LLMs: Why Content Features Don't Translate to Trust

The most common misunderstanding about E-E-A-T is also the most consequential. It is not a checklist. It is not a set of content features you add to a page. Yet the industry has largely collapsed it into exactly that.

This matters because as systems shift from ranking pages to retrieving and synthesizing information, the gap between what people think E-E-A-T is and what it actually represents becomes a critical vulnerability.

Why This Diagram Misses the Point

E-E-A-T SEO Annotation diagram showing content features incorrectly labeled as E-E-A-T signals — Common misconception: E-E-A-T as on-page content features

This isn't really how E-E-A-T works.

What you've shown are content features , things that can support perception (first-person narrative, expert quotes, before/after images, etc.) , but E-E-A-T itself isn't something you "add" to a page like annotations.

It's an evaluative framework, not a checklist.

Google doesn't assign "experience" because there's a personal story, or "expertise" because a doctor is quoted. Those elements can help, but they're proxies at best.

Actual E-E-A-T is inferred from a much broader set of signals:

consistency of quality over time
reputation beyond the site
corroboration across sources
accuracy and alignment with known knowledge
who is behind the content, not just what's on the page

This kind of diagram risks reducing it to on-page decoration, when in reality it's mostly off-page and longitudinal.

What E-E-A-T Actually Is

E-E-A-T originates from Google's Search Quality Rater Guidelines. It is an evaluative framework used by human raters to assess whether a source would be trusted outside of search. The question it attempts to answer is straightforward: Would this source be trusted in the real world?

That means E-E-A-T is fundamentally rooted in signals that exist independently of any webpage:

Real-world reputation
Verifiable expertise
Independent validation
Historical consistency
External corroboration

Not in:

On-page formatting
Self-declared credentials
Trust badges
Author bios
Schema markup

This distinction is critical because it reveals the core problem: the strongest E-E-A-T signals are precisely the ones that large language models are worst equipped to verify.

The Real Model: Mr Richard Hanson, MB, MCh, FRCS (Plast)

Consider a real-world professional: Mr Richard Hanson, MB, MCh, FRCS (Plast) , an Irish plastic surgeon with genuine, verifiable credentials. His authority cannot be easily faked because it is grounded in verifiable, external systems.

His trust signals exist in places like:

Medical registries (General Medical Council, Irish Medical Council)
Hospital affiliations and surgical privileges
Research publications in peer-reviewed journals
Conference participation and speaking engagements
Peer recognition and professional recommendations
Independent patient reviews and documented outcomes

These signals are:

External to any webpage he might create
Verifiable through independent systems
Accumulated over time and difficult to manipulate
Rooted in institutional validation

This is genuine E-E-A-T substrate.

Now contrast this with how the SEO industry typically approaches E-E-A-T. The standard playbook is: add an author bio, display credentials prominently, include trust badges, apply schema markup, use professional photography. This creates a dangerous simplification: presentation of trust gets confused with evidence of trust.

Two websites can look nearly identical on the surface. One represents real authority backed by external validation. The other imitates it through presentation alone. But underneath, one has genuine institutional backing and the other has self-assertion. E-E-A-T exists in that difference.

What E-E-A-T Actually Is

That means E-E-A-T is fundamentally rooted in signals that exist independently of any webpage:

Real-world reputation
Verifiable expertise
Independent validation
Historical consistency
External corroboration

Not in:

On-page formatting
Self-declared credentials
Trust badges
Author bios
Schema markup

This distinction is critical because it reveals the core problem: the strongest E-E-A-T signals are precisely the ones that large language models are worst equipped to verify.

The Surgeon Model: Where E-E-A-T Becomes Obvious

Consider a real-world professional: a surgeon with credentials such as MB, MCh, FRCS (Plast). Their authority cannot be easily faked because it is grounded in verifiable, external systems.

Their trust signals exist in places like:

Medical registries (General Medical Council, Irish Medical Council)
Hospital affiliations and privileges
Research publications in peer-reviewed journals
Conference participation and speaking engagements
Peer recognition and recommendations
Independent patient reviews and outcomes

These signals are:

External to any webpage they might create
Verifiable through independent systems
Accumulated over time and difficult to manipulate
Rooted in institutional validation

This is genuine E-E-A-T substrate.

Now contrast this with how the SEO industry typically approaches E-E-A-T. The standard playbook is:

Add an author bio
Display credentials prominently
Include trust badges
Apply schema markup
Use professional photography

This creates a dangerous simplification: presentation of trust gets confused with evidence of trust.

The LLM Problem: Text Representations, Not Reality

Large language models operate under a fundamental constraint that makes E-E-A-T verification nearly impossible.

LLMs do not:

Query live registries to verify credentials
Maintain persistent, trusted identity graphs
Access real-time institutional databases
Verify professional affiliations independently

Instead, they operate on:

Text corpora from training data
Statistical patterns in language
Learned relationships between concepts
Frequency of mention across sources

This creates a structural mismatch. E-E-A-T is strongest when it is grounded in reality,external, hard to fake, independently validated. LLMs only see representations of those things in text form.

For a surgeon, this means:

A genuine medical profile from a verified registry
A well-written affiliate page claiming expertise

Can look structurally similar when reduced to text. The model sees patterns and language consistency but cannot access the underlying verification system that distinguishes one from the other.

The Fallback: Frequency Becomes Authority

When models cannot verify truth directly, they default to proxies. And those proxies often correlate with something very different from actual expertise.

When LLMs encounter conflicting or unverifiable information, they rely on:

Frequency of mention across sources
Consistency across indexed content
Prominence in training data
Prevalence in high-ranking pages

Which often means:

Widely cited industry content
Repeated narratives in SEO-driven sites
Content that ranks well in Google
Aggregated publisher perspectives

This creates a substitution: "widely said" becomes "likely true".

This is not a bug in LLM design. It is a rational fallback when ground truth cannot be accessed. But it creates a reinforcing cycle.

The 2026 Evidence: How LLMs Cite Blog Consensus Over Expertise

This pattern is not theoretical. March and April 2026 studies provide concrete evidence of exactly how this mechanism operates in real time.

Example 1: Ranking Factors Query

When major models (ChatGPT, Gemini, Perplexity, and others) are asked "What are the most important ranking factors for Google in 2026?", they overwhelmingly cite popular SEO blogs and listicles that currently rank in Google's top 10–20. These include articles from Ahrefs, SEMrush, Search Engine Journal, and well-known "best of" guides. The models repeat claims like "EEAT is now the #1 factor" or "focus on topical authority clusters",phrases that have become content consensus precisely because they rank well.

But what the models actually do, according to March 2026 studies, reveals the mechanism:

GPT-5.3 still sends approximately 32% of citations to blog posts for non-branded SEO queries
Pages with high referring domains (i.e., Google authority signals) are 3.5× more likely to be cited than lower-authority pages
Real veteran practitioners with 20+ years of documented results and zero hype often get ignored unless their content also matches the "popular blog" style and ranks well

The model is not evaluating first-hand experience or long-term survival through updates. It is using Google ranking plus consensus as a shortcut for "truth."

Example 2: Helpful Content Update Query

When asked "How has Google's Helpful Content Update changed SEO strategy?", LLMs consistently regurgitate the narrative from the highest-ranking blog posts,many of which were written shortly after each update by agencies chasing traffic.

The real pattern from 2026 data:

Only approximately 38% of pages cited in Google AI Overviews (and similar LLM answers) even rank in the top 10 for the query
Yet models still heavily favour listicles, "ultimate guides," and opinion-heavy blogs that dominate current SERPs
Citation velocity and "content-answer fit" (how closely the page mirrors how the LLM itself writes) matter far more than demonstrated real-world experience or survival through multiple HCU or core updates

In short: If it ranks well on Google, LLMs treat it as authoritative, even if the author has never run a large-scale campaign or survived the updates they are writing about.

The Feedback Loop: Search Visibility Becomes Perceived Authority

There is a compounding effect between search ranking and LLM training:

1. Google ranks content based on its algorithm

2. That content becomes highly visible and indexed

3. It is included prominently in training data for LLMs

4. LLMs reproduce those patterns and perspectives

5. Users consume and reinforce those narratives

The result: search visibility → training prominence → perceived authority

Not necessarily actual expertise.

This is particularly problematic because it means content that ranks well in Google,for whatever reason,gains a secondary advantage: it becomes more likely to be reproduced by LLMs, which further amplifies its perceived authority.

A page that ranks #1 for a query is more likely to be in training data. A page in training data is more likely to be referenced by an LLM. A page referenced by an LLM gains credibility through association with the model's output.

This creates a compounding visibility advantage that has nothing to do with actual expertise or real-world authority.

Why LLMs Cannot Distinguish Real Authority from Claimed Authority

LLMs struggle to reliably differentiate between:

Self-declared expertise
Independently validated expertise

They process:

"World-renowned expert"
"20 years of experience"
"Certified professional"

But cannot inherently verify:

Is this registered anywhere?
Is this corroborated by external sources?
Is this reputation earned or simply claimed?

So they treat both as similar signals in text space.

There is another structural issue: LLMs are trained heavily on publishers, SEO-driven sites, aggregated content, and media platforms. They are trained less directly on practitioners themselves.

This means LLMs often "know":

What is written about experts
Better than the experts themselves

This shifts perceived authority toward content producers and publishers, not domain authorities. A well-written article about a surgeon can rank higher in LLM outputs than the surgeon's own professional profile.

The Content Features Trap: Why Annotations Don't Create E-E-A-T

This brings us back to the diagram that prompted this analysis. The image shows content features,first-person narrative, expert quotes, before/after images, professional photography,and labels them as E-E-A-T signals.

These are not E-E-A-T. They are content features that can support perception of E-E-A-T. But they do not create it.

The distinction matters because:

Content features are easy to replicate
Real E-E-A-T signals are difficult to fake
LLMs can process features
LLMs cannot verify the reality behind them

A well-optimised page with professional photography, expert quotes, and a polished author bio might look authoritative. But if that authority is not grounded in real-world verification systems, it is presentation without substance.

For an LLM, this creates a problem: it cannot tell the difference. So it defaults to treating both as similar signals.

Why E-E-A-T Is Fundamentally Anti-Textual

This is the key insight: the strongest E-E-A-T signals are the ones that exist outside of text.

The most important trust signals are:

Off-page signals (links, mentions, citations)
Real-world reputation (professional registries, institutional affiliations)
Historical consistency (track record over time)
Independent validation (peer review, external corroboration)
Longitudinal signals (sustained authority, not spikes)

LLMs are strongest at:

Processing text
Generating language
Finding patterns in corpora
Reproducing learned relationships

So: the most important trust signals are the ones LLMs are worst at accessing.

The result is fluent but not grounded. LLMs can produce confident, well-written answers that are based on consensus rather than truth. They can synthesize widely accepted narratives without verifying them. They can sound authoritative while being slightly off or overgeneralized.

This is why you see LLMs confidently providing information that is:

Slightly inaccurate
Overgeneralized
Based on what is widely said rather than what is true
Influenced by search ranking rather than expertise

The Correct Mental Model

E-E-A-T does not work like this:

Add signals → gain trust → rank

It works like this:

Be trusted in reality → signals emerge → systems detect them → visibility follows

The direction matters. You cannot add E-E-A-T to a page. You can only build the real-world foundation that causes E-E-A-T signals to emerge.

For a surgeon, this means:

Maintain credentials and registrations
Publish in peer-reviewed journals
Speak at conferences
Build reputation through outcomes
Accumulate independent mentions and citations

For a publisher or content creator, this means:

Produce consistently accurate, original information
Build reputation through corroboration and citation
Maintain consistency over time
Contribute unique insights that cannot be easily reconstructed
Earn mentions and links from authoritative sources

These are not tactics. They are the foundation upon which authority is built.

The Implication for Programmatic SEO

This creates a specific challenge for programmatic SEO in the LLM era.

Programmatic content at scale can only remain effective if it introduces genuine value per page. Content that merely repackages existing information with template variation will lose relative advantage because:

1. LLMs can identify redundancy across sources

2. Systems reward originality and first-hand data

3. Authority is increasingly tied to unique contribution, not just presence

But programmatic content that is built on structured, original datasets,unique comparisons, proprietary analysis, first-hand research,can gain advantage because:

1. It cannot be easily reconstructed elsewhere

2. It introduces information that systems cannot find in other sources

3. It creates a reason for citation and inclusion in generated outputs

The difference is not scale. It is whether that scale produces signal or noise.

Final Thought

LLMs are not "wrong" about E-E-A-T because they are poorly designed. They struggle because E-E-A-T is grounded in real-world verification systems and LLMs operate on statistical representations of text.

So they compensate by leaning on frequency, visibility, consensus, and search-influenced content. Which leads to outputs that reflect the web as it is indexed and ranked, rather than reality itself.

Understanding this gap is critical for anyone building content strategy in an LLM-influenced search environment. The most effective approach is not to optimise for how LLMs perceive authority. It is to build actual authority and let the signals emerge naturally.

Programmatic SEO in the Age of AI: From Scale to Signal

Programmatic SEO in the Age of AI: From Scale to Signal

The Original Promise of pSEO

Where pSEO Started to Break Down

The Shift We Are Now In

From Semantic Relevance to Outcome Effectiveness

Structure Over Markup, Signals Over Tags

Authority Is No Longer Static

Originality and First-Hand Data

Identifying True Contribution

The EEAT Question, Reframed

What This Means for Programmatic SEO

The Direction of Travel

Final Thought

E-E-A-T in the Age of LLMs: Why Content Features Don't Translate to Trust

Why This Diagram Misses the Point

What E-E-A-T Actually Is

The Real Model: Mr Richard Hanson, MB, MCh, FRCS (Plast)

What E-E-A-T Actually Is

The Surgeon Model: Where E-E-A-T Becomes Obvious

The LLM Problem: Text Representations, Not Reality

The Fallback: Frequency Becomes Authority

The 2026 Evidence: How LLMs Cite Blog Consensus Over Expertise

Example 1: Ranking Factors Query

Example 2: Helpful Content Update Query

The Feedback Loop: Search Visibility Becomes Perceived Authority

Why LLMs Cannot Distinguish Real Authority from Claimed Authority

The Content Features Trap: Why Annotations Don't Create E-E-A-T

Why E-E-A-T Is Fundamentally Anti-Textual

The Correct Mental Model

The Implication for Programmatic SEO

Final Thought

You might also like

From Semantics to AI: The Same Debate, Repackaged

HCU, thin content and the myth of word count

Blue Sky Thinking for LLMs: Why AI Citations Will Be Gamed