How AI engines think

AI reads your page like a book: it skips the middle

Roberto Serra 25 June 2026·~7 min read

Is the most important information about your company stuck in the middle of the page, between a long introduction and a conclusion? AI is missing it. Models give far more weight to what sits at the beginning and the end of the text — everything in the middle ends up in a zone where the chance of being picked up is much lower. You're writing the right things in the wrong place. Moving your key information to the spots where AI actually looks can change everything — without touching the content.

You’ve written a perfect page. The brand is mentioned, the service is described, the CTA is clear. But the key information is in paragraph 5, at the heart of the page. For a human reader that’s not a problem — they scroll and find it. For an AI model it’s a serious problem, because the middle of the page is the dead zone.

This isn’t an opinion. It’s a measurable, documented behavior that in the research world is called the “lost in the middle effect”. And it has direct implications for who AI cites and who it ignores in its answers.

What Positional Encoding is and how it shapes what AI remembers

Transformer models don’t process text word by word like a human reading in sequence. They receive the entire text all at once, but they need to know where each word is. Positional encoding is the mechanism that provides this position information.

Minaee et al., in their 2025 survey, describe it very clearly:

“Positional encoding is incorporated to fuse information about the relative or absolute position of the tokens in the sequence.”
(Large Language Models: A Survey)

Translated: every token receives a kind of “address” that tells the model its position in the text. Without this mechanism, to the Transformer the words would be an unordered set — like Scrabble tiles dumped on the table. Positional encoding is what turns a pile of tokens into a sequence where “first paragraph” and “last paragraph” actually mean something.

And here comes the part that concerns you. Because not all positions carry the same weight.

The “lost in the middle” problem: it’s not theory, it’s measured

A group of researchers systematically tested what happens when you move the same piece of information to different spots in the text. The result, published in ACL Transactions, is unambiguous:

“We observe that performance is often highest when relevant information occurs at the beginning or end of the input context, and significantly degrades when models must access relevant information in the middle of long contexts, even for explicitly long-context models.”
(Liu et al., 2024)

That “even for explicitly long-context models” is the most striking part. Even models designed specifically to handle long texts — Claude with 200K tokens, GPT-4 with 128K — suffer from this effect. It’s not the limit of one specific model: it’s a consequence of the architecture itself.

The survey by Gao et al. (2024) on RAG confirms the problem from the retrieval side as well:

“Redundant information can interfere with the final generation of LLM, and overly long contexts can also lead LLM to the ‘Lost in the middle’ problem.”
(Retrieval-Augmented Generation for Large Language Models: A Survey)

Which means: even when an AI engine like Perplexity correctly retrieves your page, if the key information is in the middle of a long and redundant text, the model loses it anyway while generating the answer.

Common mistake

If the brand isn’t in the body text of the first and last paragraphs, as far as AI is concerned it doesn’t exist.

Where you put the brand changes everything: a test I ran

Before writing this article I took a real page — the “Services” page of a photovoltaic systems company — and ran an experiment with ChatGPT.

The original page was structured like this: 400 words of generic introduction about the photovoltaic market, then 600 words on the types of systems, then finally the company name and its specialization in paragraph 6, then 400 technical words, then the CTA.

I pasted the text and asked: “Summarize this page and tell me which company offers the service.” ChatGPT talked about photovoltaics in general. Of the company, not a single mention. The specific information was in the middle — lost.

Then I rewrote the same page, moving the brand and the differentiator. First sentence: the company name with what it does and for whom. Last sentence: the name again with the CTA. Same content, same 1,800 words, just repositioned.

Result: ChatGPT cited the company by name in the summary. Same content, different order, opposite result.

It’s not magic. It’s the “lost in the middle” effect: models make far better use of the information at the beginning and the end of the context than of the information in the middle.

Pro tip

Put the brand in the first 100 words of every page.

Why Italian pages are particularly vulnerable

I’ve noticed a pattern in the Italian companies I analyze. There’s a cultural tendency to “set the stage” before getting to the point — long introductions, historical context, preambles. In a magazine article that’s a virtue. For AI visibility it’s a problem, because you’re filling the first 200 tokens — the ones the model uses best — with generic content.

The “Wikipedia” introduction. Pages that start with “Photovoltaics is a technology that harnesses solar energy…” — 100 tokens wasted explaining something AI already knows better than you. Those tokens are the most valuable on the page and you’re using them for content the model has seen millions of times in the corpus.

The brand only in the header and footer. The company name appears in the logo, the menu and the copyright — elements that AI crawlers often classify as navigation, not content. If the brand isn’t in the body text of the first and last paragraphs, as far as AI is concerned it doesn’t exist.

The CTA with no context. “Contact us to learn more” as the last sentence. AI reads that final chunk carefully — and finds a generic sentence with no brand, no service, no differentiator. It’s the position with the second strongest signal on the page, and you’re wasting it.

What to do concretely

Put the brand in the first 100 words of every page. Don’t wait until paragraph 5 to say who you are and what you do. The first mention of the brand together with the key service has to come right away — that’s where the positional signal is strongest.
Repeat the key information in the last paragraph. The conclusion isn’t a summary for the reader — it’s a second chance for AI to capture the important information. Brand, service, differentiator: repeat them at the close in different words.
Cut the generic introductions. If your “SEO Consulting” page opens with “SEO is a fundamental discipline in digital marketing…”, you’re sacrificing the most valuable position to say something that adds nothing for either the reader or AI.
Hourglass structure: key information at the beginning, deep dive in the middle, key information at the end. The middle is for the humans who read everything. The beginning and the end are for the AI that extracts.
Run the 150-token test: copy your page’s text, read only the first 150 tokens and the last 150. From those two blocks, is it clear who you are, what you do and why a customer should choose you? If the answer is no, AI has the same problem.

Positional encoding in the AI visibility chain

This mechanism combines with others. Tokenization decides whether your brand is recognized as an entity. Positional encoding decides whether that entity is “seen” based on where it sits in the text. The attention mechanism decides how much weight it’s given. And the context window limits how much text the model can process in total.

If your brand is well tokenized but buried in the middle of a long page, the “lost in the middle” effect penalizes it. It’s like having a perfect business card but handing it over when the other person is already distracted.

Reopen your 5 most important pages and move the brand, the main service and the CTA into the first 2 and the last 2 paragraphs. Lost in the middle isn’t a bug — it’s a consequence of the architecture. Those who know it, adapt.

Chapter 1 · How AI engines think

Continue with the deep dives

38 deep dives across the 5 sections of the chapter.

1.1 AI Reasoning 8 deep dives

Step-by-step guides: why AI loves them (and how to write them) AI Agents and APIs: Your Business Can Become a Service the AI Calls Is AI inventing things about your brand? It happens when it can’t find reliable data Cover the Whole Workflow or the AI Ignores You (and Picks Another Source) Whoever Gets Cited in ChatGPT’s First Turn Has an Edge Over Everyone Else If the AI says ‘might’ when talking about you, you have a trust problem If your brand info contradicts itself, AI picks a competitor ‘Recommend the best X in Y’: does your content match this query?

1.2 Evaluation & Scoring 8 deep dives

Writing Too Complex? AI Struggles More to Use Your Content How to Become the Brand AI Generates Automatically for Your Industry Want AI to rephrase you? Write the answer exactly as you want it Exaggerated data on your site? AI discards it and picks whoever is more honest Your title says one thing, your content another? AI notices and penalizes you Logical gaps and contradictions? AI lowers your content’s score Who Is Your Brand Cited With? This Determines Your AI Category Are you rewriting what everyone else has written? AI wants novelty

1.3 LLM Architecture 8 deep dives

AI Replies With Outdated Data About Your Brand? Here’s Why It Happens Is your brand invisible to ChatGPT? The problem starts with how it reads it AI reads your page like a book: it skips the middle You are here How AI Decides Which Words Matter Most on Your Page If your page is too long, the AI cuts it and loses you Why ChatGPT Always Recommends the Same Brands (and How to Get on the List) The semantic distance between you and your customer decides whether AI finds you For AI, your page structure matters more than length

1.4 Retrieval & Grounding 7 deep dives

Perplexity and Bing Chat search in real time: are you in their index? Exact keywords or synonyms? AI needs both (here’s why) AI doesn’t read your whole page — it slices it into chunks After retrieval comes reranking: this is where generic content loses Want AI to cite your site by name and with a link? Here’s what you need to give it AI rewrites the question before searching: is your content ready? AI combines multiple sources to answer: are you in at least 2 of them?

1.5 Training & Alignment 7 deep dives

Useful, accurate and safe: the 3 criteria AI uses to judge your content The AI’s Internal Filters Can Block Your Site Without Warning Is your industry underrepresented in the training data? AI already starts at a disadvantage Vertical AI models: if you’re not in their data, you don’t exist in their world Copied content? The AI keeps the original and discards yours The perfect answer according to AI: structured, specific, with sources Aggressive SEO in 2026? AI Safety Filters Are Already Penalizing You

The author

Roberto Serra at the Senate of the Republic

Senate of the Republic · Palazzo Giustiniani Conference “The power of artificial intelligence”

Roberto Serra

SEO consultant for over 15 years, founder of the Serra SEO Agency (RAANK). He helps multinationals and SMEs stay visible where search is moving: ChatGPT, Perplexity, Gemini and Google's AI Overviews.

As featured in

Learn more about Roberto Serra →