Entities and Knowledge Graph

Competitor Entity Graph: why AI always cites the same 4-5 brands in your industry

Roberto Serra 25 June 2026·~7 min read

The same four or five brands always appear in AI answers about your industry — and you're not one of them, even if you're bigger or technically superior. It's not a matter of budget or how many articles you publish: the AI doesn't know you belong to the same competitive market as those brands, so it doesn't consider you when a customer is deciding who to call. Those are exactly the most valuable queries — and you're not in them. Getting into the right cluster is possible, and it requires specific content that teaches the model the relationship.

Across 50 B2B industry queries I tested over the last 6 months, the AI always cites the same 4-5 brands. The other competitors in the industry, even if bigger or better, never show up. Here’s why.

The reason isn’t the quality of the content, it isn’t the domain’s DR, it isn’t even the content budget. It’s that the AI doesn’t know those other brands belong to the same “competitive family”. And if you’re not in the family, I don’t cite you when someone asks for “alternatives to X”.

Let me explain what this means, how to check whether your brand is in the right cluster, and what to do to get into it.

The competitive cluster is an entity, not a list

When you ask ChatGPT or Perplexity “who are the leading architecture and urban planning firms in Puglia”, the model doesn’t run a SERP. It builds a cluster: a group of entities it considers comparable in terms of geography, size, service offered and project type. Then it pulls out 4-5 to cite for you.

This cluster isn’t a static list. It’s a graph of relationships the model learns by observing how brands appear together across the web’s texts: in industry rankings, in comparisons, in comparative reviews, in tenders, in mutual references.

In the world of graph research, unsupervised clustering is an open problem:

“However, important unsupervised problems on graphs, such as graph clustering, have proved more resistant to advances in GNNs.”

Tsitsulin et al., 2023

Translated: grouping entities into coherent clusters without human labels is hard. The model has to infer on its own who goes with whom. And it does so using the signals it finds in the content.

The operational consequence for you is direct. If your firm never appears alongside the other firms in the cluster within the industry’s content — not cited with them, not compared with them, not listed with them — then as far as the model is concerned you’re not part of that cluster. Period.

Why this comes upstream of everything else

You can have the best entity record on Wikidata, the best Organization schema, the best “About us” page optimized for E-E-A-T applied to AI. If the model doesn’t place you in the right competitive family, you won’t appear in comparative queries.

And comparative queries are the ones that convert. “Best architecture firms in Bari”, “alternatives to [famous firm]”, “urban planning firms in Puglia specialized in urban regeneration”: these are shortlist queries. The customer is deciding who to call.

If your entity exists in the vector space but is far from the cluster of your true competitors, the model treats you as an outlier. It sees you, but it doesn’t include you. That’s worse than not existing: you exist out of context.

Common mistake

Pages that talk only about your own firm, without ever naming the competitive context.

The test you can run in 20 minutes

You don’t need a complicated tool. You need ChatGPT or Perplexity, a list of competitors, and an Excel sheet.

Write down the 10 competitors you believe to be your direct rivals (the ones you fight for the same tenders, the same clients, the same budgets).
Open ChatGPT or Perplexity and run 5 real comparative queries from your industry. Examples for an architecture firm in Puglia: “leading architecture firms in Bari”, “urban planning firms in Puglia specialized in urban regeneration”, “Puglia architects working on public construction”, “alternatives to [well-known local firm]”, “best designers for renovated masserie in Puglia”.
For each query, note which brands get cited.
Count: how many of your 10 assumed competitors show up? And do you show up?

If the brands cited by the AI are always the same 4-5 and don’t match your list, you have two valuable pieces of information. First: the cluster you think you inhabit isn’t the one the model recognizes. Second: you know exactly who you need to be seen together with to get into the family.

Pro tip

Write 2-3 pages of explicit comparison along the lines of “Our approach to urban regeneration vs [Firm X] vs [Firm Y]”.

The test I ran myself

Across 50 B2B industry queries monitored over the last 6 months — various geographic and vertical combinations, from professional services to manufacturing — the pattern was always the same: 4-5 dominant brands cited in 70-80% of the answers, and a long tail of brands that never show up, even when they objectively have revenues, portfolios and sizes comparable to the top ones.

The deciding factor wasn’t the quality of the website. It was the presence in comparative contexts: industry rankings, “top X in [area]” articles, comparison pages, mentions in panels and roundtables.

An indicative test, not a controlled study: the sample is qualitative, the measure is on aggregate citations. But the pattern is too sharp to be noise. Whoever is never named alongside the cluster leaders is, to the AI, not a competitor of those leaders.

The mistakes I see most often

Talking only about yourself. Pages that talk only about your own firm, without ever naming the competitive context. The model has no way to hook you to a cluster.

Generic comparisons. “We’re different from the other firms” without saying which ones. The AI needs proper names to build the competitive relationship.

Avoiding competitors’ names out of fear. The fear is commercial, not technical. But the cost is invisibility: if you never name your competitors in legitimate contexts (market reviews, industry analyses, reflection posts), the model doesn’t associate you with them.

Thinking link building is enough. Authoritative links help with authority, but they don’t build the competitive graph on their own. You need deliberate textual co-occurrences.

What to do in practice

Write 2-3 pages of explicit comparison along the lines of “Our approach to urban regeneration vs [Firm X] vs [Firm Y]”. Honest, not disparaging, with clear criteria (methodology, project type, territory served).
Publish 1-2 market articles per year in which you name the 5-10 players in your cluster. “Overview of architecture firms in Puglia in 2026” works well.
Request to be included in industry rankings and panels where the other firms in the cluster appear.
Use Author Entity Recognition to tie your designers as entities to concrete projects: the model associates people with firms with types of work.
Check periodically with new comparative queries whether you’re entering the clusters you care about.

These actions aren’t magic and don’t produce results in two weeks. But over 3-6 months they start to shift how the model places you in the graph. Real analysis of your positioning in the cluster requires professional tools for monitoring AI answers and semantic analysis of industry content — what I’ve described to you is a first check to understand whether you have a problem, not a complete diagnosis.

Where your cluster ends, your AI visibility ends

The point I want to leave you with: your visibility in AI answers for comparative queries depends on one thing alone — being recognized as a member of the right cluster. Entity graph, schema markup, authority, all the rest comes after. If you’re not in the family, I don’t cite you when someone asks about its members.

In the next articles in this series I dig deeper into how to build relationships between entities in a structured way, how to use Wikidata to explicitly declare competitive relationships, and how to monitor the cluster’s evolution over time.

Chapter 4 · Entities and Knowledge Graph

Continue with the deep dives

40 deep dives across the 5 sections of the chapter.

4.1 Entity Monitoring & Maintenance 8 deep dives

Entity Confidence Testing: reading the AI’s language to understand how much it trusts your brand Your brand exists in four parallel versions (and you don’t know it) Entity Decay: Why AI Stops Citing You (and How to Get Back Into Answers) Entity recovery after a reputation crisis: how to clean up your entity in the Knowledge Graph Periodic Entity Audit: Why Your Data Ages Even When You Don’t Notice AI Response Monitoring for Entities: how to find out if AI tells your brand’s story with the right data Knowledge Graph Poisoning Prevention: how to protect your entity from false information that AI takes as true Training Data Lifecycle: why corrections to your site don’t reach the AI right away

4.2 Entity Recognition 8 deep dives

Entity disambiguation: when AI confuses your brand with a namesake Entity salience: why being named once is like never being named at all Your brand shows up in AI answers, but classified as what? Entity Linking: why 50 mentions of your brand are worth zero if the AI doesn’t connect them When the AI stops understanding that “we” means you: the coreference problem Are your brand in Italian and in English the same entity to AI? Probably not New brand invisible in AI answers: how to speed up recognition Named Entity Recognition: how AI decides whether your brand is “someone” or just text

4.3 Entity Relationships 8 deep dives

AI doesn’t cite you in isolation: it cites you within a network of relationships Competitor Entity Graph: why AI always cites the same 4-5 brands in your industry You are here Entity-place association: why Perplexity knows who to recommend in Sardinia (and you maybe don’t) Industry Vertical Classification: the category that makes you visible (or invisible) in AI answers Supply Chain Entity Mapping: how partners tell AI who you are Speaking at events: why every conference is an authority node for AI Alumni & Affiliation Network: your institutional connections in the AI graph Client Portfolio as Entity Network: why your clients are nodes that strengthen you

4.4 Knowledge Graph Optimization 8 deep dives

Entering Google’s Knowledge Graph: why without it you’re just text to Gemini Wikidata as semantic backbone: the entry that makes your brand exist for AI Complete Organization schema: the machine-readable ID card of your brand sameAs: the glue that holds your identity together for AI Your brand’s tax code in the AI ecosystem When AI finds three different versions of your company, it stops recommending you Rich Entity Attributes: why AI cites only “fat” entities in detail Product Entity vs Brand Entity: why AI can know your name without knowing what you sell

4.5 Vertical & Local Entities 8 deep dives

Google Business Profile as the primary entity: why AI looks there before your website NAP Consistency: why AI sends clients to the wrong number Who is the Cagliari dentist according to ChatGPT? The answer depends on your city’s Knowledge Graph Why AI Recommends Generalist Blogs Instead of Your Medical Practice (and How to Turn It Around) Franchises and multi-location: why AI doesn’t add up the authority of your locations Professional Registry Entity: why the professional register is your proof of existence for AI Vertical industry directories: why AI pulls its recommendations from there Product/Service Schema for Transactional Queries

The author

Roberto Serra at the Senate of the Republic

Senate of the Republic · Palazzo Giustiniani Conference “The power of artificial intelligence”

Roberto Serra

SEO consultant for over 15 years, founder of the Serra SEO Agency (RAANK). He helps multinationals and SMEs stay visible where search is moving: ChatGPT, Perplexity, Gemini and Google's AI Overviews.

As featured in

Learn more about Roberto Serra →