Page loaded

What Gets Cited: Why AI Answer Engines Cite Competitors

2026-06-17·14 min·By Ethan

A practical reading of the What Gets Cited paper: why AI answer engines cite one competitor over another, and how to audit Competitive GEO with prompts, source logs, and fix priorities.

AI visibility is no longer just a question of whether a page ranks. The more uncomfortable question is what happens after your page and a competitor's page are both available to an AI answer engine. Why does the answer cite the competitor first? The May 2026 paper What Gets Cited: Competitive GEO in AI Answer Engines gives content teams a useful starting point: topic match, source position, explicit price information, and freshness repeatedly affect which source becomes the first citation. The takeaway is not that GEO has a secret formatting recipe. The takeaway is that Competitive GEO should diagnose why a page was retrievable but not chosen. Last updated: 2026-06-17
Competitive GEO citation audit workflow

Key Takeaways

Competitive GEO compares your content against the pages AI systems already cite. It separates two problems that often get mixed together: whether your page entered the candidate source pool, and whether it won the citation once it was there.
  • The paper's strongest practical signals are topic match, candidate position, price information, freshness, and source usefulness.
  • Traditional SEO still matters. Google says AI Overviews and AI Mode supporting links require pages that can be indexed and are eligible for snippets; OpenAI separately documents OAI-SearchBot for ChatGPT search result display.
  • Formatting alone is a weak first fix. Tables, FAQs, and headings help when they carry facts, criteria, evidence, and answer-ready conclusions.
  • The best audit deliverable is not a vague "optimize for AI" memo. It is a prompt set, a citation log, a competitor gap table, and a retest plan.

What The Paper Actually Tested

The What Gets Cited paper studies a narrow but valuable question: when two candidate documents are put into a controlled answer-generation setup, which one is cited first? The authors used a two-document RAG environment, changed one factor at a time, and ran 252,000 trials across six models. That design matters. The paper is not trying to recreate every messy part of a live search product. It removes many outside variables so the citation-selection step becomes easier to inspect. The tested factors include topic relevance, query term coverage, price, specifications, evidence, timestamps, brand signals, position bias, and formatting. The headline result is useful because it is operational. Across models, the strongest and most stable factors are not decorative page patterns. They are whether the candidate source directly matches the question, whether it contains facts the answer needs, whether it is fresh, and whether it appears earlier in the candidate list.
Paper findingWhat it means for content teamsAudit question
On-topic beats off-topicA page must answer the prompt, not merely the broad subjectDoes the section answer the user's exact question?
Price beats no priceCommercial pages need comparable factsDoes the page give price, range, package logic, or a clear reason pricing is custom?
Recent timestamp beats old timestampFast-moving topics need visible updatesIs the update date visible, and did the facts actually change?
Candidate position is powerfulRanking and retrieval still affect AI citation opportunitiesDid the URL enter the source pool before we judged the copy?
Formatting-only factors are weakerStructure should carry evidence, not replace itDid this edit add facts, proof, comparisons, or just polish?
My read is conservative: use the paper as an audit-priority map, not as an AI citation formula. Real answer engines also depend on indexing, crawl access, link graphs, interface rules, user location, live retrieval quality, and brand trust.

Competitive GEO vs Traditional GEO

Traditional GEO asks how to improve visibility inside generated answers. The original GEO: Generative Engine Optimization paper formalized that idea with benchmark tasks, visibility metrics, and content adaptations for generative engines. Competitive GEO asks a sharper question: when your page and a competitor's page are both plausible sources, why does the model cite one and not the other? That is closer to the daily problem for B2B teams. You are not optimizing in an empty room. You compete with SERP leaders, third-party reviews, competitor documentation, community discussions, media pages, and comparison hubs for a limited number of answer citations.
LayerTraditional SEO questionGEO questionCompetitive GEO question
DiscoveryCan my page be crawled and ranked?Can my page enter an AI-usable source pool?Does my page or the competitor enter the pool more often?
Answer useDoes the search snippet earn a click?Does the answer mention or cite me?Does the answer cite the competitor before me?
DiagnosisTitle, links, technical SEO, content coverageEntities, answer blocks, evidence, extractabilityCompetitor facts, first-citation reason, source quality, retrieval position
FixImprove ranking and clicksImprove answer readinessImprove citation win rate against known cited sources
For Convertos.ai, this is the difference between a brand mention report and a useful operating loop. A team needs to know which prompts cite competitors, which pages those citations point to, and which missing facts or proof blocks explain the gap. A quick first pass can start with the AI visibility checker before deeper page-by-page comparison.

The Four Citation Gates

The paper's strongest signals map well to four audit gates. They do not explain every answer-engine decision, but they are the right first places to look.

1. Topic Match

AI answer engines usually generate from retrieved candidate sources. If the page is generally related but does not answer the prompt, it is a weak citation candidate. A page about "GEO strategy" may be relevant to "best GEO tools for B2B SaaS," but it is not enough if it lacks tools, pricing, criteria, limitations, or selection advice. Google's AI features guidance also keeps this grounded in search: eligible AI features rely on Search systems, supporting links, and content that can be used as part of the search experience. The page must first be understandable as an answer to a query.

2. Comparable Commercial Facts

Price showed up as a strong factor in the controlled tests. That makes intuitive sense. If an answer needs to compare products, recommend tools, or explain tradeoffs, the model needs facts it can reuse. A page that only says "contact sales" gives the system less to work with than a page that explains package boundaries, usage limits, who the plan fits, and what drives custom pricing. This does not mean every SaaS company must publish a full price table. It does mean the page should give enough comparison material for a reader and an answer engine to understand how the offer is evaluated.

3. Freshness With Real Updates

Freshness matters most when the subject changes quickly. AI search, crawler behavior, Google AI features, and model access policies are all moving targets. The safe pattern is a visible update date plus evidence that the body was actually refreshed. For example, OpenAI's crawler documentation distinguishes OAI-SearchBot, which is associated with search result display in ChatGPT, from GPTBot, which is associated with model training crawling. A page that discusses AI search visibility without version-aware crawler details can become stale quickly.

4. Candidate Position

Candidate position was one of the strongest effects in the paper. That does not mean "rank first and everything is solved," but it does mean SEO is not dead. If a page cannot be crawled, indexed, internally linked, or shown with usable snippets, Competitive GEO fixes will not get a fair test. Google's robots meta tag documentation is especially relevant here because snippet controls affect whether content can be used as a direct input for AI Overviews and AI Mode. A page with avoidable snippet restrictions may lose before the citation contest starts.

A Competitive GEO Audit Table

Use a 0 to 3 score for each row: 0 means absent, 1 means present but weak, 2 means usable, and 3 means strong and verifiable. The goal is not fake precision. The goal is agreement on what to fix first.
Check0-point pattern3-point patternWhy it affects citation
Prompt matchPage covers a broad topic but not the exact questionThe opening and relevant H2 answer the prompt directlyRetrieval and generation both reward semantic match
Query terms and entitiesMissing key entities, category terms, or synonymsNaturally covers entities, alternatives, and comparison languageHelps the source match the question and answer frame
Price or cost boundaryOnly says "contact sales"Gives price, range, package logic, or custom-pricing driversRecommendation answers need comparable facts
Specs and limitsNo feature boundaries, limitations, or audience fitLists capabilities, limits, fit, and non-fit casesThe answer can extract sharper conclusions
Competitor comparisonPretends alternatives do not existGives fair criteria, scenarios, and caveatsComparison prompts need side-by-side evidence
Evidence near claimsMarketing claims without sourcesKey facts sit near papers, docs, tests, examples, or methodologyCitations need defensible source material
Visible freshnessNo date or stale bodyUpdated date plus current factsFreshness was a stable factor in the paper
Technical accessBlocked, noindexed, JS-only body, or no snippet eligibilityCrawlable, indexable, canonical, text-rendered pageNo candidate source means no citation
Brand/entity consistencyNames, URLs, and descriptions vary across pagesProduct, company, schema, social, and third-party facts alignReduces entity confusion in AI answers
Extractable answer blocksLong paragraphs with no conclusionEach major section has a standalone answer, table, or checklistEasier for AI systems to summarize and cite
If you already know which competitor URL is being cited, use the competitor URL comparison tool to compare your page against the cited source. Do not just compare word count. Compare facts, evidence, dates, answer blocks, and the exact prompt job each page serves.

First Separate Retrieval Failure From Citation Failure

Many GEO audits jump too quickly into content rewriting. First decide where the failure happened.
  1. Run a stable set of prompts across the target surfaces, such as ChatGPT, Perplexity, Gemini, and Google AI features.
  2. Record whether your brand, product, or URL appears.
  3. Record the cited URLs and competitor URLs.
  4. Check whether your relevant page can be found, crawled, indexed, and summarized in traditional search.
  5. If the page is not entering the candidate pool, fix technical SEO, internal linking, canonicalization, indexability, and snippet eligibility.
  6. If the page is retrievable but loses to a competitor, move into Competitive GEO content analysis.
  7. Retest the same prompts after the fix, using the same platform and similar timing.
The boundary matters. A noindex page does not have a citation-quality problem. A thin page that ranks but loses to a richer comparison page does.

Build A Prompt Set That Reflects Real Demand

Do not test one flattering prompt and call it GEO monitoring. A brand may appear for definition prompts and disappear for purchase prompts. It may be visible in English and absent in localized Chinese prompts. A useful first set often includes 25 to 40 prompts across five intent groups.
Prompt groupExampleWhat to measure
Definition"What is competitive GEO?"Category understanding and source framing
Comparison"GEO vs AEO" or "Convertos alternatives"Whether the brand enters the right comparison set
Purchase"best GEO tools for B2B SaaS"Recommendations, citations, price and limits
Problem solving"why is my brand not cited by ChatGPT?"Whether the page answers the pain directly
Brand"Convertos.ai AI visibility checker"Brand accuracy and competitor confusion
For each run, log platform, prompt, date, answer summary, brand presence, citation URL, competitor URL, answer errors, cited page type, and next action. A local paragraph check, such as the citation readiness checker, can help tune individual answer blocks, but it should not replace whole-page and prompt-level tracking.

Why AI Cites A Competitor Instead Of You

The boring explanation is often the right one: the competitor page gives the model more usable material. It may answer the prompt faster, provide clearer prices, explain limits, include current evidence, use a better comparison structure, or rank higher in the candidate list.
Gap typeTypical symptomFix
Relevance gapPage is about the topic but misses the exact promptAdd a direct answer in the intro and the matching H2
Completeness gapMissing price, specs, limits, audience, or alternativesAdd comparable facts and explain uncertainty honestly
Evidence gapClaims are unsupported or far from sourcesPut official docs, papers, tests, examples, or methodology near the claim
Competitive gapCompetitor has comparison pages, third-party reviews, or clearer category pagesBuild owned comparison assets and improve external profile consistency
Owned content is only one layer. AI answers often cite third-party sources, reviews, directories, documentation, and community pages. Competitive GEO should inspect the actual sources being cited, then decide which can be improved directly and which require PR, partner, directory, documentation, or review work.

How To Prioritize Fixes

Prioritize fixes by impact, evidence strength, implementation cost, and retestability. Do not rewrite one paragraph for every prompt failure. Start with pages that affect many valuable prompts: category pages, product pages, comparison pages, pricing pages, docs, case studies, and third-party profile pages.
PriorityWhen to fix firstExample
HighAffects purchase or comparison prompts, and the page is already discoverableProduct page lacks supported platforms, price boundaries, and competitor comparison
MediumPage answers the question but has weak evidence or stale factsGEO guide lacks 2026 AI feature and crawler updates
LowOnly affects a niche long-tail prompt or cosmetic formattingTurning a paragraph into a list without adding facts
Ask four questions before assigning work:
  1. Does this issue affect multiple high-value prompts?
  2. Is the page already close to the candidate source pool?
  3. Will the fix add evidence, facts, criteria, or a clearer answer?
  4. Can we retest citation rate, mention rate, or answer accuracy after the change?
If the answer is no, the work may still be useful, but it should not be called a Competitive GEO priority.

Measurement Plan

Competitive GEO should be reported as a trend, not as a one-off screenshot. A 30-day first measurement window is reasonable because crawling, indexing, AI answer behavior, and result interfaces can lag behind content updates.
MetricHow to record itBoundary
Brand mention ratePrompts where the brand appears / total promptsMention does not equal recommendation
Citation ratePrompts citing owned URLs / total promptsPlatforms show citations differently
First-citation rateTimes owned URL is the first cited sourceClosest to the paper's competitive setup
Answer accuracyBrand-related answers without factual errorsRequires human review
Competitor shareCompetitor recommendations or citations by prompt groupSplit by intent type
Fix responseBefore/after movement on the same prompt setLook for repeated movement, not one sample
The GEO-16 paper supports the broader idea of auditing AI citation behavior with page-quality signals such as freshness, metadata, semantic HTML, and structured data. It is observational, so do not treat it as proof of universal causation. Use it as a measurement framework alongside controlled prompt logging.

FAQ

Does Competitive GEO replace SEO?

No. Competitive GEO sits after basic SEO. A page first needs to be crawlable, indexable, understandable, and eligible for snippets before citation competition has a fair chance. Source signal: SERP related questions and the What Gets Cited paper title.

Why does the paper say formatting is weak when GEO advice often recommends structure?

Structure is useful when it carries answer-ready facts. A table that compares pricing, limits, and evidence can help; a table that only rearranges vague copy usually will not. Source signal: People Also Ask-style GEO questions and the paper's formatting-factor results.

Should every SaaS page publish exact pricing?

Not always. But purchase and comparison pages should provide some cost boundary: price range, package logic, usage limits, free trial terms, or the variables that drive custom quotes. Without comparable facts, AI answers have less to cite. Source signal: SERP pricing/comparison intent and the paper's price factor.

What should I do first if competitors are cited and I am not?

Identify the cited competitor URLs, check whether your page is discoverable for the same prompt, then compare prompt match, facts, evidence, freshness, and extractability. If the page is not discoverable, fix technical and retrieval issues first. Source signal: related questions about why AI answers cite competitors and the Competitive GEO audit workflow.

Sources And Methodology

This article was prepared on 2026-06-17. It is based on the arXiv paper What Gets Cited: Competitive GEO in AI Answer Engines, the earlier GEO paper, the GEO-16 citation audit paper, Google documentation on AI features and robots meta snippet controls, and OpenAI documentation on crawler behavior. The workflow also uses SERP research run for the Competitive GEO query cluster and Convertos.ai's existing publishing rules: GEO is the category, not a normal tag.

Need practical guidance?

Talk to me about your SEO / GEO bottlenecks

Reach me by email, WeChat, or LinkedIn. I can help you prioritize issues and suggest a practical first step.

Email: Send emailWeChat: 15765565449LinkedIn