← All Research

Query Intent, Not Google Rank: What Best Predicts AI Citation Behavior

Author

Anthony Lee — AI+Automation

Status

Preprint v3 — March 2026 | Not yet peer-reviewed

Download PDF

Key Findings

Google Rank Does Not Predict AI Citations

Only 6.8% URL overlap between Google's top results and ChatGPT citations. Domain-level overlap is higher (28.7-49.6%), meaning AI platforms use the same trusted domains but pick different pages.

Query Intent Varies Dramatically by Vertical

19,556 queries across 8 verticals. Intent distributions are statistically different by industry. Content strategy must match your vertical's intent pattern, not generic SEO assumptions.

7 Page-Level Features Predict Citation

Internal links (strongest predictor, OR = 2.07), self-referencing canonicals (OR = 1.92), schema markup (OR = 1.69), word count, heading structure, content-to-HTML ratio, and visible timestamps.

AI Platforms Have a 2-vs-2 Architectural Split

ChatGPT and Claude fetch pages live during conversations. Perplexity and Gemini use pre-built search indices only. This requires different optimization strategies for each group.

Reddit Gets Zero API Citations, But Shapes Recommendations

Reddit occupied 38.3% of Google Top-3 positions but received zero AI citations via API. Web UI testing revealed 8.9-15.6% citation rates. Reddit's influence flows through training data, not live retrieval.

Read Full Abstract

The rapid integration of AI chatbots into consumer search behavior has spawned a cottage industry of Generative Engine Optimization (GEO) advice, much of it built on untested assumptions about how AI platforms select sources for citation. Industry practitioners widely assert that Google ranking determines AI visibility, that community-consensus platforms like Reddit confer citation advantages, and that AI recommendations are too inconsistent to warrant optimization efforts. We tested these claims empirically across four major AI platforms - ChatGPT, Claude, Perplexity, and Gemini - using a multi-study design that combined large-scale query intent classification (n = 19,556 queries across 8 verticals), Google rank cross-referencing (120 queries via API, plus 100 queries via web UI against both Google and Bing Top-3 results), server-side fetch verification via Vercel middleware logging, and page-level technical analysis of 479 cited and non-cited pages. Our results challenge all three prevailing claims. These findings suggest that effective GEO strategy requires intent-aware, platform-specific optimization rather than the one-size-fits-all approach currently advocated by industry practitioners.

Keywords

Generative Engine Optimization, GEO, AI citation behavior, AI search, ChatGPT, Claude, Perplexity, Gemini, query intent, brand recommendations, live page fetch, robots.txt compliance

Citation

Lee, A. (2026). Query intent, not Google rank: What best predicts AI citation behavior. Preprint v3, AI+Automation.

DOI: https://doi.org/10.5281/zenodo.18653093

ORCID ORCID: 0009-0002-4815-6373 aiXiv: aixiv.260215.000002