Query Intent, Not Google Rank: What Best Predicts AI Citation Behavior
Anthony Lee — AI+Automation
Preprint v3 — March 2026 | Not yet peer-reviewed
Key Findings
Google Rank Does Not Predict AI Citations
Only 6.8% URL overlap between Google's top results and ChatGPT citations. Domain-level overlap is higher (28.7-49.6%), meaning AI platforms use the same trusted domains but pick different pages.
Query Intent Varies Dramatically by Vertical
19,556 queries across 8 verticals. Intent distributions are statistically different by industry. Content strategy must match your vertical's intent pattern, not generic SEO assumptions.
7 Page-Level Features Predict Citation
Internal links (strongest predictor, OR = 2.07), self-referencing canonicals (OR = 1.92), schema markup (OR = 1.69), word count, heading structure, content-to-HTML ratio, and visible timestamps.
AI Platforms Have a 2-vs-2 Architectural Split
ChatGPT and Claude fetch pages live during conversations. Perplexity and Gemini use pre-built search indices only. This requires different optimization strategies for each group.
Reddit Gets Zero API Citations, But Shapes Recommendations
Reddit occupied 38.3% of Google Top-3 positions but received zero AI citations via API. Web UI testing revealed 8.9-15.6% citation rates. Reddit's influence flows through training data, not live retrieval.
Read Full Abstract
The rapid integration of AI chatbots into consumer search behavior has spawned a cottage industry of Generative Engine Optimization (GEO) advice, much of it built on untested assumptions about how AI platforms select sources for citation. Industry practitioners widely assert that Google ranking determines AI visibility, that community-consensus platforms like Reddit confer citation advantages, and that AI recommendations are too inconsistent to warrant optimization efforts. We tested these claims empirically across four major AI platforms - ChatGPT, Claude, Perplexity, and Gemini - using a multi-study design that combined large-scale query intent classification (n = 19,556 queries across 8 verticals), Google rank cross-referencing (120 queries via API, plus 100 queries via web UI against both Google and Bing Top-3 results), server-side fetch verification via Vercel middleware logging, and page-level technical analysis of 479 cited and non-cited pages. Our results challenge all three prevailing claims. These findings suggest that effective GEO strategy requires intent-aware, platform-specific optimization rather than the one-size-fits-all approach currently advocated by industry practitioners.
Keywords
Generative Engine Optimization, GEO, AI citation behavior, AI search, ChatGPT, Claude, Perplexity, Gemini, query intent, brand recommendations, live page fetch, robots.txt compliance
Citation
Lee, A. (2026). Query intent, not Google rank: What best predicts AI citation behavior. Preprint v3, AI+Automation.