🤖 THE "AI" MONOLITH MYTH
Most marketers treat "AI Search" as one big, scary thing.
They ask: "How do I optimize for AI?"
That is the wrong question. It's like asking, "How do I optimize for 'The Internet'?"
We ran an experiment to analyze 120 queries across 10 different industries (Health, Finance, Tech, etc.). We compared the citations from ChatGPT, Perplexity, and Gemini.
The results were shocking.
ChatGPT and Perplexity only agreed on 6.8% of their citations.
That means roughly 93% of the time, they cite completely different websites. They are not variations of the same system. They are practically different species.
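To make an "agreement" number like this concrete, here is a minimal sketch of how citation overlap between two engines could be scored. It is not necessarily the exact method behind the figures above: it assumes agreement means the share of cited domains both engines have in common (Jaccard overlap), and the function names and URLs are made-up placeholders.

```python
from urllib.parse import urlparse


def cited_domains(urls):
    """Reduce citation URLs to a set of bare domains (assumed normalization)."""
    domains = set()
    for url in urls:
        host = urlparse(url).netloc.lower()
        if host.startswith("www."):
            host = host[4:]
        if host:
            domains.add(host)
    return domains


def citation_overlap(urls_a, urls_b):
    """Jaccard overlap: shared domains divided by all distinct domains cited."""
    a, b = cited_domains(urls_a), cited_domains(urls_b)
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


# Placeholder citations for a single query (not real experiment data).
chatgpt_citations = [
    "https://www.example.com/guide",
    "https://reviews.example.org/best-tents",
]
perplexity_citations = [
    "https://example.com/news",
    "https://blog.example.net/tent-roundup",
]

# One shared domain out of three distinct domains.
print(f"{citation_overlap(chatgpt_citations, perplexity_citations):.1%}")
```

Averaging this score across every query in a test set gives a single agreement percentage per pair of engines. Comparing full URLs instead of domains would be stricter and would push the number lower.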
⚡ The Quick Facts (The Architecture Matrix)
To win, you have to understand how they hunt.
| Metric | ChatGPT | Perplexity | Gemini |
|---|---|---|---|
| Primary Source | Bing Index (plus live fetch) | Proprietary Index | Google Index |
| Crawling Behavior | Live-Fetches User URLs | Background Crawling (Only) | No Bot (Uses Googlebot) |
| Freshness Bias | High (fetches live) | Extreme (favors <1 month) | Tied to Google |
| Agreement with Google | 7.8% (Very Low) | 29.7% (Medium) | ~100% (It is Google) |
🧪 1. CHATGPT: THE "LIVE FETCHER"
Architecture: ChatGPT uses Bing to find your page, but then it sends its own bot (ChatGPT-User) to actually read it.
The Insight: ChatGPT is trying to be "live," but it relies on Bing's map, which is often outdated. In our "llms.txt" experiment, ChatGPT tried to visit 12 pages on our site. 11 of them were deleted pages (404s).
Why? Because Bing hadn't updated its index yet.
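You can check for the same pattern on your own site. The sketch below scans access-log lines for requests where a `ChatGPT-User` visitor received a 404. It assumes your server writes the common "combined" log format. The sample lines, IP addresses, and user-agent strings are illustrative placeholders, not real traffic.

```python
import re

# Pulls the request path, status code, and user agent out of a
# "combined" format log line (assumed format; adjust for your server).
LOG_PATTERN = re.compile(
    r'"(?:GET|HEAD|POST) (?P<path>\S+) [^"]*" (?P<status>\d{3}) \S+ '
    r'"[^"]*" "(?P<agent>[^"]*)"'
)


def dead_pages_fetched(log_lines, bot_token="ChatGPT-User"):
    """Return the paths where the given bot was served a 404."""
    dead = []
    for line in log_lines:
        match = LOG_PATTERN.search(line)
        if match and bot_token in match["agent"] and match["status"] == "404":
            dead.append(match["path"])
    return dead


# Illustrative log lines only.
sample_log = [
    '203.0.113.5 - - [01/Jan/2025:10:00:00 +0000] "GET /old-page HTTP/1.1" '
    '404 512 "-" "Mozilla/5.0; compatible; ChatGPT-User/1.0"',
    '203.0.113.5 - - [01/Jan/2025:10:00:05 +0000] "GET /pricing HTTP/1.1" '
    '200 2048 "-" "Mozilla/5.0; compatible; ChatGPT-User/1.0"',
    '198.51.100.7 - - [01/Jan/2025:10:01:00 +0000] "GET /old-page HTTP/1.1" '
    '404 512 "-" "Mozilla/5.0 (Windows NT 10.0)"',
]

print(dead_pages_fetched(sample_log))
```

Any path this returns is a candidate for a redirect, or for a fresh sitemap submission to Bing so the stale entry drops out of its index.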
The Strategy:
- Get Indexed in Bing: If you aren't in Bing, you are invisible to ChatGPT.
- Authoritative Bias: ChatGPT loves "Editorial Authority." It cites Wikipedia, Forbes, and TechRadar far more than niche blogs.
- Pro-Niche Authority: For commercial queries, it ignores Google's top results (only 4.4% overlap). But it doesn't hate reviews - it hates generic listicles. It cites deep, niche experts (like specialized outdoor or tech review sites) ~70% of the time, skipping the big generalist publishers.
🧪 2. PERPLEXITY: THE "FRESHNESS FREAK"
Architecture: Perplexity maintains its own index (PerplexityBot). Unlike ChatGPT, it does not visit your site during the conversation. It reads its own memory.
The Insight: Perplexity is obsessed with recency. For news and tech topics, the average citation age was 1.8 days.
If your article hasn't been updated in 6 months, Perplexity often treats it as "expired milk."
The Insight: It is also the most volatile. If you ask it the same question three times, you will often get three different lists of sources. It is a "Discovery Engine," always rotating new content.
The Strategy:
- Update Schema Dates: Ensure your `dateModified` is current.
- Publish Frequently: It rewards velocity.
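For reference, `dateModified` is a property of schema.org types such as `Article`, usually published as JSON-LD. Here is a minimal sketch of generating that markup. The helper name, headline, and dates are placeholders, and how much weight any engine gives the field is our reading of the data, not something this code proves.

```python
import json
from datetime import date


def article_json_ld(headline, published, modified):
    """Build schema.org Article markup with ISO 8601 dates."""
    return {
        "@context": "https://schema.org",
        "@type": "Article",
        "headline": headline,
        "datePublished": published.isoformat(),
        "dateModified": modified.isoformat(),
    }


markup = article_json_ld(
    headline="Example headline",   # placeholder
    published=date(2025, 1, 10),   # placeholder
    modified=date(2025, 6, 2),     # placeholder
)

# Paste the output inside a <script type="application/ld+json"> tag.
print(json.dumps(markup, indent=2))
```

Only bump `dateModified` when the content actually changes. A date that moves while the page stays identical is easy to spot.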
🧪 3. GEMINI: THE "INVISIBLE PASSENGER"
Architecture: Gemini is Google.
There is no "GeminiBot" visiting your server. It uses the Googlebot data that powers standard Search.
The Insight: If you have zero visibility in Google Search, you have zero visibility in Gemini.
The Strategy:
- Traditional SEO: Everything you do for Google (Backlinks, Core Web Vitals, Content) applies here directly.
🧠 THE "REDDIT PARADOX"
A correction: our original finding, "AI chatbots cite Reddit zero times," was incomplete. We tested through the API. When we retested through the web UI (how real humans use these tools), the numbers looked completely different.
| Platform | API Reddit Citations | Web UI Reddit Citations |
|---|---|---|
| Google AI Mode | N/A | 44% |
| Perplexity | 0% | 20% |
| ChatGPT | 0% | 17% |
| Claude | 0% | 0% |
For "best X for Y" recommendation queries specifically, Reddit appeared in 71% of Google AI Mode responses and 46% of Perplexity responses.
Why the gap? The API and the web UI use different retrieval pipelines. The web UI does a live search (where Reddit dominates), assembles results, and cites them. The API uses a more curated, restricted index, likely filtered for enterprise liability and quality-control reasons.
The deeper layer: Even when AI doesn't cite Reddit, it still thinks like Reddit. There is a moderate positive correlation (0.554) between what Reddit communities upvote and what AI recommends. Reddit is baked into the training data. The brands Reddit likes are the brands AI suggests, with or without a citation link.
Reddit's relationship with AI operates at three layers:
- Training data (invisible): Reddit shapes what AI recommends regardless of citations.
- Web UI (visible): Reddit is cited 17-44% of the time when users interact through a browser.
- API (excluded): Reddit is filtered out when developers call AI programmatically.
The Insight: Don't write off Reddit for AI visibility. The access channel (API vs. browser) determines whether Reddit shows up as a source. For consumer-facing, recommendation-style queries, Reddit presence matters significantly.
🛠️ THE PLAYBOOK: HOW TO KNOW WHO IS CRAWLING YOU
You can't "Optimize for AI" because "AI" isn't one thing.
- To win ChatGPT, you need Bing technical SEO + Editorial Tone.
- To win Perplexity, you need Freshness + Structured Data.
- To win Gemini, you need Google Authority.
But how do you know if it's working?
You can't guess. You have to look at the traffic.
- ChatGPT appears in your logs as `ChatGPT-User`.
- Perplexity appears as `PerplexityBot` (in background crawls).
- Gemini is invisible in logs, but tracks with your GSC performance.
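If you want a quick look before reaching for a dashboard, the tally below counts which pages each bot requested. It is a rough sketch that assumes "combined" format access logs and matches on the two user-agent substrings named above. The function name and sample lines are illustrative placeholders.

```python
from collections import Counter

# User-agent substrings named in this article; extend as needed.
BOT_TOKENS = ("ChatGPT-User", "PerplexityBot")


def bot_hits(log_lines):
    """Count (bot, path) pairs in 'combined' format log lines."""
    hits = Counter()
    for line in log_lines:
        parts = line.split('"')
        # Combined format: parts[1] is the request line, parts[5] the user agent.
        if len(parts) < 6:
            continue
        request, agent = parts[1].split(), parts[5]
        if len(request) < 2:
            continue
        for token in BOT_TOKENS:
            if token in agent:
                hits[(token, request[1])] += 1
    return hits


# Illustrative log lines only.
sample_log = [
    '203.0.113.5 - - [01/Jan/2025:10:00:00 +0000] "GET /pricing HTTP/1.1" '
    '200 2048 "-" "Mozilla/5.0; compatible; ChatGPT-User/1.0"',
    '203.0.113.5 - - [01/Jan/2025:10:02:00 +0000] "GET /pricing HTTP/1.1" '
    '200 2048 "-" "Mozilla/5.0; compatible; ChatGPT-User/1.0"',
    '192.0.2.9 - - [01/Jan/2025:10:03:00 +0000] "GET /blog/launch HTTP/1.1" '
    '200 4096 "-" "Mozilla/5.0; compatible; PerplexityBot/1.0"',
    '198.51.100.7 - - [01/Jan/2025:10:04:00 +0000] "GET /pricing HTTP/1.1" '
    '200 2048 "-" "Mozilla/5.0 (Macintosh)"',
]

for (bot, path), count in bot_hits(sample_log).most_common():
    print(bot, path, count)
```

User-agent strings can be spoofed, so treat these counts as a first pass rather than proof of who visited.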
We use BotSight to track all of this in one dashboard. It tells you exactly which bot is visiting which page, so you can see your "Visibility Gap."
Are you ranking on Google (Gemini) but missing out on ChatGPT (Bing)? Are you fresh enough for Perplexity?
Don't fly blind. The bots are already deciding if you matter. Make sure they can see you.
CHEERS!