Glossary

AI Crawlers & Technical

Bot Traffic

Bot traffic is automated, non-human activity on a website, ranging from search and AI crawlers to scrapers and malicious bots. In AI search measurement, the relevant slice is crawls by AI bots like GPTBot or PerplexityBot, which signal that AI platforms are reading your content but are invisible to client-side analytics.

The bot landscape on a modern site

A large fraction of requests hitting any public website come from machines: search crawlers, AI crawlers, uptime monitors, SEO tools, scrapers and attack bots. For AI visibility work, the interesting subset is well-identified AI bots, training crawlers like GPTBot and ClaudeBot, retrieval crawlers like OAI-SearchBot and PerplexityBot, and user-triggered fetchers like ChatGPT-User. Each visit is evidence that an AI system is ingesting or retrieving your content, which is upstream of every citation and mention you will ever earn.

Why client-side analytics cannot see it

Tools like Google Analytics count visitors by executing JavaScript in a browser. Bots fetch HTML and leave; the tracking script never runs, so the visit never registers. This creates a systematic blind spot: a page can be crawled daily by five AI platforms while its analytics show nothing. The only reliable observation points are server logs, CDN logs, or an edge worker that inspects the user agent of every request before the response is served. Server-side detection also catches the reverse problem: spoofed bots faking AI user agents, identifiable by checking against published IP ranges.

Turning bot logs into GEO signal

Raw bot hits become useful when segmented by bot, page and time. Crawl coverage tells you which content AI platforms can actually see; crawl frequency shifts often precede citation changes; absence of crawls on key pages flags robots.txt mistakes or rendering problems. Pair bot data with AI referral traffic to see the full chain from crawl to citation to human click. Geonimo captures this server-side through a Cloudflare Worker, classifying each AI bot and mapping its activity page by page in AI traffic analytics.

Frequently asked questions

Is AI bot traffic good or bad for my site?

Identified AI crawler traffic is generally positive: it means AI platforms are reading content that can later surface in answers and citations. The costs are bandwidth and the training-data question. Malicious or unidentified bot traffic is a separate problem handled with rate limiting and bot management, not robots.txt.

How do I distinguish real AI bots from spoofed ones?

User agent strings can be faked by anyone. Major operators publish official IP ranges or support reverse DNS verification, so authentic GPTBot or Googlebot requests resolve to their infrastructure. Edge-level checks comparing claimed user agent against source IP catch impostors scraping under a trusted bot's name.

Should bot traffic be excluded from my analytics reports?

It should be separated, not discarded. Mixing bot hits into human metrics corrupts conversion and engagement data, but AI bot activity is valuable intelligence in its own right. The clean setup is two views: human analytics from client-side tracking, and a dedicated server-side view of AI crawler behavior.

Related terms

Last updated: 2026-06-11

Track this for your brand

Geonimo monitors how ChatGPT, Perplexity, Claude, Gemini and Google AI talk about your brand — and generates the content that gets you cited.

Get your free audit