Glossary

AI Crawlers & Technical

Google-Extended

Google-Extended is a robots.txt control token that lets site owners opt out of having their content used to train Google's Gemini models. It is not a separate crawler; Googlebot still crawls normally. Blocking Google-Extended does not remove a site from Google Search or AI Overviews, which follow standard search indexing.

A control token, not a crawler

Google-Extended never appears in your server logs as a visiting bot. It is a directive token that Google's existing crawlers check when deciding whether fetched content may be used for Gemini model training. Adding User-agent: Google-Extended with Disallow: / to robots.txt tells Google: keep crawling and indexing as usual, but do not feed this content into generative model training. This design separates the training decision from the search indexing decision entirely.

The AI Overviews misconception

The most consequential misunderstanding about Google-Extended: blocking it does not remove you from Google AI Overviews or AI Mode. AI Overviews are built on Google's normal search index, populated by standard Googlebot crawling. The only way to stay out of AI Overviews is to restrict regular search indexing with noindex or nosnippet controls, which also damages your classic search presence. So publishers can block Google-Extended to withhold training data while keeping every bit of their Search and AI Overviews visibility intact. The two systems are governed independently.

Deciding your Google-Extended policy

Because blocking Google-Extended carries zero search or AI Overviews penalty, it is the lowest-risk training opt-out available, and many large publishers use it. The trade-off is subtler: content excluded from Gemini training may leave future Gemini models with weaker baseline knowledge of your brand, relevant as Gemini powers more Google surfaces. Brands prioritizing AI visibility typically leave it allowed; teams tracking how Gemini and AI Overviews represent their brand can monitor outcomes with Geonimo's multi-model monitoring before changing policy.

Frequently asked questions

Does blocking Google-Extended remove me from AI Overviews?

No. AI Overviews and AI Mode draw on Google's standard search index built by Googlebot. Google-Extended only controls whether your content trains Gemini models. To stay out of AI Overviews you would need to block normal indexing or snippets, which also harms classic Google Search visibility.

Should I block Google-Extended?

It is the safest training opt-out, since search rankings and AI Overviews are unaffected. Publishers protecting content value often block it. Brands wanting Gemini to know them well usually allow it, because training exposure shapes how the model describes them. Decide based on whether training inclusion helps or hurts you.

Will I see Google-Extended in my server logs?

No. Google-Extended is not a distinct bot with its own user agent visits; it is a robots.txt token that standard Google crawlers consult. Your logs will continue to show Googlebot and other Google fetchers. The directive changes how fetched content may be used, not who fetches it.

Related terms

Last updated: 2026-06-11

Track this for your brand

Geonimo monitors how ChatGPT, Perplexity, Claude, Gemini and Google AI talk about your brand — and generates the content that gets you cited.

Get your free audit