Token (LLM)

A token is the basic unit of text a language model processes. In English, a token is typically a word fragment of about four characters, or roughly three-quarters of a word. Models read, generate, and price everything in tokens, and token limits shape how much of a web page an AI can ingest when composing an answer.

How tokenization works

Before a model sees text, a tokenizer splits it into tokens drawn from a fixed vocabulary: common words become single tokens, rarer words split into pieces ("optimization" might become "optim" and "ization"). English averages roughly four characters per token; other languages and unusual strings tokenize less efficiently.
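As a rough illustration of splitting text against a fixed vocabulary, here is a minimal sketch. It uses greedy longest-match splitting, which is a simplification and not the byte-pair encoding that production tokenizers generally use. The vocabulary and the function name `tokenize` are invented for this example.

```python
# Toy vocabulary, invented for illustration. Real vocabularies are far
# larger and are learned from data rather than written by hand.
VOCAB = {"optim", "ization", "token", "the", "s", " "}


def tokenize(text, vocab=VOCAB):
    """Split text by greedy longest match against a fixed vocabulary.

    Characters that match no vocabulary entry become one-character tokens.
    """
    tokens = []
    max_len = max(len(entry) for entry in vocab)
    i = 0
    while i < len(text):
        # Try the longest candidate first and shrink until something matches.
        for length in range(min(max_len, len(text) - i), 0, -1):
            piece = text[i:i + length]
            if piece in vocab or length == 1:
                tokens.append(piece)
                i += length
                break
    return tokens


print(tokenize("optimization"))  # ['optim', 'ization']
print(tokenize("the tokens"))    # ['the', ' ', 'token', 's']
print(tokenize("xq"))            # ['x', 'q']
```

The last call shows why unusual strings tokenize less efficiently: with no matching vocabulary entries, every character becomes its own token.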

Everything downstream is counted in tokens: the context window is a token budget, API pricing is per token, and generation speed is tokens per second. Tokens are the currency of the entire LLM economy.
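The arithmetic behind those three budgets can be sketched as follows. This is a back-of-envelope estimate built on the four-characters-per-token average, not a real tokenizer count. The function names, the price, the window size, and the speed are all hypothetical values chosen for the example.

```python
import math

CHARS_PER_TOKEN = 4  # rough English average; actual counts vary by tokenizer


def estimate_tokens(text):
    """Estimate the token count of English text from its character length."""
    return math.ceil(len(text) / CHARS_PER_TOKEN)


def fits_in_context(token_count, context_window):
    """Return True if the text fits inside the model's token budget."""
    return token_count <= context_window


def estimate_cost(token_count, price_per_million_tokens):
    """Cost at a per-token price quoted per million tokens."""
    return token_count / 1_000_000 * price_per_million_tokens


def generation_seconds(token_count, tokens_per_second):
    """Time to generate token_count tokens at a given speed."""
    return token_count / tokens_per_second


page = "x" * 8000                      # stand-in for an 8,000-character page
page_tokens = estimate_tokens(page)    # 2000
print(page_tokens)
print(fits_in_context(page_tokens, context_window=1500))  # False
print(estimate_cost(page_tokens, price_per_million_tokens=3.0))
print(generation_seconds(500, tokens_per_second=50))      # 10.0
```

Under these assumed numbers the page does not fit a 1,500-token budget, so a system would have to truncate, summarize, or retrieve only part of it.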

Why tokens matter for content and visibility

When an AI engine retrieves your page during RAG, it does not pass the whole page to the model. Instead, it chunks the content into token-sized passages and injects only the most relevant ones, often just a few hundred tokens. If your key claim is spread across a meandering 800-word section, it may never fit cleanly into a single retrieved chunk.

Dense, front-loaded passages survive chunking: put the answer in the first sentence and the support after it. Token efficiency is the unglamorous reason concise expert writing outperforms padded content in AI answers.
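The chunk-and-select step described in this section can be sketched in a few lines. This is a simplified illustration under stated assumptions: words stand in for tokens, and a plain word-overlap count stands in for the similarity scoring that real retrieval systems use. All function names here are hypothetical.

```python
PUNCTUATION = ".,;:!?\"'()"


def normalize(text):
    """Lowercase the text and return its words with edge punctuation removed."""
    return [word.strip(PUNCTUATION) for word in text.lower().split()]


def chunk_words(text, chunk_size=50):
    """Split text into consecutive chunks of at most chunk_size words."""
    words = text.split()
    return [
        " ".join(words[i:i + chunk_size])
        for i in range(0, len(words), chunk_size)
    ]


def overlap_score(query, chunk):
    """Count the distinct query words that also appear in the chunk."""
    return len(set(normalize(query)) & set(normalize(chunk)))


def retrieve(query, text, chunk_size=50, top_k=1):
    """Return the top_k chunks ranked by word overlap with the query."""
    chunks = chunk_words(text, chunk_size)
    ranked = sorted(chunks, key=lambda c: overlap_score(query, c), reverse=True)
    return ranked[:top_k]


page = (
    "filler " * 10
    + "A token is the basic unit of text. "
    + "padding " * 10
)
best = retrieve("What is a token?", page, chunk_size=10, top_k=1)
print(best[0])
```

Only the chunk holding the definition is passed along; the filler chunks before and after it are dropped. A claim split across two chunks would score lower in each, which is the practical case for keeping an answer together in one short passage.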

Practical token awareness for marketers

You do not need to count tokens, but you should write as if every retrieved chunk must justify its budget: one idea per section, no preamble, concrete facts early. Unusual brand spellings and stylized names tokenize into fragments, which is one more reason consistent naming across the web helps models recognize your entity reliably.

Token costs also explain platform behavior: engines summarize rather than reproduce, cite a handful of sources rather than dozens, and truncate long pages. All of this is budget management you can design content around.

Frequently asked questions

How many tokens is a typical web page?

A 1,500-word article is roughly 2,000 tokens in English, at an average of about 0.75 words per token. AI engines rarely ingest full pages when answering; they retrieve chunks of a few hundred tokens each. That is why individual passages, not whole pages, compete for inclusion in AI answers.

Why do tokens matter for AI search visibility?

Retrieval systems chunk your content into token-limited passages and inject only the best ones into the model's context. Key claims buried mid-section can miss the chunk that gets retrieved. Front-loading answers and keeping sections focused increases the odds a chunk containing your message reaches the model.

Do tokens affect how AI handles my brand name?

Somewhat. Common, consistently spelled names tokenize cleanly and are recognized reliably; stylized spellings, unusual punctuation, or inconsistent variants fragment into pieces and weaken entity recognition. Using one canonical brand spelling everywhere helps models associate your name with your category and facts.

Last updated: 2026-06-11
