Language models don't read words. They read tokens — roughly three-quarter-word chunks — and count them the way a meter counts water. One million is the number you'll hear most often. Here's what it actually holds.
One million tokens is about three-quarters of a million English words: roughly eight average novels, 2,700 paperback pages, 65 minutes of video, or 8.7 hours of audio.
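The back-of-envelope arithmetic behind those figures is worth making explicit. A minimal sketch, using the article's 0.75 words-per-token rule; the novel length (~90,000 words) and paperback page density (~280 words) are assumptions of mine, not from the text:

```python
# Convert 1M tokens into the article's equivalents.
TOKENS = 1_000_000
WORDS_PER_TOKEN = 0.75      # the article's rule of thumb
WORDS_PER_NOVEL = 90_000    # assumed average novel length
WORDS_PER_PAGE = 280        # assumed paperback word density

words = TOKENS * WORDS_PER_TOKEN     # 750,000 words
novels = words / WORDS_PER_NOVEL     # ≈ 8.3 novels
pages = words / WORDS_PER_PAGE       # ≈ 2,679 pages
print(f"{words:,.0f} words ≈ {novels:.1f} novels ≈ {pages:,.0f} pages")
```

Change the assumed novel length and the novel count moves with it, which is why "roughly eight" is as precise as the claim should get.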
A token is a fragment of text — sometimes a whole word, sometimes a piece of one. Models don’t read letters or even always words; they read these fragments, and count them relentlessly.
For English text, the rule of thumb is simple: one token is about four characters, or three-quarters of a word. A short word like “the” is one token. A long, unusual word like “tokenisation” might be three. Punctuation, spaces, and emojis all count too.
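The four-characters-per-token rule of thumb is easy to turn into a rough estimator. A sketch, not a real tokeniser; actual tokenisers split on learned subword boundaries and will disagree at the margins:

```python
def estimate_tokens(text: str) -> int:
    """Rough token count for English text: ~4 characters per token.

    This is the article's rule of thumb, not a real tokeniser.
    """
    return max(1, round(len(text) / 4))

estimate_tokens("the")           # 1 token
estimate_tokens("tokenisation")  # 3 tokens
```

For anything that matters (billing, context limits), use the provider's own tokeniser rather than an estimate.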
Tokens are how a model measures its appetite, and how you get billed. When a provider advertises a “1M token context window,” or charges $3 per million tokens, that number is what they mean.
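Since prices are quoted per million tokens, the billing arithmetic is a single division. A sketch using the $3-per-million example rate from the text; real prices vary by model and often differ for input and output tokens:

```python
def cost_usd(tokens: int, usd_per_million: float = 3.0) -> float:
    """Cost of a request at a per-million-token price.

    $3/M is the example rate from the text; check your provider's
    pricing page for real input/output rates.
    """
    return tokens * usd_per_million / 1_000_000

cost_usd(1_000_000)  # 3.0  -- a full million-token context, once
cost_usd(250_000)    # 0.75 -- a quarter of the window
```

Note that the whole context is re-sent (and re-billed) on every turn of a conversation, so the per-request number understates the cost of a long session.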
≈ eight average novels, spine to spine
Stack eight ordinary paperbacks on a desk. That is the amount of prose a modern model can hold in its head at once, if you hand it every page.
Video and audio cost a different rate. Google’s Gemini models tokenise video at about 258 tokens per second by default, and audio at about 32 tokens per second. Other providers land in the same neighbourhood, with similar trade-offs between fidelity and cost.
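Those per-second rates make the media equivalents a one-line calculation each. A sketch using the Gemini default rates quoted above:

```python
# Default Gemini tokenisation rates from the text.
VIDEO_TOKENS_PER_SEC = 258
AUDIO_TOKENS_PER_SEC = 32

def media_seconds(budget_tokens: int, rate_per_sec: int) -> float:
    """How many seconds of media fit in a given token budget."""
    return budget_tokens / rate_per_sec

video_min = media_seconds(1_000_000, VIDEO_TOKENS_PER_SEC) / 60    # ≈ 64.6 minutes
audio_hr = media_seconds(1_000_000, AUDIO_TOKENS_PER_SEC) / 3600   # ≈ 8.7 hours
```

The eightfold gap between the two rates is the article's fidelity-versus-cost trade-off in miniature: video carries frames as well as sound, so each second costs far more of the window.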
Slide from a handful of tokens up to ten million. Every equivalent follows along.