Question 1

Are these real token counts or an estimate?

Accepted Answer

Real. The tool loads the model's actual tokenizer (for example o200k_base for GPT-4o, the Llama 3 tokenizer, or WordPiece for BERT) via transformers.js and runs it in your browser. The count is therefore the model's own tokenization of your text, which is what API billing is based on, rather than a heuristic such as characters divided by four.

Question 2

Is my text uploaded anywhere?

Accepted Answer

No. The tokenizer runs entirely in your browser, so your text never leaves your device. The only download is the tokenizer itself, which is small (a few megabytes, with no model weights).

Question 3

Why do GPT, Llama and BERT give different counts?

Accepted Answer

Each model's tokenizer has its own vocabulary and its own splitting rules (for example BPE merges or WordPiece), so the same text splits into a different number of tokens. That is why the tool lets you compare them: a prompt that is 100 tokens for GPT-4o may come out as a different number for Llama or BERT.
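
To make the effect of the vocabulary concrete, here is a minimal sketch using a toy greedy longest-match splitter. It is not real BPE or WordPiece, and both vocabularies (`VOCAB_A`, `VOCAB_B`) are invented for illustration; the point is only that the same text yields different token counts once the vocabulary changes.

```python
from __future__ import annotations


def greedy_tokenize(text: str, vocab: set[str]) -> list[str]:
    """Split text by repeatedly taking the longest vocabulary entry that matches.

    Toy algorithm for illustration only; real tokenizers are more involved.
    """
    tokens: list[str] = []
    max_len = max(len(entry) for entry in vocab)
    i = 0
    while i < len(text):
        # Try the longest candidate first; fall back to a single character
        # so that text outside the vocabulary still gets tokenized.
        for size in range(min(max_len, len(text) - i), 0, -1):
            piece = text[i:i + size]
            if size == 1 or piece in vocab:
                tokens.append(piece)
                i += size
                break
    return tokens


# Two hypothetical vocabularies, standing in for two different models.
VOCAB_A = {"token", "izer", "s", " count"}
VOCAB_B = {"tok", "en", "iz", "er", "s", " ", "count"}

text = "tokenizers count"
tokens_a = greedy_tokenize(text, VOCAB_A)
tokens_b = greedy_tokenize(text, VOCAB_B)

print(len(tokens_a), tokens_a)  # 4 tokens with the first vocabulary
print(len(tokens_b), tokens_b)  # 7 tokens with the second vocabulary
```

Note that the first vocabulary stores the space as part of " count", while the second treats the space as its own token, which is one common source of differing counts.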

Question 4

How accurate is the cost estimate?

Accepted Answer

It multiplies your token count by each model's published per-token price. Prices change often and vary by provider and by input vs output, so treat it as an approximate input-side estimate and confirm current pricing with your provider.
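
The arithmetic behind such an estimate can be sketched as follows. The model names and prices are placeholders, not real rates, and the sketch assumes prices are quoted in US dollars per million input tokens, which may not match how the tool stores them.

```python
# Hypothetical input prices in USD per one million tokens (not real rates).
HYPOTHETICAL_INPUT_PRICES = {
    "model-a": 2.50,
    "model-b": 0.60,
}


def estimate_input_cost(token_count: int, usd_per_million_tokens: float) -> float:
    """Return the approximate input-side cost in USD for a given token count."""
    if token_count < 0:
        raise ValueError("token_count must be non-negative")
    return token_count / 1_000_000 * usd_per_million_tokens


token_count = 1_200  # e.g. the count reported for a prompt
for model, price in HYPOTHETICAL_INPUT_PRICES.items():
    cost = estimate_input_cost(token_count, price)
    print(f"{model}: ${cost:.6f}")
```

Output tokens are usually priced separately, so a full estimate would need a second price and a second token count for the response.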

Question 5

What is the coloured token view?

Accepted Answer

Each box is one token as the tokenizer sees it, coloured so adjacent tokens are easy to tell apart. Leading spaces show as a middle dot and newlines as a return glyph, which makes it clear that, for many tokenizers, a leading space is part of the token.
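
A rough sketch of how such a display could be produced is shown below. The function name `display_token` is hypothetical, and the sketch simplifies by marking every space, not only a leading one; the tool's exact rules and glyphs are not documented here, so treat the specific characters as assumptions.

```python
MIDDLE_DOT = "\u00b7"    # "·", stands in for a space
RETURN_GLYPH = "\u21b5"  # "↵", stands in for a newline


def display_token(token: str) -> str:
    """Make whitespace inside a token visible for on-screen display."""
    return token.replace(" ", MIDDLE_DOT).replace("\n", RETURN_GLYPH)


# Hypothetical token list, as a tokenizer with space-prefixed tokens might emit.
tokens = ["Hello", " world", "\n", " again"]
boxes = [f"[{display_token(t)}]" for t in tokens]
print(" ".join(boxes))
```

With the space shown as a dot, it is visible at a glance that " world" and "world" would be two different tokens.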

