How is the token count calculated?

For OpenAI models (GPT-5.x, o3) we use BPE encoding via gpt-tokenizer (MIT license, runs entirely in your browser) — an approximation, since OpenAI's newest tokenizer revisions are not published as JS libraries. Claude uses words × 1.33 (Anthropic's published approximation). Gemini uses characters ÷ 4. All approximations are disclosed inline.

Why do different models give different token counts for the same text?

Each model uses a different tokenizer with a different vocabulary. GPT uses BPE (Byte Pair Encoding), Claude uses a similar BPE vocabulary but different merges, Gemini uses SentencePiece. The same word may tokenize differently — for example, "tokenization" could be 1 token in one vocabulary and 2 tokens in another.

What is the context window?

The context window is the maximum number of tokens a model can process in a single request — both your prompt (input) and the generated response (output) must fit within it. Exceeding the context window causes truncation or an API error.

Is my text sent to any server?

No. All tokenization runs in your browser using a WebAssembly/JavaScript library. Your text is never sent to Anthropic, OpenAI, Google, or any other service.

AI Token Counter — GPT-5, Claude, Gemini Free Online

CodeLint.Dev Dev Tools

Developer Tools

JSON Tools JSON Minify JSON Escape / Unescape Code Formatter GeoJSON Tools Regex Checker Regex Generator Diff Checker Webhook Tester

AI Tools

LLM Model Comparison Token Counter Prompt Builder AI Cost Calculator Context Window Visualizer Prompt Diff / A-B Tester Regex for AI Outputs System Prompt Analyzer

Currency & Forex

Currency Converter Exchange Rate Tracker Crypto Price Tracker Crypto Unit Converter

Blog

AI Tools

LLM Model Comparison
Token Counter
Prompt Builder
AI Cost Calculator
Context Window Visualizer
Prompt Diff / A-B Tester
Regex for AI Outputs
System Prompt Analyzer

Token Counter & Cost Estimator

GPT models use tiktoken (cl100k/o200k). Claude ≈ words × 1.33. Gemini ≈ chars ÷ 4. 100% client-side.

Your Text

Paste text to count tokens

Cost estimates update live across all major models

About

The Token Counter uses the gpt-tokenizer library (MIT-licensed, pure JavaScript) to count exact GPT tokens via the cl100k_base and o200k_base encodings. Claude token counts use a calibrated approximation (words × 1.33) and Gemini uses characters ÷ 4 — both are disclosed inline. The tool shows token count, character count, word count, and line count for the input text, plus a results table listing all 11 supported models (July 2026 lineup) with their tokens, input cost estimate, output cost estimate, and a context-window utilization bar. All processing is local — your text never leaves the browser.

How to use

1 Paste or type your text in the editor on the left.
2 Instantly see the token count, character count, and word count update for all models.
3 Review the per-model table showing exact or approximate token counts and cost estimates.
4 The context bar shows what percentage of each model's context window your text occupies.
5 Note: GPT models use exact tiktoken counts; Claude and Gemini use disclosed approximations.

How is the token count calculated?: For OpenAI models (GPT-5.x, o3) we use BPE encoding via gpt-tokenizer (MIT license, runs entirely in your browser) — an approximation, since OpenAI's newest tokenizer revisions are not published as JS libraries. Claude uses words × 1.33 (Anthropic's published approximation). Gemini uses characters ÷ 4. All approximations are disclosed inline.
Why do different models give different token counts for the same text?: Each model uses a different tokenizer with a different vocabulary. GPT uses BPE (Byte Pair Encoding), Claude uses a similar BPE vocabulary but different merges, Gemini uses SentencePiece. The same word may tokenize differently — for example, "tokenization" could be 1 token in one vocabulary and 2 tokens in another.
What is the context window?: The context window is the maximum number of tokens a model can process in a single request — both your prompt (input) and the generated response (output) must fit within it. Exceeding the context window causes truncation or an API error.
Is my text sent to any server?: No. All tokenization runs in your browser using a WebAssembly/JavaScript library. Your text is never sent to Anthropic, OpenAI, Google, or any other service.

CodeLint.Dev Dev Tools

Your code never touches our servers. Everything runs locally in your browser.

WCAG 2.1 AA

Tools

JSON Tools JWT Encoder / Decoder Hash Generator Morse Code Password Generator 1D & 2D Barcode Generator QR Code Generator Code Formatter CSV ⇄ JSON GeoJSON Tools UUID Generator Percentage Calculator Countdown Timer Stopwatch Date Calculator Clock Time Utility Regex Checker String Tuner Countries Reference

Company

About Contact Privacy Policy Cookie Policy Terms of Service Do Not Sell

Support

Support / Donate

We and our partners (including Google) use cookies to serve ads based on your prior visits to this and other sites. Your tool data is never collected or shared. Cookie Policy