TokenScale · Changelog

2 July 2026 New Price Change

Fable 5 returns — and this time it keeps the flagship slot

Some models get a launch. Fable 5 got a saga. Anthropic's Mythos-class flagship held our Pro slot for exactly 48 hours in June — 10th to 12th — before a US export-control directive suspended it worldwide, and the board reverted to Opus 4.8. We left those two $10/$50 points in the history because they happened. Yesterday's audit confirmed it: Fable 5 is back on Anthropic's price list at $10/$50, and as of today it retakes the Anthropic Pro slot. Opus 4.8 gets the Sonnet 4.6 treatment — it moves to the all-models view at $5/$25, still listed, still available, nothing deleted.

Read the chart honestly: the Pro line shows the June blip, three weeks of Opus at $5/$25, and a step to $10/$50 today. That step is a model swap, badged as one — the slot changed which model it tracks; nobody's Opus bill doubled. In real terms, the priciest way to write The Hobbit on this board is now about $6.36 (127K output tokens at $50/M), against $0.08 on the cheapest tier. And yes, we note the poetry: the model that now tops the board is the same one that spent yesterday auditing it.
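The Hobbit figure above is one multiplication: token count times the per-million rate. A minimal sketch, assuming a hypothetical helper `cost_usd` (not TokenScale's own code) and the rounded 127K token count quoted above:

```python
def cost_usd(tokens: int, rate_per_million: float) -> float:
    """Dollar cost of `tokens` tokens at a list rate quoted in $ per million tokens."""
    return tokens * rate_per_million / 1_000_000

# 127K output tokens at Fable 5's $50/M output rate.
hobbit_output = cost_usd(127_000, 50.0)
print(f"${hobbit_output:.2f}")
```

With the rounded count this comes to $6.35; the quoted $6.36 presumably reflects an unrounded token count slightly above 127K.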

1 July 2026 Audit Data

The July drift audit — we re-checked all 66 tiers, and here's everything that was wrong

Once in a while the right move is to stop trusting your own machinery and check everything by hand. On 1 July we ran a full drift audit: every one of the 22 providers × 3 tiers on this board, re-verified against the provider's own pricing page, in one sweep, by Claude Fable 5. The score: 40 of 66 tiers checked out exactly. The other 26 are why this entry exists — and in the spirit of this site, we're publishing the misses, not just the fixes.

The worst find is a billing trap, not a typo. Our xAI lite tier still showed Grok 4.1 Fast at $0.20/$0.50. That model was retired on 15 May — but the old API slugs don't error. They silently redirect to Grok 4.3 and bill at $1.25/$2.50, roughly 6× the price you thought you were paying. If you had an agent pointed at that slug, your invoice already knows. The slot now shows Grok Build 0.1 ($1/$2) — xAI no longer sells a sub-dollar text model at all.
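To make the size of that trap concrete, here is a small sketch that compares the listed and billed rates per direction, plus a guard an agent could run on each response. It assumes the caller can read the serving model's name from the API response; the slugs `grok-4.1-fast` and `grok-4.3` are hypothetical placeholders, not confirmed identifiers.

```python
# Per-million-token rates quoted above.
LISTED = {"input": 0.20, "output": 0.50}   # Grok 4.1 Fast, as the board showed it
BILLED = {"input": 1.25, "output": 2.50}   # Grok 4.3, what the redirect bills

def price_multiples(listed: dict[str, float], billed: dict[str, float]) -> dict[str, float]:
    """How many times the expected rate each direction is actually billed at."""
    return {side: billed[side] / listed[side] for side in listed}

def is_silent_redirect(requested_slug: str, served_model: str) -> bool:
    """True when the model that answered is not the one that was requested.

    Assumption: the response exposes the serving model's name.
    """
    return requested_slug != served_model

multiples = price_multiples(LISTED, BILLED)
print(multiples)
print(is_silent_redirect("grok-4.1-fast", "grok-4.3"))  # hypothetical slugs
```

Input comes out at 6.25× and output at 5×, which is where "roughly 6×" comes from for an input-heavy workload.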

Two models on our board never existed. "Kimi K2.6 Turbo" and "MiniMax M2.7 Pro" appear in no official price list — they were plausible-sounding names that slipped in during fast-moving update nights and survived because their prices looked reasonable. A third, Llama 4 Behemoth, was announced in 2025 and never publicly shipped — we were quoting real prices for a ghost. All three are gone: the Kimi slot now tracks K2.7 Code HighSpeed ($1.90/$8), MiniMax gets its actual new flagship M3 ($0.30/$1.20 — flagship capability at lite money, the cheapest flagship tier on the board), and Meta's third slot falls back to Maverick, the biggest Llama you can actually buy.

Then there's the quiet Llama exodus. Eight of our 66 tiers pointed at Llama models their hosts no longer serve: Fireworks has dropped every serverless Llama, Cerebras retired both of its (one the day after our last hand-check), SambaNova's 405B has been gone for over a year, Together delisted the 3B and 405B, DeepInfra dropped the 405B, and Groq's DeepSeek R1 distill was shut down last October. Groq's two remaining Llamas die 16 August — diarised. The GPT-OSS models have quietly become the standard budget tier across the fast-inference hosts, and the board now reflects that. Mistral, meanwhile, had moved a full generation (Small 4, Medium 3.5, Large 3) — and yes, Mistral Large 3 now costs less than Mistral Medium 3.5. Their own FAQ still contradicts their own API table; we've flagged it and gone with the table. Alibaba's whole tracked lineup had been reclassified "legacy, not recommended" by Alibaba itself — replaced with the 3.6/3.7 generation. And Zhipu's "GLM-5-Air" was our mislabel: the $0.20/$1.10 model is GLM-4.5-Air. Right price, wrong name, ours.

The honest bit about our own machinery. The nightly price check did its job — it verified prices. What it couldn't see is that some of those prices belonged to models that had been retired, delisted, or never existed: it was faithfully price-checking ghosts. Three providers' "verified" stamps had quietly frozen at 26 May while the footer said "verified nightly." That's drift, it was ours, and it's the exact failure mode this site exists to catch in others. Fable found it, Fable fixed it, and model-existence checks are joining the nightly routine so the graveyard tends itself. Speaking of which: today's departures — the dead Llamas, the phantom Turbo and Pro, the never-born Behemoth — are all being laid to rest with dates and final prices in the Model Graveyard →
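A model-existence check can be as simple as a set difference between what the board tracks and what each provider currently lists. This is a sketch under stated assumptions, not the actual nightly routine: `find_ghosts` is a hypothetical name, and the `live` lists below are illustrative stand-ins for whatever a provider's price page shows.

```python
def find_ghosts(tracked: dict[str, set[str]], live: dict[str, set[str]]) -> dict[str, set[str]]:
    """Return, per provider, tracked model names absent from the provider's live list."""
    ghosts: dict[str, set[str]] = {}
    for provider, models in tracked.items():
        missing = models - live.get(provider, set())
        if missing:
            ghosts[provider] = missing
    return ghosts

# Illustrative data only: the tracked names are entries described in this audit.
tracked = {
    "Moonshot Kimi": {"Kimi K2.6 Turbo"},
    "MiniMax": {"MiniMax M2.7 Pro"},
    "Zhipu GLM": {"GLM-4.5-Air"},
}
live = {
    "Moonshot Kimi": {"K2.7 Code HighSpeed"},
    "MiniMax": {"M3"},
    "Zhipu GLM": {"GLM-4.5-Air", "GLM-5.2"},
}
ghosts = find_ghosts(tracked, live)
print(ghosts)
```

A price check alone would pass all three rows if the numbers looked plausible; the existence check flags the first two regardless of price.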

What survived the audit untouched: Google, OpenAI, Anthropic, DeepSeek, Cohere, Perplexity, OpenRouter and Sakana — every tier exact. AWS Bedrock remains the one provider we could not re-verify from source (their pricing page won't render without JavaScript); it's consistent with Anthropic's own global-endpoint pricing, and it keeps its 23 June stamp until we can read it from AWS itself. That's the whole ledger. Next full audit: when the data earns one — the nightly watch continues in the meantime, every night, at Price Moves →

30 June 2026 New Price Change

Claude Sonnet 5 takes the mid tier — same sticker price, a heavier tokenizer

Anthropic launched Claude Sonnet 5 on 30 June. It steps into TokenScale's Anthropic Mid slot in place of Sonnet 4.6, at the same standard list price — $3/M in, $15/M out — and Anthropic says it closes much of the gap to Opus. Sonnet 4.6 isn't deleted: it moves to the "all models" view, still listed and still priced, the way we kept Opus alongside Fable rather than rewriting the record.

Two things worth knowing before you point an agent at it. First, a launch discount: through 31 August, Sonnet 5 runs at $2/M in, $10/M out, then reverts to $3/$15. We track the standing list price as canonical, so the headline number stays $3/$15 and the discount is a temporary window, not history. Second — and very much our beat — Sonnet 5 ships a new tokenizer that turns the same text into roughly 30% more tokens. The per-token price is unchanged, but the same email or novel can cost about a third more to run. That's exactly the kind of hidden shift a sticker price hides and a content-size lens makes visible.
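The tokenizer effect is easy to put in numbers. A minimal sketch, assuming a hypothetical 100K-token input (as counted by the old tokenizer) and treating "roughly 30% more tokens" as exactly 30%; `effective_cost` is an illustrative helper, not part of any API:

```python
def effective_cost(baseline_tokens: int, rate_per_million: float,
                   token_inflation: float = 0.0) -> float:
    """Cost of a text whose token count grows by `token_inflation` (0.30 = +30%)."""
    tokens = baseline_tokens * (1 + token_inflation)
    return tokens * rate_per_million / 1_000_000

baseline = 100_000  # hypothetical input size, in old-tokenizer tokens

old = effective_cost(baseline, 3.00)          # Sonnet 4.6 input at $3/M
new = effective_cost(baseline, 3.00, 0.30)    # Sonnet 5 input at list price
promo = effective_cost(baseline, 2.00, 0.30)  # Sonnet 5 input at the launch rate

print(f"old ${old:.2f}  new ${new:.2f}  promo ${promo:.2f}")
```

At list price the same text costs about 30% more than on Sonnet 4.6. During the launch window the lower rate more than offsets the extra tokens ($0.26 against $0.30 for this input).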

26 June 2026 New Preview

GPT-5.6 lands in preview — Sol, Terra and Luna, gated to a handful of partners

OpenAI unveiled the GPT-5.6 family on 26 June — three models named Sol, Terra and Luna, a new naming scheme that replaces the old mini/nano tiers with capability tiers inside one generation. Access is the story in itself: it's a limited preview reachable only through an OpenAI account rep, after OpenAI shared the models and its release plan with the US government first. General availability is promised "in the coming weeks."

Verified list pricing, per million tokens: Luna $1/$6, Terra $2.50/$15, Sol $5/$30 — Sol matches GPT-5.5's flagship rate, and Terra lands exactly on today's GPT-5.4 mid-tier. Because you can't yet buy these off the shelf, TokenScale keeps GPT-5.5 and GPT-5.4 as the live Lite/Mid/Pro tiers and lists Sol, Terra and Luna in OpenAI's "all models" view, marked preview. We'll promote them to the headline slots the moment they're generally available — the same way we handled Claude Fable 5's brief appearance rather than rewriting the board around a model most people can't reach.

21 June 2026 Update

We turned on anonymous engagement measurement — here's exactly what it does

TokenScale now records a small set of anonymous, cookieless engagement events — things like "the page was scrolled past the pricing", "the calculator was used", or "a provider was viewed" — so we can see which parts actually help and build the right things next. In the spirit of the site, we're spelling out precisely what this is and isn't.

What we don't do: no cookies, no IP address stored, no fingerprinting, and the text you paste into the calculator is never sent — it stays in your browser, exactly as before. To follow a single visit's flow we tag its events with a random ID that lives only in your browser's memory, is recreated on every page load, and vanishes when you close the tab; it can't be linked across visits or back to you. We also honour "Do Not Track" and Global Privacy Control — switch either on and we collect nothing at all.

The full detail is on the privacy page →

16 June 2026 New Price Change

GLM-5.2 takes a flagship tier for open-weight money

Zhipu launched GLM-5.2 on 16 June — the first MIT-licensed, 1M-context model to hold a flagship tier. TokenScale's Zhipu GLM flagship slot moves to $1.40/M in, $4.40/M out, against GPT-5.5's $5/$30 at the same context. The +43% over GLM-5.1 is a generational step up, not a repricing.

+43% Flagship in & out · $0.98→$1.40 / $3.08→$4.40

$1.40/M input · vs GPT-5.5 $5/M

MIT licensed · 1M context

13 June 2026 Price Change

A five-tier reshuffle in one night

13 June re-priced five tiers at once, in both directions. Cerebras jumped +299% while AWS Bedrock cut hosted Llama 3 405B output from $16 to $2.40 — the night's biggest drop, the opposite direction on the same run. Mistral, Qwen and an OpenRouter route moved too. A clean illustration of how fast the floor shifts.

5 tiers re-priced in one night

+299% Cerebras

−85% AWS Llama 3 405B out · $16→$2.40

12 June 2026 Correction

Correction: Anthropic's Pro tier is back to Opus 4.8 ($5/$25)

The 10 June note below recorded Anthropic's Pro slot moving to Fable 5 at $10/$50. That didn't hold. Between 10–12 June the Pro tier round-tripped — $5→$10→$5 in, $25→$50→$25 out — settling back at Opus 4.8 ($5/$25). TokenScale tracks Opus 4.8 as Anthropic's flagship Pro tier; Fable 5 lives in the all-models view, not the headline slot. We're leaving the 10 June entry up as a record of the round-trip rather than rewriting it.

round-trip $5→$10→$5 in · $25→$50→$25 out

$5/$25 Opus 4.8 · Pro tier now

11 June 2026 Price Change

DeepSeek V4 Pro: another 75% cut

DeepSeek cut its Mid/Pro tier (V4 Pro) by 75% on 11 June — $1.74→$0.435 in, $3.48→$0.87 out. The relentless undercutting that's kept the Novel Index floor near half a cent.

−75% Mid/Pro · V4 Pro

$0.435/M input · was $1.74

$0.87/M output · was $3.48

10 June 2026 New Price Change

Claude Fable 5 lands: Anthropic's Pro slot just doubled

Anthropic launched Claude Fable 5 on 9 June, its first generally available Mythos-class model: a tier above Opus. TokenScale's Anthropic Pro slot now maps to Fable 5 (high) at $10/M in, $50/M out. That makes it the most expensive model on the board, at exactly 2× Opus 4.8. Reading The Hobbit now costs $1.27 on Anthropic's top tier, up from $0.63.

+100% Pro tier in & out · $5→$10 / $25→$50

$1.27 The Hobbit, input · was $0.63

$50/M output · priciest rate tracked

Opus 4.8 isn't gone: it stays in the all-models view alongside Haiku, Sonnet and Fable. Batch pricing halves Fable's rates to $5/$25, and cached reads drop input to $1/M. Worth knowing before you point an agent at a novel.
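For a sense of what those modes do to a novel-sized input, here is a rough comparison. It assumes about 127K input tokens for The Hobbit (the count implied by $1.27 at $10/M) and, for the cached case, that the entire input is a cache read, which is a simplification.

```python
# Fable 5 input rates in $ per million tokens, as quoted above.
INPUT_RATES = {"standard": 10.0, "batch": 5.0, "cached_read": 1.0}

HOBBIT_TOKENS = 127_000  # assumption: implied by $1.27 at $10/M

costs = {
    mode: HOBBIT_TOKENS * rate / 1_000_000
    for mode, rate in INPUT_RATES.items()
}

for mode, cost in costs.items():
    print(f"{mode:12s} ${cost:.3f}")
```

Under these assumptions the same read runs from $1.27 at the standard rate down to about 13 cents when served entirely from cache.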

2 June 2026 New

Five new labs — TokenScale now tracks 21 providers

DeepSeek used to be the only Chinese lab on the board. That stopped making sense. Added four open-weight frontier labs — Alibaba Qwen, Zhipu GLM, Moonshot Kimi and MiniMax — plus Meta's own first-party Llama API, so you no longer have to price Llama through a reseller.

21 providers tracked · was 16

63 price points verified nightly

5 new labs added

Four of the five are open-weight and price like it — often an order of magnitude below a US frontier flagship. A handful of top-tier prices that aren't public yet are careful estimates, marked for correction. The full story is in the journal →

31 May 2026 New

Price Moves — every change, now shareable

The month's biggest swings now live on their own page — and each move exports as a branded card you can drop straight into a thread. A percentage tells a developer something; a whole novel going from $1.91 to $0.32 tells everyone.

−83% Grok mid · biggest cut

+275% OpenAI Pro · biggest rise

4¢ a whole novel on DeepSeek V4

Each card is drawn on your own device — no server, no tracking, pure static HTML and a little canvas. See all seven moves →

20 May 2026 Price Change

Gemini Flash 3.5 quietly got 5× more expensive

Google updated pricing on their mid-tier Gemini model. Silver (Flash-Lite) stayed the same. Gold (Flash 3.5) did not.

+400% Flash 3.5 input

+260% Flash 3.5 output

—Flash-Lite (unchanged)

Rate changes per million tokens

Model	Input before	Input after	Change
Flash-Lite 2.5 · Silver	$0.10	$0.10	—
Flash 3.5 · Gold	$0.30	$1.50	+400% ↑
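A percentage increase and a multiple are easy to mix up: a rise of +N% means the new price is (1 + N/100) times the old one. A short sketch using the Flash 3.5 input rates from the table above; the function names are illustrative.

```python
def pct_change(before: float, after: float) -> float:
    """Percentage change from `before` to `after` (positive means an increase)."""
    return (after - before) / before * 100

def multiple(before: float, after: float) -> float:
    """How many times the old price the new price is."""
    return after / before

before, after = 0.30, 1.50  # Flash 3.5 input, $ per million tokens

print(f"{pct_change(before, after):+.0f}%")
print(f"{multiple(before, after):.1f}x")
```

So +400% and "5× the old price" describe the same move, while a price that merely doubles is +100%.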

🧙 The Hobbit — 95,356 words — what it now costs to process

Model	Before	After	Difference
Flash-Lite 2.5 · Silver	$0.06	$0.06	—
Flash 3.5 · Gold	$0.35	$1.33	+$0.98 · +280% ↑
Gap between tiers	$0.29	$1.27	4.4× wider

The irony

TokenScale launched saying "The Hobbit = $0.06 on Gemini Flash-Lite." That's still true. But the next model up, Flash 3.5, had its Hobbit cost jump from $0.35 to $1.33 almost immediately after launch. The gap between Silver and Gold went from $0.29 to $1.27 overnight. If you were building on Flash 3.5, your bill just rose 280% (to 3.8× what it was) with no warning. This is exactly what TokenScale exists to catch.

What this means practically

If you're sending novel-length context (100K tokens) to Gemini, Flash-Lite is still the smart choice: unchanged, and still the cheapest full-context model in the comparison. Flash 3.5 now costs about 22× as much for the same novel ($1.33 against $0.06). The tier gap matters more than ever.

20 May 2026 v2.2 · Launch Day

We caught a pricing error on our own launch morning

Hours before posting to Hacker News, we found that our hero number was quoting input cost only. Here's what we fixed — and why it made for a better story.

The correction

What we said	What it actually was	Corrected to
The Hobbit on Gemini Flash	$0.04 (input only)	$0.06 (total)
Input cost	$0.01	$0.01
Output cost	missing	$0.05
Correct total	$0.04 ✗	$0.06 ✓

The lesson

The very thing TokenScale is built to prevent — quoting only input cost and missing output — was in our own marketing copy. Caught and corrected before the HN post went live. The distinction between input and output pricing is more important than it looks. Output tokens are usually 3–5× more expensive per token, and most real conversations generate far more output than people expect.

Fixed Silver tier pill visibility in dark modes (charcoal / midnight / pitch)
Fixed Silver tier hero slider right-track opacity
Added Hobbit word count verification: ✓ 95,356 words · Project Gutenberg
Added llms.txt for AI model discoverability
Added robots.txt explicitly welcoming all major AI crawlers
Updated Product Hunt description + gallery before launch traffic arrived

19 May 2026 v2.1 · Pre-Launch Fix

Quiz button crash — caught one day before launch

The "Which model should I use?" quiz was silently failing on first run. A missing DOM element meant the result screen crashed before anyone could see a recommendation.

Fixed hmcProviderLink — element referenced in JS but missing from HTML
Added provider name to CTA: now reads "See Mistral on TokenScale →"
Fixed blank-screen flash when quiz recommends Anthropic (the default provider)
Added 500ms fallback in goToDashboard() so cleanup always runs

Why this matters

The quiz is the main personalisation hook — it routes new visitors to the right provider before they see pricing. A silent crash on the most common recommendation (Anthropic) would have cost us conversions on HN launch day without us ever knowing.

18 May 2026 v2.0 · Launch

TokenScale ships — 16 providers, one page

The first public version. A single HTML file, no backend, no sign-up. Pricing for 16 AI providers expressed in content you recognise.

Pricing spread at launch — mid-tier input cost per million tokens

Provider	Model	$/M input	Hobbit cost
DeepSeek	V4 Flash	$0.14	$0.02
Groq	Llama 3.1 8B	$0.05	$0.01
Gemini	Flash-Lite 2.5	$0.10	$0.06
Mistral	Small	$0.10	$0.01
Anthropic	Claude Haiku 4.5	$1.00	$0.13
OpenAI	GPT-5.4	$2.50	$0.32
OpenAI	GPT-5.5	$5.00	$0.63
Spread cheapest → most expensive		35× apart

The insight that launched this

"$5 per million tokens" tells you nothing. "The Hobbit costs $0.06 on Gemini Flash-Lite, and $0.63 on GPT-5.5" tells you everything. TokenScale was built to make that translation automatic — for any content size, across all 22 providers, verified nightly.