Twice the Price, 5.7% More Intelligence

$10 per million input tokens. $50 per million output tokens. That is what Anthropic charges for Claude Fable 5, its most powerful public model — exactly double Opus 4.8 and more than three times Sonnet 4.6.

For that 2× premium, independent benchmarking says you get 5.7% more measured intelligence. And for the productivity gain every launch deck implies — the reason a buyer would pay top-of-lineup rates — the entire public evidence base is a single, unaudited customer story.

This is not a hit piece. Fable 5 is, on the aggregators, the highest-scoring model Anthropic has ever shipped publicly. The problem is the gap between what is documented (the price) and what is demonstrated (the productivity). One is airtight and primary-sourced. The other is a testimonial. If you are signing a 2026 AI budget on the strength of “Fable 5 makes our people X% more productive,” you are extrapolating from data that does not yet exist.

Executive Summary

Metric	Value
Fable 5 list price (input / output per 1M tokens)	$10 / $50
Price vs Opus 4.8 ($5 / $25)	2.0×
Price vs Sonnet 4.6 (~$3 / $15)	~3.3×
Position in active Claude lineup	Most expensive
Blended rate (7:2:1 cache-hit : input : output)	$7.70 / 1M
Intelligence Index vs Opus 4.8	64.9 vs 61.4 · +5.7%
GDPval-AA knowledge-work Elo vs Opus 4.8	1,932 vs 1,890 · +2.2%
Cost of one full Intelligence Index run	~$9,940 vs ~$4,970
Controlled human-baseline productivity studies	0
Human-baseline enterprise proof points	1 (unaudited)
Launch → general availability	2026-06-09 → ~2026-07-01

The economics are fully documented. The productivity story is not. Everything below separates the two — and flags exactly which claims survived source verification and which were killed.

MLOps for Decision Makers: The First Plain-Language Guide to Machine Learning Operations — Why 95% of AI Projects Fail and How to Make Sure Yours Doesn't (The AI Literacy Series Book 4)

As an affiliate, we earn on qualifying purchases.

01The Economics Are Airtight — and Not Flattering

Start with what is beyond dispute. Fable 5’s pricing is confirmed by Anthropic’s own launch post, its platform pricing docs, and the Fable product page — then corroborated by Forbes, TechCrunch, Vellum, Finout, and Artificial Analysis.

Model	Input / 1M	Output / 1M	vs Fable 5
Claude Fable 5	$10	$50	—
Claude Opus 4.8	$5	$25	2.0× cheaper
Claude Sonnet 4.6	~$3	~$15	~3.3× cheaper
(retired) Opus 4 / 4.1	$15	$75	only Claude ever priced higher

Fable 5 is the most expensive model in Anthropic’s active lineup. On sticker, its input price is roughly 2× GPT-5.5’s — though that specific comparison rests on a single secondary source, so treat it as directional, not gospel.

The effective rate is softer than the headline. Fable 5 carries the standard 90% prompt-caching discount (cache reads at $1/1M), a batch tier at $5/$25, and a blended rate of $7.70 per 1M tokens at a realistic 7:2:1 cache-hit-to-input-to-output ratio. US-only inference is available at a 1.1× multiplier ($11/$55) for data-residency needs. The math checks out: (7 × $1.00 + 2 × $10.00 + 1 × $50.00) / 10 = $7.70.

One clarification that trips people up: Fable 5 and Mythos 5 are the same underlying weights, priced identically, differing only in safeguards. Reports that “lump them together” on price are correct to do so. That is not a conflation error — it is the actual product structure.

Everything about the price is knowable to the decimal. That is precisely what makes the missing productivity data so conspicuous.

AI Engineering: Building Applications with Foundation Models

As an affiliate, we earn on qualifying purchases.

02The Price-Performance Verdict

Here is the number that matters most for a budget owner. Artificial Analysis benchmarked Fable 5 against Opus 4.8 — the model at exactly half the price — and the delta is thin.

Benchmark	Fable 5	Opus 4.8	Delta
Intelligence Index	64.9	61.4	+5.7%
GDPval-AA (real-world knowledge-work Elo)	1,932	1,890	+2.2%
Cost of a full Index run	~$9,940	~$4,970	2.0× more

The Decoder’s framing — “twice as much for 5.7 percent more performance” — is not vendor spin. It is derived straight from third-party primary data. As an aggregate price-performance signal, Fable 5 is weak.

But precision cuts both ways, and honesty requires the counterweight. Expressing a 42-point Elo gap as “2.2%” understates it: on a logarithmic rating, a 42-point lead is roughly a 56% head-to-head win rate, and both Anthropic and Artificial Analysis frame the same result as a #1 launch. Fable 5 also sets records in 5 of 10 sub-benchmarks, where the per-task gains run larger than the blended composite. “Marginal on average” is true. “Marginal on your hardest task” may not be.

The right question is not “is Fable 5 better?” — it’s “is your workload one of the five sub-benchmarks where the premium shows up, or one of the other five where it doesn’t?”

AI Systems Performance Engineering: Optimizing Model Training and Inference Workloads with GPUs, CUDA, and PyTorch

As an affiliate, we earn on qualifying purchases.

03The Productivity Evidence Gap

Now the part the launch narrative glosses over. The question was specific: enterprise knowledge-work output gains, measured against a human / no-AI baseline. After five search angles and adversarial verification of 25 claims, here is the complete surviving evidence.

The entire human-baseline case is one testimonial

Anthropic’s launch post states it verbatim: in a 50-million-line Ruby codebase, Fable 5 performed a codebase-wide migration in a day that “would otherwise have taken a whole team over two months by hand,” credited to Stripe. Forbes and Finout repeat it.

That claim survived verification as a claim that was made. What did not survive were the stronger framings of it. Versions calling the Stripe result a “concrete” or “independently established human-work baseline” were voted down 0–3 by the verifiers. It is a launch-post customer signal with no methodology, no audit, and no independent replication — a vendor/customer anecdote, not a measurement.

What does not exist

What you’d want	What the sources actually contain
A controlled knowledge-work productivity study	None for Fable 5.
Independent, audited output gains vs. humans	None. One unaudited anecdote.
Cost-per-task (not per-token) vs. a human	Not established by any surviving source.
Throughput / latency vs. a human baseline	Not established. Only model-vs-model token pricing survived.
Fable 5 in the leading academic AI-productivity review	Absent — not mentioned at all.

The leading benchmark aggregator’s Anthropic page carries only intelligence, pricing, speed, and latency — zero productivity or business-impact metrics. And the most-cited empirical review of AI and labor productivity attributes its headline figures — ~40% faster task completion, 55.8% faster coding — to ChatGPT, GPT-4, and GitHub Copilot, and never once mentions Fable 5 or any Claude model.

The most expensive Claude model ever shipped publicly arrives with rigorous pricing and almost no rigorous productivity evidence. Its headline enterprise proof point is a single un-audited migration story.

Machine Learning: Architecture in the age of Artificial Intelligence

As an affiliate, we earn on qualifying purchases.

04The Skeptic’s Ledger: Claims That Did Not Survive

Data-first means reporting what failed verification as loudly as what passed. These circulate; do not repeat them as fact.

Claim in circulation	Verdict
The Stripe migration is a “concrete / independently established human-work baseline”	Refuted (0–3). Unaudited anecdote.
Fable 5 finished a frontier physics task in 36 hours using one-third of GPT-5.5’s reasoning tokens	Refuted (0–3). No surviving support.
Model trackers show no “Fable 5” / it isn’t tracked	Refuted (0–3). It is tracked.
Finout’s “$250k/year vs $125k/year” cost illustration	Do not quote. ~500× arithmetic error.

That last one deserves a note, because the directional advice around it is sound even though the number is broken: one secondary analysis recommends reserving Fable 5 for complex, multi-step reasoning and using the cheapest model that clears your quality bar (Sonnet 4.6 for high-volume classification, summarization, RAG). Good instinct — just don’t cite the dollar figures attached to it.

05The Procurement Playbook

If you own an AI budget, the verified data supports four concrete moves this quarter.

Do this quarter

Default down, escalate up. Route high-volume, well-defined work (classification, summarization, RAG, extraction) to Sonnet 4.6 or Opus 4.8. Reserve Fable 5 for the narrow band of complex reasoning where the sub-benchmark records appear. Paying $50/1M output for a summarization pipeline is a self-inflicted wound.
Demand cost-per-task, not price-per-token. The sticker price is knowable; your real cost is not — it depends on tokens-per-task and turns-to-completion, neither published. Instrument it before you standardize.
Treat the Stripe number as a hypothesis, not a benchmark. Run your own before/after on one real workflow with a human baseline. Two weeks of that beats the entire public record.
Re-check the numbers before you commit. Fable 5 launched June 9, hit GA around today, and went through a brief suspension and redeployment in between. Pricing and benchmark positions may still be settling.

The Bottom Line

The AI market keeps making the same category error: mistaking a documented capability for a demonstrated outcome. Fable 5 is the cleanest example yet. Its price is engineered to four significant figures. Its productivity impact is a story someone told at a launch.

That does not make Fable 5 a bad model — it makes it an unproven one on the single axis buyers care about most. The 5.7% aggregate gain over a model at half the price is real. So are the sub-benchmark records. So is the total absence of controlled, human-baselined productivity evidence. All three are true at once, and any vendor pitch that mentions only the first two is selling you the price of the ticket as if it were the destination.

Buy Fable 5 for the tasks where the frontier gains show up. Do not buy the productivity narrative until someone measures it.

The core message

The price of Fable 5 is documented to the decimal. Its productivity is documented to a single anecdote. Until that asymmetry closes, “Fable 5 makes us more productive” is a forecast, not a finding — and forecasts don’t belong in a signed budget.

Anthropic priced Fable 5 like a frontier model and proved it like a press release. Make it earn the premium on your data, not theirs.

About the Author

Thorsten Meyer is a Munich-based futurist, post-labor economist, and recipient of OpenAI’s 10 Billion Token Award. He spent two decades managing €1B+ portfolios in enterprise ICT before deciding that writing about the transition was more useful than managing quarterly slides through it. He has read more model pricing pages than any well-adjusted person should. More at ThorstenMeyerAI.com.

Sources

Anthropic, Introducing Claude Fable 5 and Claude Mythos 5 (2026-06-09) — anthropic.com/news/claude-fable-5-mythos-5
Anthropic, Claude Fable 5 — product page — anthropic.com/claude/fable
Artificial Analysis, Claude Fable 5 — model page — artificialanalysis.ai/models/claude-fable-5
Artificial Analysis, Anthropic — provider page — artificialanalysis.ai/providers/anthropic
The Decoder, Claude Fable 5 costs twice as much for 5.7 percent more performance — the-decoder.com
Forbes (Ron Schmelzer), Anthropic’s Fable 5 AI Model Cost (2026-06-10) — forbes.com
TechCrunch, Anthropic released Claude Fable 5, its most powerful model publicly (2026-06-09) — techcrunch.com
Vellum, Claude Fable 5 and Mythos 5 benchmarks explained — vellum.ai
Finout, Claude Fable 5 / Mythos 5 pricing & benchmarks — finout.io
Latent.Space, Anthropic Claude Fable 5 & Mythos — latent.space
Law & Economics Center, AI, Productivity, and Labor Markets: A Review of the Empirical Evidence — laweconcenter.org

Research method: 5-angle web sweep, 17 sources fetched, 77 claims extracted, 25 verified under 3-vote adversarial checking (20 confirmed, 5 refuted). Claims that failed verification are reported in Section 4 rather than suppressed.

Twice the Price, 5.7% More Intelligence

Up next

The Delegation Ladder: The Four Agentic Loops, and What Each One Lets You Stop Doing

Author

Thorsten Meyer

Share article

Twice the Price, 5.7% More Intelligence

Executive Summary

MLOps for Decision Makers: The First Plain-Language Guide to Machine Learning Operations — Why 95% of AI Projects Fail and How to Make Sure Yours Doesn't (The AI Literacy Series Book 4)

01The Economics Are Airtight — and Not Flattering

AI Engineering: Building Applications with Foundation Models

02The Price-Performance Verdict

AI Systems Performance Engineering: Optimizing Model Training and Inference Workloads with GPUs, CUDA, and PyTorch

03The Productivity Evidence Gap

The entire human-baseline case is one testimonial

What does not exist

Machine Learning: Architecture in the age of Artificial Intelligence

04The Skeptic’s Ledger: Claims That Did Not Survive

05The Procurement Playbook

The Bottom Line

About the Author

Sources

IdeaClyst: The Validation Council

Terafab: Deep Analysis

OpenAI–Broadcom AI Chip Partnership: Context and Implications

Strategic Risk Assessment: The Transition to Comprehension Lock-In and Agentic Context Platforms

The Delegation Ladder: The Four Agentic Loops, and What Each One Lets You Stop Doing

A Skill Is a Folder, Not a Prompt: What Anthropic Learned Running Hundreds of Them

Kill-Switch-Proof: How to Build So Washington Can’t Take Your AI Stack Down

Fable 5 Is Back. GPT-5.6 Is Next. And Anthropic Reportedly Already Has Something Stronger.

Twice the Price, 5.7% More Intelligence

Up next

Author

Thorsten Meyer

Share article

Executive Summary

MLOps for Decision Makers: The First Plain-Language Guide to Machine Learning Operations — Why 95% of AI Projects Fail and How to Make Sure Yours Doesn't (The AI Literacy Series Book 4)

01The Economics Are Airtight — and Not Flattering

AI Engineering: Building Applications with Foundation Models

02The Price-Performance Verdict

AI Systems Performance Engineering: Optimizing Model Training and Inference Workloads with GPUs, CUDA, and PyTorch

03The Productivity Evidence Gap

The entire human-baseline case is one testimonial

What does not exist

Machine Learning: Architecture in the age of Artificial Intelligence

04The Skeptic’s Ledger: Claims That Did Not Survive

05The Procurement Playbook

The Bottom Line

About the Author

Sources

You May Also Like