Seismic Impact (30%)
8.0/10
How newsworthy is this in AI?
Ecosystem Relevance (70%)
9.0/10
How useful for your apps?
Claude: Speed up responses with fast mode
/fast in Claude Code... but at a cost that's 6x the normal price.
Opus is usually $5/million input and $25/million output. The new fast mode is $30/million input and $150/million output!
There's a 50% discount until the end of February 16th, so only a 3x multiple (!) before then.
How much faster is it? The linked documentation doesn't say, but on Twitter Claude say:
Our teams have been building with a 2.5x-faster version of Claude Opus 4.6.
We’re now making it available as an early experiment via Claude Code and our API.
Claude Opus 4.5 had a context limit of 200,000 tokens. 4.6 has an option to increase that to 1,000,000 at 2x the input price ($10/m) and 1.5x the output price ($37.50/m) once your input exceeds 200,000 tokens. These multiples hold for fast mode too, so after Feb 16th you'll be able to pay a hefty $60/m input and $225/m output for Anthropic's fastest best model.
Tags: ai, generative-ai, llms, anthropic, claude, llm-pricing, claude-code
The Claude fast mode could significantly optimize the orchestrator's agent interactions, especially for resource-intensive tasks like code generation, test automation, and procedural game logic generation in apps like territory_game or soccer_elo prediction models. By leveraging the 2.5x speed increase and expanded context window, Zac's ecosystem could implement more complex, real-time AI-driven workflows with lower latency and higher token capacity, potentially reducing overall computational overhead and response times across the suite of Rails applications. Rationale: