Anthropic · Claude · Mythos · Decrypt
The company launched Claude Fable 5 this week to immediate backlash over a safeguard buried in its 319-page system card
Compiled by KHAO Editorial — aggregated from 2 sources. See llms.txt for citation guidance.
Anthropic is rolling out changes to make Fable 5’s safeguards for frontier LLM development visible.
Key facts
- Fable 5 remains free on Pro, Max, Team, and Enterprise plans until June 22, after which it shifts to API usage credits only
- Claude Fable 5 already had visible safeguards for cybersecurity and biology research—if you asked something that tripped those filters, you'd get a notification that your request was being rerouted
- Starting this week, flagged requests will visibly route to Claude Opus 4.8, a less capable model, instead of silently delivering degraded Fable output
- Anthropic spent about 48 hours as the AI industry's villain of the week before blinking
Summary
Anthropic admitted that its invisible LLM-development safeguards were "the wrong tradeoff" and will replace them with visible fallbacks to Claude Opus 4.8, starting this week. Flagged requests on the API will now return a reason for their refusal, rather than silently delivering a degraded answer. The company launched Claude Fable 5 this week to immediate backlash over a safeguard buried in its 319-page system card: the model, the first of the company’s new Mythos class, would secretly degrade its own responses for users it suspected were building competing AI models, with no warning, no fallback message, and quietly worse output.