Anthropic launched Claude Fable 5 this week to immediate backlash over a safeguard buried in its 319-page system card

Thu, Jun 11 · 6:55 PM UTC

Compiled by KHAO Editorial, aggregated from 2 sources.

Anthropic is rolling out changes to make Fable 5’s safeguards for frontier LLM development visible.

Key facts

Fable 5 remains free on Pro, Max, Team, and Enterprise plans until June 22, after which it shifts to API usage credits only
Claude Fable 5 already had visible safeguards for cybersecurity and biology research—if you asked something that tripped those filters, you'd get a notification that your request was being rerouted
Starting this week, flagged requests will visibly route to Claude Opus 4.8, a less capable model, instead of silently delivering degraded Fable output
Anthropic spent about 48 hours as the AI industry's villain of the week before blinking

Summary

Anthropic admitted its invisible LLM-development safeguards were "the wrong tradeoff" and, starting this week, will replace them with visible fallbacks to Claude Opus 4.8. Flagged requests on the API will now return a reason for their refusal rather than silently delivering a degraded answer. Anthropic spent about 48 hours as the AI industry's villain of the week before blinking. The company launched Claude Fable 5 this week to immediate backlash over a safeguard buried in its 319-page system card: the model, the first of the company’s new Mythos class, would secretly degrade its own responses for users it suspected were building competing AI models, with no warning and no fallback message, only quietly worse output.

#Anthropic #Claude #Mythos