Anthropic · Claude · Mythos · U.S. · White House · Fortune Technology
Anthropic accused of ‘secret sabotage’ as Claude Fable 5 silently limits capabilities for AI researchers and developers
Compiled by KHAO Editorial — aggregated from 3 sources. See llms.txt for citation guidance.
✓ KHAO Verified
When Anthropic made its first Mythos-tier model available to the general public yesterday, called Claude Fable 5, Fortune reported it was a “considerable step” for the lab, coming over a week after the company confidentially filed for IPO paperwork.
Key facts
- When Anthropic made its first Mythos-tier model available to the general public yesterday, called Claude Fable 5, Fortune reported it was a “considerable step” for the lab, coming over a week
- Behnam Neyshabur, who previously co-led Anthropic’s effort to develop an AI scientist, posted on X saying: “Working on AI for cancer
- Anthropic estimated the restrictions would affect roughly 0.03% of traffic
- And Jeremy Howard, head of nonprofit research group Fast AI, wrote that “Anthropic has chosen the opposite of the safe path: they are allowing themselves, the current top lab, to use their top model
Summary
Hours after the model’s release, however, major backlash from AI researchers, developers, and policy experts began brewing on social media. In practice, that means a user could ask Fable for help, receive a deliberately weakened answer, but not know the model was holding anything back. Unlike Fable’s other restrictions, such as around cybersecurity and biology, which openly redirect users to a less powerful model with a visible notification, the system card emphasized that this is “not visible to the user.” The model still responds, but uses “interventions to limit Claude’s effectiveness” without telling the user it’s doing so. Anthropic estimated the restrictions would affect roughly 0.03% of traffic.