Anthropic · Claude · Mythos · U.S. · White House · Fortune Technology

Anthropic accused of ‘secret sabotage’ as Claude Fable 5 silently limits capabilities for AI researchers and developers

Wed, Jun 10 · 5:43 PM UTC 2 min read

Compiled by KHAO Editorial — aggregated from 3 sources. See llms.txt for citation guidance.

✓ KHAO Verified

When Anthropic made its first Mythos-tier model available to the general public yesterday, called Claude Fable 5, Fortune reported it was a “considerable step” for the lab, coming over a week after the company confidentially filed for IPO paperwork.

Key facts

When Anthropic made its first Mythos-tier model available to the general public yesterday, called Claude Fable 5, Fortune reported it was a “considerable step” for the lab, coming over a week
Behnam Neyshabur, who previously co-led Anthropic’s effort to develop an AI scientist, posted on X saying: “Working on AI for cancer
Anthropic estimated the restrictions would affect roughly 0.03% of traffic
And Jeremy Howard, head of nonprofit research group Fast AI, wrote that “Anthropic has chosen the opposite of the safe path: they are allowing themselves, the current top lab, to use their top model

Summary

Hours after the model’s release, however, major backlash from AI researchers, developers, and policy experts began brewing on social media. In practice, that means a user could ask Fable for help, receive a deliberately weakened answer, but not know the model was holding anything back. Unlike Fable’s other restrictions, such as around cybersecurity and biology, which openly redirect users to a less powerful model with a visible notification, the system card emphasized that this is “not visible to the user.” The model still responds, but uses “interventions to limit Claude’s effectiveness” without telling the user it’s doing so. Anthropic estimated the restrictions would affect roughly 0.03% of traffic.

#Anthropic #Claude #Mythos #U.S. #White House

Full coverage

All sources for this story are listed below — KHAO's direct ingest only; no additional coverage was discovered via Google News.

Tier 1 — direct ingest

Fortune Technology Jun 10 · 17:43 UTC

TechCrunch AI Jun 10 · 15:41 UTC

Anthropic released its latest model Fable on Tuesday, billing it as a public and limited version of its powerful and much-hyped cybersecurity model Mythos.

Cointelegraph Jun 10 · 4:04 UTC

Venture capitalist Simon Dedic said Anthropic’s latest AI models drop the cost and skill needed to find crypto exploits to “zero.”