Anthropic · Claude · Mythos · Ars Technica

Anthropic confirms these topics are too dangerous to let its Fable 5 model talk

Tue, Jun 9 · 7:20 PM UTC 2 min read

Compiled by KHAO Editorial — aggregated from 7 sources. See llms.txt for citation guidance.

✓ KHAO Verified

Among the many claimed benchmark improvements for Fable 5, the one related to cybersecurity was a particularly large jump. Credit: Anthropic.

Anthropic Tuesday publicly released Claude Fable 5, its first “Mythos-class” model that it says surpasses its previous frontier Opus models in overall capabilities.

Key facts

API and Enterprise users will be able to access the Fable 5 model at a cost of $10-per-million input tokens and $50-per-million output tokens starting today
Anthropic says Fable 5 operates on the “same underlying model” as Mythos 5, which is coming out of its monthslong “Mythos Preview” period today, but only for “a small group of cyberdefenders” judged
Anthropic Tuesday publicly released Claude Fable 5, its first “Mythos-class” model that it says surpasses its previous frontier Opus models in overall capabilities
In over 1,000 hours of red-team testing with a bug bounty program, Anthropic says external teams failed to find any universal jailbreaks for Fable 5

Summary

Anthropic says Fable 5 operates on the “same underlying model” as Mythos 5, which is coming out of its monthslong “Mythos Preview” period today, but only for “a small group of cyberdefenders” judged trustworthy through the existing Project Glasswing. Anthropic said it has tuned these safeguards to be “stricter than ideal,” meaning the system may occasionally refuse “harmless requests” in a way that it acknowledges may be frustrating for regular users. Fable 5’s topic-based safeguards are built around a system of classifiers designed to broadly detect banned prompt subjects as well as any potential jailbreak attempts. The company said it is particularly worried about Mythos 5’s ability to perform “agentic hacking,” executing multi-part cyberattacks with much more facility than earlier models.

#Anthropic #Claude #Mythos

Full coverage

All sources for this story are listed below — KHAO's direct ingest only; no additional coverage was discovered via Google News.

Tier 1 — direct ingest

Ars Technica Jun 9 · 19:20 UTC

Anthropic Tuesday publicly released Claude Fable 5, its first “Mythos-class” model that it says surpasses its previous frontier Opus models in overall capabilities.

Decrypt Jun 9 · 18:38 UTC

At the same time, critics, including OpenAI CEO Sam Altman, have accused the company of using “ fear-based marketing ” to sell its AI models.

TechCrunch AI Jun 9 · 17:00 UTC

Anthropic is bringing its most powerful AI model to the general public for the first time, but it’s doing it with guardrails.

TechCrunch AI Jun 9 · 20:37 UTC

Ethan Mollick, a notable AI researcher and University of Pennsylvania scholar, has been playing around with the model and seems to be having a lot of fun.

Engadget Jun 9 · 20:23 UTC

Claude subscribers can try the model until June 22 without spending usage credits.

The Register Jun 9 · 19:32 UTC

Company also changes data retention policy.

CNBC Technology Jun 9 · 18:44 UTC

Two months after Anthropic rolled out Mythos to a limited number of users, citing concerns about the artificial intelligence model's potential to do damage in the wrong hands, the company said it's ready to release an…