← Back to KHAO

Claude Code · Anthropic · Claude ·

Given this, Claude Code launched with the simplest possible defense: allow reads, require approval for write, bash

2 min read

Compiled by KHAO Editorial — aggregated from 1 source. See llms.txt for citation guidance.

◌ Single Source

When bounds can be placed on the relative damage of an autonomous agent—such as through control over its environment—high-utility capabilities can motivate deployment. Claude Mythos Preview is an example of a model whose blast radius was deemed too high to ship in April 2026. However, we expect broa.

However, as mentioned, approval fatigue showed up within weeks.

Key facts

Summary

Twelve months ago, they'd have rejected out of hand the idea of granting Claude access sufficient to take down an internal Anthropic service. Today that level of access is routine, and Anthropic developers are more productive for it. The first is to supervise the agent’s behavior via a human-in-the-loop. Claude Code previously protected against agents taking unintended actions by asking users for permission at each turn. The second approach to capping the blast radius—and the focus of much of this post—is containment. Over the past two years, they've shipped three primary agentic products: claude.ai, Claude Code, and Claude Cowork. User misuse: A user—either maliciously or through carelessness—directs the agent to do something harmful.

Read full article at anthropic.com →

#Claude Code #Anthropic #Claude