Prompt injection · Anthropic · OpenAI · Claude · Google · ChatGPT · Decrypt

What Is an AI Prompt Injection Attack

Sat, May 30 · 1:01 PM UTC 2 min read

Compiled by KHAO Editorial — aggregated from 1 source + 3 references discovered via search. See llms.txt for citation guidance.

★ Tier-1 Source

Forward this thread to.

Key facts

The term was coined on September 12, 2022, by British developer Simon Willison in a now-famous blog post
They scanned 2 to 3 billion crawled web pages per month and found a 32% jump in malicious indirect prompt injections between November 2025 and February 2026
Anthropic claims a Chinese group it designated GTG-1002 had used Claude Code, jailbroken via prompt injection, to attempt intrusions against roughly 30 targets including tech companies, financial
Anthropic estimates the AI executed 80% to 90% of the operation autonomously, making thousands of requests per second

Summary

The attack works by tricking a chatbot into following an attacker's instructions instead of yours. OpenAI publicly admitted in December 2025 that the problem is “unlikely to ever be fully solved,” and the U.K.'s National Cyber Security Centre issued a formal warning that LLMs are 'inherently confusable deputies.'. Imagine you ask your AI assistant to summarize an email. You never see the instructions. The Open Worldwide Application Security Project, the cybersecurity nonprofit behind the industry-standard vulnerability rankings, places prompt injection at number one on its top 10 list of threats for AI applications.

Read full article at Decrypt →

#Prompt injection #Anthropic #OpenAI #Claude #Google #ChatGPT

Full coverage

Other sources that covered this story, discovered via Google News. 3 unverified additional sources beyond our direct ingest — use these to verify claims, compare framings, or quote specific publications.

Tier 1 — direct ingest

Decrypt May 30 · 13:01 UTC

Forward this thread to." The AI does it. Imagine you ask your AI assistant to summarize an email.

Tier 3 — covered but not verified (3)

Geeky-gadgets.com May 29 · 16:48 UTC

Apple has introduced a new architecture aimed at addressing a long-standing challenge in AI systems that execute autonomous actions.

Bleepingcomputer.com May 29 · 16:48 UTC

02:21 PM - 1 Threat actors are abusing ChatGPT's content-sharing feature to display fake OpenAI outage pages that direct users to download malware disguised as the ChatGPT desktop application.

Thehackernews.com May 29 · 16:48 UTC

Cybersecurity researchers have disclosed details of a vulnerability in OpenAI ChatGPT that leverages the artificial intelligence (AI) assistant's implicit trust in Markdown links and images to trigger prompt injections…