Anthropic · OpenAI · Google · Mythos · GPT · AI Safety Institute · The Register
AI models are getting better at replacing cybersecurity pros on certain tasks
Compiled by KHAO Editorial — aggregated from 1 source. See llms.txt for citation guidance.
◌ Single Source
UK researchers find LLMs are learning to finish jobs faster and improving all the time.
Key facts
- In February 2026, AISI internally reduced the expected task time doubling period from 8 to 4.7 months, based on progress made since late 2024
- In February 2026, they estimated that frontier models' 80 percent-reliability cyber time horizon had doubled every 4.7 months since reasoning models emerged in late 2024, given a 2.5M token limit
- Their results imply a consistent doubling time of 4.2 months on software tasks since late 2024," AISI said, noting that with the latest Mythos Preview checkpoint (model update), it's closer to 4
- This was around half their November 2025 doubling time estimate, which was 8 months for both 50 percent and 80 percent reliability
Summary
The UK AI Security Institute (AISI) has found that frontier models are quickly becoming more efficient when asked to do some cybersecurity work. AISI measures this with its "time window benchmark for cybersecurity," which estimates how much work an AI can do compared to a human. AISI has found the human-comparable task time, 16 minutes in this instance, is growing, fast. In February 2026, AISI internally reduced the expected task time doubling period from 8 to 4.7 months, based on progress made since late 2024.