Business · Fortune Technology
For most workplace tasks, AI is good enough to pass but not good enough to impress, MIT finds
Compiled by KHAO Editorial — aggregated from 1 outlet. See llms.txt for citation guidance.
◌ Single Source
The growing share of American office workers who have experimented with artificial intelligence in their day-to-day work have likely had a few moments of doubt as to their long-term job stability.
Key facts
- As of late 2025, AI models scored a 7 in roughly 65% of tasks
- By 2029, the authors estimate, most AI models will be able to accomplish between 80% and 95% of text-based tasks at the minimally sufficient benchmark
- MIT researchers used 41 different LLMs—including versions of Claude, Gemini, and ChatGPT—to analyze performance in more than 11,000 primarily text-based tasks for various job roles listed
- Regardless of how much time an AI model had to complete a task, the probability of success when graded against a 9 or “superior” quality score never exceeded 50%
Summary
But for all the improvements in AI over the past few years, the technology is still only able to hit low bars in specific workplace tasks, according to recent data published by MIT. Workers concerned they might soon be replaced by AI will likely be reassured by new research coming out of MIT, which frames the AI-driven jobs takeover narrative not so much as a fast-paced action movie, but more like a slow-burn think piece. AI is gradually improving at accomplishing a variety of tasks across several professions, according to a study of preliminary findings released on Thursday. MIT researchers used 41 different LLMs—including versions of Claude, Gemini, and ChatGPT—to analyze performance in more than 11,000 primarily text-based tasks for various job roles listed by the Labor Department.