
The findings came as enterprises accelerated adoption of general-purpose AI systems and AI agents, often relying on benchmark results, vendor documentation, and limited pilot deployments to assess risk before wider rollout.
Capabilities improved rapidly, but unevenly
Since the previous edition of the report was published in January 2025, general-purpose AI capabilities continued to improve, particularly in mathematics, coding, and autonomous operation, the report said.
Under structured testing conditions, leading AI systems achieved “gold-medal performance on International Mathematical Olympiad questions.” In software development, AI agents became capable of completing tasks that would have taken a human programmer about 30 minutes, compared with under 10 minutes a year earlier.
