OpenAI's o3 Model Achieves Human-Level Performance on Key Intelligence Test
- OpenAI's o3 model scored 85% on the ARC-AGI benchmark, achieving human-level results on a test for general intelligence.
- The previous best score by an AI was 55%, well below o3's result.
- Many AI researchers now believe that artificial general intelligence is closer than previously thought.
- AGI could bring revolutionary economic impacts and accelerate further progress in machine intelligence.
24 Articles
An AI Just Reached Human Level On 'General Intelligence'. What That Means
A new artificial intelligence (AI) model has just achieved human-level results on a test designed to measure “general intelligence”. On December 20, OpenAI's o3 system scored 85% on the ARC-AGI benchmark, well above the previous AI best score of 55% and on par with the average human score. It also scored well on a very difficult mathematics test. Creating artificial general intelligence, or AGI, is the stated goal of all the major AI research la…
Superintelligence: Scientists aim to create AI that matches human thinking abilities
Artificial intelligence (AI) companies are aiming to give machines human-level intelligence. “It’s possible that in a few thousand days we’ll have superintelligence; it may take a little longer, but I’m confident we’ll get there,” Sam Altman, CEO of San Francisco-based technology company OpenAI, wrote in Nature on September 23.

OpenAI’s o3 shows remarkable progress on ARC-AGI, sparking debate on AI reasoning
o3 solved one of the most difficult AI challenges, scoring 75.7% on the ARC-AGI benchmark. But does it really mean we're closer to AGI?
Coverage Details
Bias Distribution
- 60% of the sources are Center