-1.3 C
Iceland

Not Just Regurgitation: AI Shows Real Reasoning in Prediction Contest

Date:

An AI system has delivered a powerful rebuttal to the claim that large language models (LLMs) just “regurgitate” their training data. By placing in the top ten of a fiercely competitive international forecasting contest, ManticAI’s system has shown it can perform “genuine reasoning” about future events, according to its creators.
The British startup’s AI came in eighth in the Metaculus Cup, a competition where entrants predicted the outcomes of 60 complex real-world events over the summer. The questions, which ranged from election results to the likelihood of a public feud between Trump and Musk, could not be answered by simply searching a database of past information.
Toby Shevlane, ManticAI’s co-founder and a former Google DeepMind researcher, said, “You can’t predict the future like that.” He explained that their system uses a variety of AI agents from different developers to tackle problems from multiple angles. Agents are assigned roles like historical researcher, scenario analyst, and current events monitor, with their combined insights forming the basis for the final prediction.
This sophisticated process allowed the AI to develop unique perspectives. Shevlane noted that the system often disagreed with the consensus view of human forecasters, who tend to cluster around an average. This suggests AI can act as a check against common human biases like herd behavior, providing a more objective, data-driven viewpoint.
While acknowledging AI’s rapid progress, experts point out that it can still struggle with logical consistency checks on highly complex, multi-stage forecasts. However, the performance of ManticAI has convinced many that the future of prediction is collaborative. The goal will be to merge the tireless, unbiased analytical power of AI with the nuanced judgment and intuition that human experts still possess.

Subscribe to our magazine

━ more like this

Elon Musk Tout’s Grok Success as UK Threatens to Ban X Over Non-Consensual AI Pornography

Elon Musk has claimed a victory for his Grok app, noting it is the most downloaded in the UK, even as the government threatens...

Billionaire Tech CEO Declares Physical AI Breakthrough at Major Tech Conference

At one of the technology industry's most prominent annual gatherings, Nvidia's founder delivered a bold proclamation about artificial intelligence reaching a new threshold. His...

DigitalBridge Becomes SoftBank Asset in $4B Deal Focused on AI Growth

In a $4 billion transaction, SoftBank Group is acquiring digital infrastructure investor DigitalBridge Group as part of its strategic focus on artificial intelligence infrastructure....

OpenAI Discontinues Voice Chat for Mac ChatGPT Users

A popular feature is being removed from ChatGPT's Mac application. Starting January 15, 2026, users will no longer have access to voice conversations through...

Massive AI School Initiative Raises Questions About Content Appropriateness

Elon Musk's artificial intelligence company revealed Thursday plans for one of the largest educational AI deployments ever attempted. The xAI partnership with El Salvador...