Not Just Regurgitation: AI Shows Real Reasoning in Prediction Contest

An AI system has delivered a powerful rebuttal to the claim that large language models (LLMs) just “regurgitate” their training data. By placing in the top ten of a fiercely competitive international forecasting contest, ManticAI’s system has shown it can perform “genuine reasoning” about future events, according to its creators.
The British startup’s AI came in eighth in the Metaculus Cup, a competition where entrants predicted the outcomes of 60 complex real-world events over the summer. The questions, which ranged from election results to the likelihood of a public feud between Trump and Musk, could not be answered by simply searching a database of past information.
Toby Shevlane, ManticAI’s co-founder and a former Google DeepMind researcher, said, “You can’t predict the future like that.” He explained that their system uses a variety of AI agents from different developers to tackle problems from multiple angles. Agents are assigned roles like historical researcher, scenario analyst, and current events monitor, with their combined insights forming the basis for the final prediction.
This process let the system arrive at views of its own. Shevlane noted that it often disagreed with the consensus of human forecasters, who tend to cluster around an average. That suggests AI can act as a check against common human biases such as herd behavior, providing a more objective, data-driven viewpoint.
While acknowledging AI’s rapid progress, experts point out that it can still struggle with logical consistency checks on highly complex, multi-stage forecasts. However, the performance of ManticAI has convinced many that the future of prediction is collaborative. The goal will be to merge the tireless, unbiased analytical power of AI with the nuanced judgment and intuition that human experts still possess.
