New Method DeepConf Boosts Math Reasoning in Language Models
A new method, DeepConf, enhances mathematical reasoning in language models without additional training. Developed by a team led by Fu et al., the approach promises improved efficiency and accuracy.
DeepConf works in two modes. In offline mode, it achieved 99.9% accuracy on AIME 2025 tasks using the gpt-oss-120B model. In online mode, it maintained 97.9% accuracy while cutting token consumption by up to 84.7% compared with standard majority voting. That efficiency matters as rising energy costs call the long-term viability of 'thinking' models into question.
The method filters out low-quality reasoning traces using the model's internal confidence signals, improving both efficiency and accuracy. An early-exit scheme truncates overthinking without compromising results. However, the approach can fail when the model is confidently wrong, which makes the conservative variant the safer choice.
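The exact confidence metric and thresholds are not spelled out here, so the sketch below only illustrates the general idea under stated assumptions: score each reasoning trace with an internal confidence signal, drop low-confidence traces before majority voting, and abort a trace early when its running confidence collapses. The `Trace` structure, the `step` callable, the smoothing rule, and all thresholds are illustrative assumptions, not the authors' implementation.

```python
from collections import Counter
from dataclasses import dataclass
from typing import Callable, Iterable, List, Optional, Tuple


@dataclass
class Trace:
    answer: Optional[str]  # final answer parsed from the trace, if it finished
    confidence: float      # scalar confidence signal (assumed: smoothed token confidence)


def filtered_majority_vote(traces: Iterable[Trace], keep_fraction: float = 0.1) -> Optional[str]:
    """Offline-mode sketch: keep only the most confident traces, then majority-vote."""
    finished = [t for t in traces if t.answer is not None]
    if not finished:
        return None
    ranked = sorted(finished, key=lambda t: t.confidence, reverse=True)
    kept = ranked[: max(1, int(len(ranked) * keep_fraction))]
    return Counter(t.answer for t in kept).most_common(1)[0][0]


def generate_with_early_exit(step: Callable[[List[str]], Tuple[str, float, Optional[str]]],
                             max_tokens: int = 2048,
                             min_confidence: float = 0.5) -> Trace:
    """Online-mode sketch: stop a trace as soon as its running confidence collapses,
    which is what saves tokens versus letting every trace run to completion.
    `step` is a hypothetical callable returning (next_token, token_confidence, answer_or_None)."""
    tokens: List[str] = []
    running_conf = 1.0
    for _ in range(max_tokens):
        token, token_conf, answer = step(tokens)
        tokens.append(token)
        running_conf = 0.9 * running_conf + 0.1 * token_conf  # smoothed confidence (assumption)
        if running_conf < min_confidence:
            return Trace(answer=None, confidence=running_conf)  # early exit: discard this trace
        if answer is not None:
            return Trace(answer=answer, confidence=running_conf)
    return Trace(answer=None, confidence=running_conf)
```

A more conservative variant would keep a larger fraction of traces (or a lower confidence cutoff), trading some token savings for robustness when the model is confidently wrong.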
DeepConf's potential is significant. Its efficiency and comparable or better results relative to standard majority voting could give it a central role in language model development. By reducing computational costs, it addresses concerns about the economic viability of 'thinking' models. With further refinement, DeepConf could help make mathematical reasoning in language models more practical and accessible.
Read also:
- Vantor & Lanteris Fuel US Intelligence with Innovative Tech
- Germany Eyes Ohio's Natural Gas Over Russia's, US Energy Policy Shifts
- "In a daring decision, Battlefield 6 forgoes ray tracing - understanding the advantages this choice brings"
- Dubai's WETEX 2023: Global Showcase for Clean Energy & Sustainability