AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time

Junyu Zhang; Runpei Dong; Han Wang; Xuying Ning; Haoran Geng; Peihao Li; Xialin He; Yutong Bai; Jitendra Malik; Saurabh Gupta; Huan Zhang

2025 EMNLP EMNLP 2025

AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time

Abstract

AbstractThis paper presents AlphaOne (𝛼1), a universal framework for modulating reasoning progress in large reasoning models (LRMs) at test time. 𝛼1 first introduces 𝛼 moment, which represents the scaled thinking phase with a universal parameter 𝛼.Within this scaled pre-𝛼 moment phase, it dynamically schedules slow thinking transitions by modeling the insertion of reasoning transition tokens as a Bernoulli stochastic process. After the 𝛼 moment, 𝛼1 deterministically terminates slow thinking with the end-of-thinking token, thereby fostering fast reasoning and efficient answer generation. This approach unifies and generalizes existing monotonic scaling methods by enabling flexible and dense slow-to-fast reasoning modulation. Extensive empirical studies on various challenging benchmarks across mathematical, coding, and scientific domains demonstrate 𝛼1‘s superior reasoning capability and efficiency. Project page: https://alphaone-project.github.io/.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Machine Learning

🧭 Keyword Pioneer — reasoning transition

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Junyu Zhang , Runpei Dong , Han Wang , Xuying Ning , Haoran Geng , Peihao Li , Xialin He , Yutong Bai , Jitendra Malik , Saurabh Gupta , Huan Zhang

Topics

Artificial Intelligence > Core AI > Foundation Models Machine Learning > Optimization & Theory > Neural Network Optimization Machine Learning > Learning Types > Reinforcement Learning Artificial Intelligence > Core AI > Large Language Models Artificial Intelligence > Core AI > Reasoning

Keywords

stochastic process inference optimization reasoning model test-time scaling slow thinking large reasoning model token generation reasoning transition fast reasoning bernoulli stochastic process

Download PDF

Related papers

Bit-Flip Error Resilience in LLMs: A Comprehensive Analysis and Defense Framework 2025

VoiceCraft-X: Unifying Multilingual, Voice-Cloning Speech Synthesis and Speech Editing 2025

Model-based Large Language Model Customization as Service 2025

ZoomEye: Enhancing Multimodal LLMs with Human-Like Zooming Capabilities through Tree-Based Image Exploration 2025

SlideCoder: Layout-aware RAG-enhanced Hierarchical Slide Generation from Design 2025