AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time | Cool Papers

#1 AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time [PDF¹⁹] [Copy] [Kimi²²] [REL]

Authors: Junyu Zhang, Runpei Dong, Han Wang, Xuying Ning, Haoran Geng, Peihao Li, Xialin He, Yutong Bai, Jitendra Malik, Saurabh Gupta, Huan Zhang

This paper presents AlphaOne ( $\alpha$ 1), a universal framework for modulating reasoning progress in large reasoning models (LRMs) at test time. $\alpha$ 1 first introduces $\alpha$ moment, which represents the scaled thinking phase with a universal parameter $\alpha$ . Within this scaled pre- $\alpha$ moment phase, it dynamically schedules slow thinking transitions by modeling the insertion of reasoning transition tokens as a Bernoulli stochastic process. After the $\alpha$ moment, $\alpha$ 1 deterministically terminates slow thinking with the end-of-thinking token, thereby fostering fast reasoning and efficient answer generation. This approach unifies and generalizes existing monotonic scaling methods by enabling flexible and dense slow-to-fast reasoning modulation. Extensive empirical studies on various challenging benchmarks across mathematical, coding, and scientific domains demonstrate $\alpha$ 1's superior reasoning capability and efficiency. Project page: https://alphaone-project.github.io/

Subject: Computation and Language

Publish: 2025-05-30 17:58:36 UTC

2505.24863

#1 AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time [PDF19] [Copy] [Kimi22] [REL]

#1 AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time [PDF¹⁹] [Copy] [Kimi²²] [REL]