Reducing Cognitive Overhead in Tool Use via Multi-Small-Agent Reinforcement Learning

#1 Reducing Cognitive Overhead in Tool Use via Multi-Small-Agent Reinforcement Learning [PDF³] [Copy] [Kimi⁷] [REL]

Authors: Dayu Wang, Jiaye Yang, Weikang Li, Jiahui Liang, Yang Li

Recent advances in multi-agent systems highlight the potential of specialized small agents that collaborate via division of labor. Existing tool-integrated reasoning systems, however, often follow a single-agent paradigm in which one large model interleaves long-horizon reasoning with precise tool operations, leading to cognitive-load interference and unstable coordination. We present MSARL, a Multi-Small-Agent Reinforcement Learning framework that explicitly decouples reasoning from tool use. In MSARL, a Reasoning Agent decomposes problems and plans tool invocations, while multiple Tool Agents specialize in specific external tools, each trained via a combination of imitation learning and reinforcement learning with role-specific rewards. On mathematical problem solving with code execution, MSARL significantly improves reasoning stability and final-answer accuracy over single-agent baselines. Moreover, the architecture generalizes to diverse tool-use tasks, demonstrating that cognitive-role decoupling with small agents is a scalable blueprint for multi-agent AI design.

Subject: Artificial Intelligence

Publish: 2025-08-12 12:10:53 UTC

2508.08882

#1 Reducing Cognitive Overhead in Tool Use via Multi-Small-Agent Reinforcement Learning [PDF3] [Copy] [Kimi7] [REL]

#1 Reducing Cognitive Overhead in Tool Use via Multi-Small-Agent Reinforcement Learning [PDF³] [Copy] [Kimi⁷] [REL]