The Lighthouse of Language: Enhancing LLM Agents via Critique-Guided Improvement

#1 The Lighthouse of Language: Enhancing LLM Agents via Critique-Guided Improvement [PDF⁷] [Copy] [Kimi⁷] [REL]

Authors: Ruihan Yang, Fanghua Ye, Jian Li, Siyu Yuan, Yikai Zhang, Zhaopeng Tu, Xiaolong Li, Deqing Yang

Large language models (LLMs) have recently transformed from text-based assistants to autonomous agents capable of planning, reasoning, and iteratively improving their actions. While numerical reward signals and verifiers can effectively rank candidate actions, they often provide limited contextual guidance. In contrast, natural language feedback better aligns with the generative capabilities of LLMs, providing richer and more actionable suggestions. However, parsing and implementing this feedback effectively can be challenging for LLM-based agents. In this work, we introduce Critique-Guided Improvement (CGI), a novel two-player framework, comprising an actor model that explores an environment and a critic model that generates detailed nature language feedback. By training the critic to produce fine-grained assessments and actionable revisions, and the actor to utilize these critiques, our approach promotes more robust exploration of alternative strategies while avoiding local optima. Experiments in three interactive environments show that CGI outperforms existing baselines by a substantial margin. Notably, even a small critic model surpasses GPT-4 in feedback quality. The resulting actor achieves state-of-the-art performance, demonstrating the power of explicit iterative guidance to enhance decision-making in LLM-based agents.

Subjects: Computation and Language , Artificial Intelligence

Publish: 2025-03-20 10:42:33 UTC

2503.16024

#1 The Lighthouse of Language: Enhancing LLM Agents via Critique-Guided Improvement [PDF7] [Copy] [Kimi7] [REL]

#1 The Lighthouse of Language: Enhancing LLM Agents via Critique-Guided Improvement [PDF⁷] [Copy] [Kimi⁷] [REL]