Training Language Models to Critique With Multi-agent Feedback

Tian Lan; Wenwei Zhang; Chengqi Lyu; Shuaibin Li; Chen Xu; Heyan Huang; Dahua Lin; Xian-Ling Mao; Kai Chen

EMNLP 2025

Abstract

Critique ability, a meta-cognitive capability of humans, is challenging for LLMs to improve. While human annotation can enhance critique ability effectively, most recent works rely primarily on supervised fine-tuning (SFT) using critiques generated by a single LLM such as GPT-4, which is more scalable and cost-effective. However, such model-generated critiques often suffer from inherent flaws due to the complexity of critique. Consequently, fine-tuning LLMs on these flawed critiques not only limits performance but also propagates errors into the learned model. To address this issue, we propose MultiCritique, a unified framework that leverages multi-agent feedback to improve critique ability in both the SFT and reinforcement learning (RL) stages. In the SFT stage, MultiCritique aggregates high-quality multi-agent critiques through a fine-grained meta-critique mechanism. In the RL stage, preference critiques are constructed and refined by validating their contributions to revisions, thereby enhancing the robustness of RL in improving critique ability. Based on MultiCritique, we construct SFT and RL datasets. Extensive experimental results on two benchmarks highlight the key benefits of our dataset, including superior quality, enhanced data efficiency, strong generalization on unseen tasks, and improvements in the general capability of LLMs. Notably, our fine-tuned 7B model significantly surpasses advanced 7B-13B models, approaching advanced 70B LLMs and GPT-4. Resources are publicly available.
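The two stages described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not specify data formats, scoring, or thresholds, so every name here (`CritiqueItem`, `ScoredCritique`, `aggregate_critiques`, `build_preference_pair`, the `judged_valid` flag, the `revision_score` scale, and the `margin` value) is a hypothetical assumption chosen only to show the general shape of the idea.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass
class CritiqueItem:
    agent: str          # hypothetical label for the model that wrote this point
    text: str           # one atomic critique point
    judged_valid: bool  # meta-critique verdict, assumed to be supplied upstream


def aggregate_critiques(items: List[CritiqueItem]) -> List[str]:
    """SFT-stage sketch: keep points judged valid, dropping case-insensitive duplicates."""
    seen = set()
    kept = []
    for item in items:
        key = item.text.strip().lower()
        if item.judged_valid and key not in seen:
            seen.add(key)
            kept.append(item.text)
    return kept


@dataclass
class ScoredCritique:
    critique: str
    revision_score: float  # assumed quality score of the revision this critique led to


def build_preference_pair(
    candidates: List[ScoredCritique], margin: float = 1.0
) -> Optional[Tuple[str, str]]:
    """RL-stage sketch: return (chosen, rejected) critiques, or None if the gap is too small."""
    if len(candidates) < 2:
        return None
    ranked = sorted(candidates, key=lambda c: c.revision_score)
    worst, best = ranked[0], ranked[-1]
    # Assumption: a pair is only useful when the revisions differ clearly in quality.
    if best.revision_score - worst.revision_score < margin:
        return None
    return best.critique, worst.critique
```

In this sketch the preference signal comes from the revision a critique leads to rather than from a direct judgment of the critique text, which mirrors the abstract's phrase "validating their contributions to revisions"; how the paper actually scores revisions is not stated here.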

Topics

Artificial Intelligence > Core AI > Multi-Agent Systems
Machine Learning > Learning Types > Self-Supervised Learning
Reinforcement Learning > Methods > Deep RL
Artificial Intelligence > Core AI > Large Language Models
Deep Learning > Learning Types > Reinforcement Learning
Deep Learning > Learning Types > Fine-Tuning

Keywords

reinforcement learning, language model, reward model, supervised fine-tuning, multi-agent feedback, large language model, multi-agent system
