ICML 2024

Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning

👥 Mega-Team — 21 authors