← Back to papers

2024 ICML ICML 2024

Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning

👥 Mega-Team — 21 authors

Authors

Zhiheng Xi , Wenxiang Chen , Boyang Hong , Senjie Jin , Rui Zheng , Wei He , Yiwen Ding , Shichun Liu , Xin Guo , Junzhe Wang , Honglin Guo , Wei Shen , Xiaoran Fan , Yuhao Zhou , Shihan Dou , Xiao Wang , Xinbo Zhang , peng sun , Tao Gui , Qi Zhang , Xuanjing Huang

Related papers

Learning Latent Dynamic Robust Representations for World Models 2024

Beyond Individual Input for Deep Anomaly Detection on Tabular Data 2024

Risk Estimation in a Markov Cost Process: Lower and Upper Bounds 2024

Collapse-Aware Triplet Decoupling for Adversarially Robust Image Retrieval 2024

Ranking-based Client Imitation Selection for Efficient Federated Learning 2024