ICML 2024
Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning
Authors
Zhiheng Xi, Wenxiang Chen, Boyang Hong, Senjie Jin, Rui Zheng, Wei He, Yiwen Ding, Shichun Liu, Xin Guo, Junzhe Wang, Honglin Guo, Wei Shen, Xiaoran Fan, Yuhao Zhou, Shihan Dou, Xiao Wang, Xinbo Zhang, Peng Sun, Tao Gui, Qi Zhang, Xuanjing Huang