ICML 2025
VersaPRM: Multi-Domain Process Reward Model via Synthetic Reasoning Data
Authors
Thomas Zeng, Shuibai Zhang, Shutong Wu, Christian Classen, Daewon Chae, Ethan Ewer, Minjae Lee, Heeju Kim, Wonjun Kang, Jackson Kunde, Ying Fan, Jungtaek Kim, Hyung Il Koo, Kannan Ramchandran, Dimitris Papailiopoulos, Kangwook Lee