ICML 2025

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training