ICML 2025

MERIT: Maximum-normalized Element-wise Ratio for Language Model Large-batch Training