ICML 2024

Rotational Equilibrium: How Weight Decay Balances Learning Across Neural Networks