ICML 2025

FRUGAL: Memory-Efficient Optimization by Reducing State Overhead for Scalable Training