Neural Network Optimization
3648 directly classified papers
Papers
Better Embeddings with Coupled Adam
ACL 2025
Value Residual Learning
ACL 2025
LESA: Learnable LLM Layer Scaling-Up
ACL 2025