2024 EMNLP EMNLP 2024

GRASS: Compute Efficient Low-Memory LLM Training with Structured Sparse Gradients