ICML 2025
MoEQuant: Enhancing Quantization for Mixture-of-Experts Large Language Models via Expert-Balanced Sampling and Affinity Guidance
Authors
Zhixuan Chen, Xing Hu, Dawei Yang, Zukang Xu, Xu Chen, Zhihang Yuan, Sifan Zhou, Jiangyong Yu