2024
ICML
ICML 2024
Exploring the Benefit of Activation Sparsity in Pre-training
Authors
Zhengyan Zhang
,
Chaojun Xiao
,
Qiujieli Qin
,
Yankai Lin
,
Zhiyuan Zeng
,
Xu Han
,
Zhiyuan Liu
,
Ruobing Xie
,
Maosong Sun
,
Jie Zhou