Papers
Sparse Attention with Linear Units
EMNLP 2021
Cross-Iteration Batch Normalization
CVPR 2021
Stochastic Whitening Batch Normalization
CVPR 2021