2023 IJCNLP IJCNLP 2023

On a Benefit of Masked Language Model Pretraining: Robustness to Simplicity Bias

Authors