Papers
Attack to Explain Deep Representation
CVPR 2020
A Tale of a Probe and a Parser
ACL 2020
Attention is not not Explanation
EMNLP 2019
Human-in-the-Loop Interpretability Prior
NIPS 2018