TX-Ray: Quantifying and Explaining Model-Knowledge Transfer in (Un-)Supervised NLP

Nils Rethmeier; Vageesh Kumar Saxena; Isabelle Augenstein

2020 UAI UAI 2020

TX-Ray: Quantifying and Explaining Model-Knowledge Transfer in (Un-)Supervised NLP

Abstract

While state-of-the-art NLP explainability (XAI) methods focus on explaining per-sample decisions in supervised end or probing tasks, this is insufficient to explain and quantify model knowledge transfer during (un-)supervised training. Thus, for TX-Ray, we modify the established computer vision explainability principle of ‘visualizing preferred inputs of neurons’ to make it usable for both NLP and for transfer analysis. This allows one to analyze, track and quantify how self- or supervised NLP models first build knowledge abstractions in pretraining (1), andthen transfer abstractions to a new domain (2), or adapt them during supervised finetuning (3) – see Fig. 1. TX-Ray expresses neurons as feature preference distributions to quantify fine-grained knowledge transfer or adaptation and guide human analysis. We find that, similar to Lottery Ticket based pruning, TX-Ray based pruning can improve test set generalization and that it can reveal how early stages of self-supervision automatically learn linguistic abstractions like parts-of-speech.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Machine Learning and Natural Language Processing

🧭 Keyword Pioneer — model knowledge transfer

🐣 Hot Topic Early Bird — self-supervised learning

🐝 Cross-Pollinator — Artificial Intelligence, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Speech & Audio

Authors

Nils Rethmeier , Vageesh Kumar Saxena , Isabelle Augenstein

Topics

Artificial Intelligence > Core AI > Interpretability Machine Learning > Learning Types > Self-Supervised Learning Natural Language Processing > Resources & Methods > Transfer Learning

Keywords

self-supervised learning probing task model knowledge transfer feature preference distribution lottery ticket pruning

Download PDF

Related papers

Walking on Two Legs: Learning Image Segmentation with Noisy Labels 2020

Finite-Memory Near-Optimal Learning for Markov Decision Processes with Long-Run Average Reward 2020

Automated Dependence Plots 2020

Collapsible IDA: Collapsing Parental Sets for Locally Estimating Possible Causal Effects 2020

Learning by Repetition: Stochastic Multi-armed Bandits under Priming Effect 2020