Data and Model Distillation as a Solution for Domain-transferable Fact Verification

Mitch Paul Mithun; Sandeep Suntwal; Mihai Surdeanu

2021 NAACL NAACL 2021

Data and Model Distillation as a Solution for Domain-transferable Fact Verification

Abstract

AbstractWhile neural networks produce state-of-the-art performance in several NLP tasks, they generally depend heavily on lexicalized information, which transfer poorly between domains. We present a combination of two strategies to mitigate this dependence on lexicalized information in fact verification tasks. We present a data distillation technique for delexicalization, which we then combine with a model distillation method to prevent aggressive data distillation. We show that by using our solution, not only does the performance of an existing state-of-the-art model remain at par with that of the model trained on a fully lexicalized data, but it also performs better than it when tested out of domain. We show that the technique we present encourages models to extract transferable facts from a given fact verification dataset.

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Mitch Paul Mithun , Sandeep Suntwal , Mihai Surdeanu

Topics

Machine Learning > Application Areas > Domain Adaptation Machine Learning > Application Areas > Knowledge Distillation Machine Learning > Learning Types > Transfer Learning

Keywords

transfer learning domain adaptation fact verification knowledge distillation neural network

Download PDF

Related papers

Knowledge Router: Learning Disentangled Representations for Knowledge Graphs 2021

Cross-Task Instance Representation Interactions and Label Dependencies for Joint Information Extraction with Graph Convolutional Networks 2021

Abstract Meaning Representation Guided Graph Encoding and Decoding for Joint Information Extraction 2021

Beyond Fair Pay: Ethical Implications of NLP Crowdsourcing 2021

Probing Word Translations in the Transformer and Trading Decoder for Encoder Layers 2021