Using Explicit Discourse Connectives in Translation for Implicit Discourse Relation Classification

Wei Shi; Frances Yung; Raphael Rubino; Vera Demberg

2017 IJCNLP IJCNLP 2017

Using Explicit Discourse Connectives in Translation for Implicit Discourse Relation Classification

Abstract

AbstractImplicit discourse relation recognition is an extremely challenging task due to the lack of indicative connectives. Various neural network architectures have been proposed for this task recently, but most of them suffer from the shortage of labeled data. In this paper, we address this problem by procuring additional training data from parallel corpora: When humans translate a text, they sometimes add connectives (a process known as explicitation). We automatically back-translate it into an English connective and use it to infer a label with high confidence. We show that a training set several times larger than the original training set can be generated this way. With the extra labeled instances, we show that even a simple bidirectional Long Short-Term Memory Network can outperform the current state-of-the-art.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Deep Learning and Machine Learning and Natural Language Processing

📈 Trend Setter — Natural Language Inference

🧭 Keyword Pioneer — discourse connective

🐣 Hot Topic Early Bird — parallel corpus

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Wei Shi , Frances Yung , Raphael Rubino , Vera Demberg

Topics

Artificial Intelligence > Core AI > Multimodal Learning Machine Learning > Application Areas > Data Augmentation Deep Learning > Architectures > Neural Networks Machine Learning > Learning Types > Transfer Learning Natural Language Processing > Understanding > Natural Language Inference

Keywords

data augmentation cross-lingual transfer parallel corpus discourse connective bidirectional long short-term memory implicit discourse relation neural network

Download PDF

Related papers

Procedural Text Generation from an Execution Video 2017

DailyDialog: A Manually Labelled Multi-turn Dialogue Dataset 2017

Roles and Success in Wikipedia Talk Pages: Identifying Latent Patterns of Behavior 2017

PubMed 200k RCT: a Dataset for Sequential Sentence Classification in Medical Abstracts 2017

Alibaba at IJCNLP-2017 Task 1: Embedding Grammatical Features into LSTMs for Chinese Grammatical Error Diagnosis Task 2017