CopyNext: Explicit Span Copying and Alignment in Sequence to Sequence Models

Abhinav Singh; Patrick Xia; Guanghui Qin; Mahsa Yarmohammadi; Benjamin Van Durme

EMNLP 2020

Abstract

Copy mechanisms are employed in sequence to sequence (seq2seq) models to generate reproductions of words from the input to the output. These frameworks, operating at the lexical type level, fail to provide an explicit alignment that records where each token was copied from. Further, they require contiguous token sequences from the input (spans) to be copied individually. We present a model with an explicit token-level copy operation and extend it to copying entire spans. Our model provides hard alignments between spans in the input and output, allowing for nontraditional applications of seq2seq, like information extraction. We demonstrate the approach on Nested Named Entity Recognition, achieving near state-of-the-art accuracy with an order of magnitude increase in decoding speed.
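To make the idea of explicit copying with hard alignments concrete, the following is a minimal sketch that replays a decoded sequence of operations against an input sentence. The operation names (`GEN`, `COPY`, `COPYNEXT`), the tuple encoding, and the function `apply_ops` are assumptions introduced here for illustration; they are not taken from the paper's implementation, and the neural model that would predict these operations is omitted entirely.

```python
from typing import List, Tuple

# Hypothetical operation encoding (an illustrative assumption):
#   ("GEN", token)  emit a token from the output vocabulary
#   ("COPY", i)     copy the input token at position i and start a new span
#   ("COPYNEXT",)   extend the current span with the input token that
#                   follows the most recently copied one
Op = Tuple
Span = Tuple[int, int, int, int]  # (out_start, out_end, src_start, src_end), inclusive


def apply_ops(source: List[str], ops: List[Op]) -> Tuple[List[str], List[Span]]:
    """Replay operations, returning the output tokens and hard span alignments."""
    output: List[str] = []
    spans: List[List[int]] = []
    last_src = None  # source index of the most recent copy, if a span is open

    for op in ops:
        kind = op[0]
        if kind == "GEN":
            output.append(op[1])
            last_src = None  # generating a token closes any open span
        elif kind == "COPY":
            i = op[1]
            output.append(source[i])
            out_pos = len(output) - 1
            spans.append([out_pos, out_pos, i, i])
            last_src = i
        elif kind == "COPYNEXT":
            if last_src is None or last_src + 1 >= len(source):
                raise ValueError("COPYNEXT needs an open span with a following token")
            last_src += 1
            output.append(source[last_src])
            spans[-1][1] = len(output) - 1  # extend the span on the output side
            spans[-1][3] = last_src         # extend the span on the input side
        else:
            raise ValueError(f"unknown operation: {kind}")

    return output, [tuple(s) for s in spans]


source = "the University of Maryland campus".split()
ops = [("GEN", "ORG"), ("COPY", 1), ("COPYNEXT",), ("COPYNEXT",)]
output, spans = apply_ops(source, ops)
```

In this sketch, a whole span costs one pointer decision plus repeated `COPYNEXT` steps, and each recorded alignment states exactly which input positions an output span came from. That record is the kind of information an extraction task such as nested NER could read off directly.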

Topics

Deep Learning > Architectures > Transformers
Natural Language Processing > Applications > Information Extraction
Natural Language Processing > Applications > Named Entity Recognition
Machine Learning > Learning Types > Multi-Modal Learning
Machine Learning > Core Methods > Structured Prediction
Deep Learning > Learning Types > Sequence Modeling

Keywords

information extraction, named entity recognition, sequence to sequence, explicit alignment, copy mechanism, nested entity, span extraction, span copying
