Neural Lattice-to-Sequence Models for Uncertain Inputs

Matthias Sperber; Graham Neubig; Jan Niehues; Alex Waibel

2017 EMNLP EMNLP 2017

Neural Lattice-to-Sequence Models for Uncertain Inputs

Abstract

AbstractThe input to a neural sequence-to-sequence model is often determined by an up-stream system, e.g. a word segmenter, part of speech tagger, or speech recognizer. These up-stream models are potentially error-prone. Representing inputs through word lattices allows making this uncertainty explicit by capturing alternative sequences and their posterior probabilities in a compact form. In this work, we extend the TreeLSTM (Tai et al., 2015) into a LatticeLSTM that is able to consume word lattices, and can be used as encoder in an attentional encoder-decoder model. We integrate lattice posterior scores into this architecture by extending the TreeLSTM’s child-sum and forget gates and introducing a bias term into the attention mechanism. We experiment with speech translation lattices and report consistent improvements over baselines that translate either the 1-best hypothesis or the lattice without posterior scores.

🌉 Interdisciplinary Bridge — Deep Learning and Natural Language Processing

🧭 Keyword Pioneer — lattice lstm

🐣 Hot Topic Early Bird — speech translation

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Matthias Sperber , Graham Neubig , Jan Niehues , Alex Waibel

Topics

Deep Learning > Architectures > Neural Networks Deep Learning > Techniques > Model Architecture Natural Language Processing > Generation > Machine Translation Deep Learning > Learning Types > Representation Learning Deep Learning > Architectures > Recurrent Neural Networks

Keywords

attention mechanism sequence-to-sequence model encoder-decoder model speech translation word lattice lattice lstm uncertain input

Download PDF

Related papers

Reinforced Video Captioning with Entailment Rewards 2017

Cross-lingual Character-Level Neural Morphological Tagging 2017

Inter-Weighted Alignment Network for Sentence Pair Modeling 2017

Investigating Different Syntactic Context Types and Context Representations for Learning Word Embeddings 2017

An Empirical Analysis of Edit Importance between Document Versions 2017