Interpretable Structure Induction via Sparse Attention

Ben Peters; Vlad Niculae; André F. T. Martins

EMNLP 2018

Abstract

Neural network methods are experiencing wide adoption in NLP, thanks to their empirical performance on many tasks. Modern neural architectures go way beyond simple feedforward and recurrent models: they are complex pipelines that perform soft, differentiable computation instead of discrete logic. The price of such soft computing is the introduction of dense dependencies, which make it hard to disentangle the patterns that trigger a prediction. Our recent work on sparse and structured latent computation presents a promising avenue for enhancing interpretability of such neural pipelines. Through this extended abstract, we aim to discuss and explore the potential and impact of our methods.
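To make the contrast between dense and sparse attention concrete, the sketch below compares softmax with sparsemax, one well-known sparse alternative. The abstract does not name the specific transformation the authors use, so the choice of sparsemax, the function names, and the example scores are assumptions made for illustration only.

```python
import math


def softmax(scores):
    """Dense attention: every score receives strictly positive weight."""
    m = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def sparsemax(scores):
    """Sparse attention (illustrative choice, not confirmed by the abstract).

    Computes the Euclidean projection of `scores` onto the probability
    simplex, which can assign exactly zero weight to low-scoring items.
    """
    z_sorted = sorted(scores, reverse=True)
    cumsum = 0.0
    tau = 0.0
    for j, z in enumerate(z_sorted, start=1):
        cumsum += z
        # Item j stays in the support while 1 + j * z_(j) > sum of the top j.
        if 1.0 + j * z > cumsum:
            tau = (cumsum - 1.0) / j  # threshold for the current support
    return [max(s - tau, 0.0) for s in scores]


scores = [1.5, 1.0, 0.1, -1.0]  # hypothetical attention scores
dense = softmax(scores)
sparse = sparsemax(scores)
print(dense)   # all four weights are nonzero
print(sparse)  # [0.75, 0.25, 0.0, 0.0]
```

With softmax, all four inputs contribute to the prediction, so none can be ruled out when inspecting the weights. With sparsemax, only the first two inputs receive nonzero weight, which is the kind of sparsity that makes it easier to see which inputs triggered a prediction.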

Authors

Ben Peters, Vlad Niculae, André F. T. Martins

Topics

Artificial Intelligence > Core AI > Interpretability
Deep Learning > Techniques > Model Architecture
Deep Learning > Techniques > Attention Mechanism

Keywords

sparse attention, neural pipeline, structure induction, interpretable structure, latent computation, disentangle pattern
