2018 EMNLP EMNLP 2018

Interpretable Structure Induction via Sparse Attention

Abstract

AbstractNeural network methods are experiencing wide adoption in NLP, thanks to their empirical performance on many tasks. Modern neural architectures go way beyond simple feedforward and recurrent models: they are complex pipelines that perform soft, differentiable computation instead of discrete logic. The price of such soft computing is the introduction of dense dependencies, which make it hard to disentangle the patterns that trigger a prediction. Our recent work on sparse and structured latent computation presents a promising avenue for enhancing interpretability of such neural pipelines. Through this extended abstract, we aim to discuss and explore the potential and impact of our methods.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Deep Learning
📈 Trend Setter — Attention Mechanism
🧭 Keyword Pioneer — structure induction
🐣 Hot Topic Early Bird — sparse attention
🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Interdisciplinary, Machine Learning, Mathematics & Optimization, Natural Language Processing, Speech & Audio