Dynamic Pooling and Unfolding Recursive Autoencoders for Paraphrase Detection

Richard Socher; Eric H. Huang; Jeffrey Pennin; Christopher D. Manning; Andrew Y. Ng

2011 NIPS NeurIPS 2011

Dynamic Pooling and Unfolding Recursive Autoencoders for Paraphrase Detection

Abstract

Paraphrase detection is the task of examining two sentences and determining whether they have the same meaning. In order to obtain high accuracy on this task, thorough syntactic and semantic analysis of the two statements is needed. We introduce a method for paraphrase detection based on recursive autoencoders (RAE). Our unsupervised RAEs are based on a novel unfolding objective and learn feature vectors for phrases in syntactic trees. These features are used to measure the word- and phrase-wise similarity between two sentences. Since sentences may be of arbitrary length, the resulting matrix of similarity measures is of variable size. We introduce a novel dynamic pooling layer which computes a fixed-sized representation from the variable-sized matrices. The pooled representation is then used as input to a classifier. Our method outperforms other state-of-the-art approaches on the challenging MSRP paraphrase corpus.

🌉 Interdisciplinary Bridge — Deep Learning and Natural Language Processing

📈 Trend Setter — Autoencoders

🧭 Keyword Pioneer — recursive autoencoders

🐣 Hot Topic Early Bird — representation learning

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Speech & Audio

Authors

Richard Socher , Eric H. Huang , Jeffrey Pennin , Christopher D. Manning , Andrew Y. Ng

Topics

Deep Learning > Architectures > Autoencoders Natural Language Processing > Understanding > Semantic Analysis Natural Language Processing > Applications > Text Classification Machine Learning > Learning Types > Deep Learning Natural Language Processing > Applications > Natural Language Understanding

Keywords

representation learning syntactic parsing semantic analysis paraphrase detection dynamic pooling sentence similarity syntactic tree syntax tree recursive autoencoder

Download PDF

Related papers

Co-Training for Domain Adaptation 2011

The Local Rademacher Complexity of Lp-Norm Multiple Kernel Learning 2011

Learning to Agglomerate Superpixel Hierarchies 2011

A Reinforcement Learning Theory for Homeostatic Regulation 2011

A Global Structural EM Algorithm for a Model of Cancer Progression 2011