2017 ACL ACL 2017

Sentence Alignment Methods for Improving Text Simplification Systems

Abstract

AbstractWe provide several methods for sentence-alignment of texts with different complexity levels. Using the best of them, we sentence-align the Newsela corpora, thus providing large training materials for automatic text simplification (ATS) systems. We show that using this dataset, even the standard phrase-based statistical machine translation models for ATS can outperform the state-of-the-art ATS systems.

📈 Trend Setter — Text Generation
🧭 Keyword Pioneer — sentence alignment
🐝 Cross-Pollinator — Artificial Intelligence, Data Science & Analytics, Interdisciplinary, Machine Learning, Natural Language Processing, Speech & Audio
🌉 Interdisciplinary Bridge — Machine Learning and Natural Language Processing
🐣 Hot Topic Early Bird — parallel corpus