Generating flexible proper name references in text: Data, models and evaluation

Thiago Castro Ferreira; Emiel Krahmer; Sander Wubben

EACL 2017

Abstract

This study introduces a statistical model able to generate variations of a proper name by taking into account the person to be mentioned, the discourse context and variation. The model relies on the REGnames corpus, a dataset with 53,102 proper name references to 1,000 people in different discourse contexts. We evaluate the versions of our model from the perspective of how human writers produce proper names, and also how human readers process them. The corpus and the model are publicly available.

Topics

Natural Language Processing > Understanding > Named Entity Recognition
Natural Language Processing > Generation > Text Generation
Natural Language Processing > Resources & Methods > Text Representation

Keywords

text generation, statistical model, named entity, reference resolution, discourse context, proper name generation, name reference
