EMNLP 2020
TED-MDB Lexicons: Tr-EnConnLex, Pt-EnConnLex
Abstract
In this work, we present two new bilingual discourse connective lexicons, one for Turkish-English and one for European Portuguese-English, created automatically from the existing discourse relation-aligned TED-MDB corpus. In their current form, the Pt-En lexicon includes 95 entries, whereas the Tr-En lexicon contains 133 entries. The lexicons constitute the first step of a larger project to develop a multilingual discourse connective lexicon.