2023
ACL
ACL 2023
GENTLE: A Genre-Diverse Multilayer Challenge Set for English NLP and Linguistic Evaluation
Abstract
AbstractWe present GENTLE, a new mixed-genre English challenge corpus totaling 17K tokens and consisting of 8 unusual text types for out-of-domain evaluation: dictionary entries, esports commentaries, legal documents, medical notes, poetry, mathematical proofs, syllabuses, and threat letters. GENTLE is manually annotated for a variety of popular NLP tasks, including syntactic dependency parsing, entity recognition, coreference resolution, and discourse parsing. We evaluate state-of-the-art NLP systems on GENTLE and find severe degradation for at least some genres in their performance on all tasks, which indicates GENTLE’s utility as an evaluation dataset for NLP systems.
🧭
Keyword Pioneer
— challenge corpus
🐝
Cross-Pollinator
— Artificial Intelligence, Computer Science, Computer Vision, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Speech & Audio
🌉
Interdisciplinary Bridge
— Artificial Intelligence and Machine Learning and Natural Language Processing
Authors
Topics
Natural Language Processing > Understanding > Coreference Resolution
Natural Language Processing > Understanding > Parsing
Natural Language Processing > Resources & Methods > Text Representation
Machine Learning > Learning Types > Evaluation
Artificial Intelligence > Core AI > Language
Machine Learning > Core Methods > Evaluation
Artificial Intelligence > Core AI > Natural Language Processing
Natural Language Processing > Applications > Natural Language Understanding