2021
EMNLP
EMNLP 2021
SeqScore: Addressing Barriers to Reproducible Named Entity Recognition Evaluation
Abstract
AbstractTo address a looming crisis of unreproducible evaluation for named entity recognition, we propose guidelines and introduce SeqScore, a software package to improve reproducibility. The guidelines we propose are extremely simple and center around transparency regarding how chunks are encoded and scored. We demonstrate that despite the apparent simplicity of NER evaluation, unreported differences in the scoring procedure can result in changes to scores that are both of noticeable magnitude and statistically significant. We describe SeqScore, which addresses many of the issues that cause replication failures.
🌉
Interdisciplinary Bridge
— Artificial Intelligence and Machine Learning and Natural Language Processing
🧭
Keyword Pioneer
— evaluation reproducibility
🐝
Cross-Pollinator
— Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Security & Privacy, Speech & Audio
Authors
Topics
Artificial Intelligence > Core AI > Interpretability
Machine Learning > Optimization & Theory > Theory
Natural Language Processing > Understanding > Named Entity Recognition
Natural Language Processing > Applications > Named Entity Recognition
Machine Learning > Learning Types > Evaluation
Machine Learning > Application Areas > Evaluation