2023 ACL ACL 2023

Sakura at SemEval-2023 Task 2: Data Augmentation via Translation

Abstract

AbstractWe demonstrate a simple yet effective approach to augmenting training data for multilingual named entity recognition using translations. The named entity spans from the original sentences are transferred to translations via word alignment and then filtered with the baseline recognizer. The proposed approach outperforms the baseline XLM-Roberta on the multilingual dataset.

🌉 Interdisciplinary Bridge — Machine Learning and Natural Language Processing
🐣 Hot Topic Early Bird — multilingual nlp
🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Speech & Audio