Few-NERD: A Few-shot Named Entity Recognition Dataset

Ning Ding; Guangwei Xu; Yulin Chen; Xiaobin Wang; Xu Han; Pengjun Xie; Haitao Zheng; Zhiyuan Liu

2021 ACL ACL 2021

Few-NERD: A Few-shot Named Entity Recognition Dataset

Abstract

AbstractRecently, considerable literature has grown up around the theme of few-shot named entity recognition (NER), but little published benchmark data specifically focused on the practical and challenging task. Current approaches collect existing supervised NER datasets and re-organize them to the few-shot setting for empirical study. These strategies conventionally aim to recognize coarse-grained entity types with few examples, while in practice, most unseen entity types are fine-grained. In this paper, we present Few-NERD, a large-scale human-annotated few-shot NER dataset with a hierarchy of 8 coarse-grained and 66 fine-grained entity types. Few-NERD consists of 188,238 sentences from Wikipedia, 4,601,160 words are included and each is annotated as context or a part of the two-level entity type. To the best of our knowledge, this is the first few-shot NER dataset and the largest human-crafted NER dataset. We construct benchmark tasks with different emphases to comprehensively assess the generalization capability of models. Extensive empirical results and analysis show that Few-NERD is challenging and the problem requires further research. The Few-NERD dataset and the baselines will be publicly available to facilitate the research on this problem.

🌉 Interdisciplinary Bridge — Deep Learning and Machine Learning and Natural Language Processing

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Ning Ding , Guangwei Xu , Yulin Chen , Xiaobin Wang , Xu Han , Pengjun Xie , Haitao Zheng , Zhiyuan Liu

Topics

Machine Learning > Learning Types > Zero-Shot Learning Natural Language Processing > Understanding > Named Entity Recognition Machine Learning > Learning Paradigms > Few-Shot Learning Natural Language Processing > Applications > Named Entity Recognition Deep Learning > Learning Types > Few-Shot Learning

Keywords

few-shot learning named entity recognition entity typing few-shot named entity recognition fine-grained entity type

Download PDF

Related papers

Out-of-Scope Intent Detection with Self-Supervision and Discriminative Training 2021

A Non-Autoregressive Edit-Based Approach to Controllable Text Simplification 2021

How Did This Get Funded?! Automatically Identifying Quirky Scientific Achievements 2021

Exploring Discourse Structures for Argument Impact Classification 2021

Language Embeddings for Typology and Cross-lingual Transfer Learning 2021