Comparing CNN and LSTM character-level embeddings in BiLSTM-CRF models for chemical and disease named entity recognition

Zenan Zhai; Dat Quoc Nguyen; Karin Verspoor

EMNLP 2018

Abstract

We compare the use of LSTM-based and CNN-based character-level word embeddings in BiLSTM-CRF models to approach chemical and disease named entity recognition (NER) tasks. Empirical results over the BioCreative V CDR corpus show that the use of either type of character-level word embeddings in conjunction with the BiLSTM-CRF models leads to comparable state-of-the-art performance. However, the models using CNN-based character-level word embeddings have a computational performance advantage, increasing training time over word-based models by 25% while the LSTM-based character-level word embeddings more than double the required training time.
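As a rough illustration of what a CNN-based character-level word embedding computes, the sketch below slides a set of filters over a word's character vectors and keeps the maximum response of each filter (max-over-time pooling). The abstract does not specify the architecture, so the window size, vector dimensions, random weights, and the function name `char_cnn_embedding` are all assumptions made for illustration; this is not the authors' implementation.

```python
import random

def char_cnn_embedding(word, char_vectors, filters, window=3):
    """Hypothetical helper: convolve filters over character vectors, then max-pool."""
    dim = len(next(iter(char_vectors.values())))
    pad = [0.0] * dim
    # Zero-pad both ends so even very short words yield at least one window.
    # Characters missing from the table fall back to the zero vector (assumption).
    chars = ([pad] * (window - 1)
             + [char_vectors.get(c, pad) for c in word]
             + [pad] * (window - 1))
    pooled = []
    for filt in filters:  # each filter holds window * dim weights
        best = float("-inf")
        for start in range(len(chars) - window + 1):
            # Flatten the current window of character vectors.
            flat = [x for vec in chars[start:start + window] for x in vec]
            score = sum(w * x for w, x in zip(filt, flat))
            best = max(best, score)  # max-over-time pooling
        pooled.append(best)
    return pooled

# Toy setup with random (untrained) weights, for shape checking only.
rng = random.Random(0)
alphabet = "abcdefghijklmnopqrstuvwxyz"
char_vectors = {c: [rng.uniform(-1, 1) for _ in range(4)] for c in alphabet}
filters = [[rng.uniform(-1, 1) for _ in range(3 * 4)] for _ in range(5)]

vec = char_cnn_embedding("aspirin", char_vectors, filters)
print(len(vec))  # one value per filter, regardless of word length
```

Each window position is scored independently of the others, whereas an LSTM consumes the characters one step at a time. In a full model the pooled vector would typically be concatenated with a word embedding before the BiLSTM-CRF layers, which is again an assumption about the usual setup rather than a detail given in the abstract.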

Topics

Natural Language Processing > Applications > Named Entity Recognition
Deep Learning > Architectures > Recurrent Neural Networks
Healthcare & Medicine > Clinical > Medical NLP

Keywords

named entity recognition; conditional random field; character embedding; bi-directional LSTM; character-level embedding; chemical entity recognition; disease mention recognition; disease entity recognition
