BERT based Transformers lead the way in Extraction of Health Information from Social Media

Sidharth Ramesh; Abhiraj Tiwari; Parthivi Choubey; Saisha Kashyap; Sahil Khose; Kumud Lakara; Nishesh Singh; Ujjwal Verma

NAACL 2021

Abstract

This paper describes our submissions for the Social Media Mining for Health (SMM4H) 2021 shared tasks. We participated in two tasks: (1) classification, extraction and normalization of adverse drug effect (ADE) mentions in English tweets (Task-1) and (2) classification of COVID-19 tweets containing symptoms (Task-6). Our approach for the first task uses the language representation model RoBERTa with a binary classification head. For the second task, we use BERTweet, which is based on RoBERTa. For both tasks, we fine-tune the pre-trained models. The models are placed on top of a custom domain-specific pre-processing pipeline. Our system ranked first among all the submissions for subtask-1(a) with an F1-score of 61%. For subtask-1(b), our system obtained an F1-score of 50%, with improvements of up to +8% F1 over the median score across all submissions. The BERTweet model achieved an F1-score of 94% on SMM4H 2021 Task-6.
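As a rough illustration of two ideas mentioned in the abstract, the sketch below shows a tweet normalization step and the F1-score used to report results. The abstract does not describe the actual pre-processing pipeline, so the function names `normalize_tweet` and `f1_score`, the regular expressions, and the placeholder tokens `@USER` and `HTTPURL` (borrowed from BERTweet's convention for user mentions and links) are all assumptions made for this example, not the authors' implementation.

```python
import re

# Hypothetical patterns: the paper's real pipeline is not given in the abstract.
URL_RE = re.compile(r"https?://\S+|www\.\S+")
MENTION_RE = re.compile(r"@\w+")
SPACE_RE = re.compile(r"\s+")


def normalize_tweet(text: str) -> str:
    """Replace URLs and user mentions with placeholders, collapse whitespace."""
    text = URL_RE.sub("HTTPURL", text)
    text = MENTION_RE.sub("@USER", text)
    return SPACE_RE.sub(" ", text).strip()


def f1_score(gold, pred, positive=1) -> float:
    """Binary F1 for the positive class (e.g. 'tweet mentions an ADE')."""
    pairs = list(zip(gold, pred))
    tp = sum(1 for g, p in pairs if g == positive and p == positive)
    fp = sum(1 for g, p in pairs if g != positive and p == positive)
    fn = sum(1 for g, p in pairs if g == positive and p != positive)
    if tp == 0:
        # No true positives: precision or recall is zero, so F1 is zero.
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    print(normalize_tweet("@john this drug gave me a headache   https://t.co/abc"))
    print(f1_score([1, 0, 1, 1], [1, 1, 0, 1]))
```

In a setup like the one described, text normalized this way would be tokenized and passed to the fine-tuned classifier, and `f1_score` would be computed on the positive class of the held-out predictions.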

Topics

Deep Learning > Architectures > Transformers
Natural Language Processing > Applications > Text Classification
Natural Language Processing > Resources & Methods > Large Language Models
Healthcare & Medicine > Clinical > Clinical NLP
Healthcare & Medicine > Clinical > Medical AI
Deep Learning > Models > Transformers

Keywords

text classification, named entity recognition, clinical text mining, language model, text extraction, adverse drug effect, health nlp
