Domain-specific or Uncertainty-aware models: Does it really make a difference for biomedical text classification?

Aman Sinha; Timothee Mickus; Marianne Clausel; Mathieu Constant; Xavier Coubez

2024 ACL ACL 2024

Domain-specific or Uncertainty-aware models: Does it really make a difference for biomedical text classification?

Abstract

AbstractThe success of pretrained language models (PLMs) across a spate of use-cases has led to significant investment from the NLP community towards building domain-specific foundational models. On the other hand, in mission critical settings such as biomedical applications, other aspects also factor in—chief of which is a model’s ability to produce reasonable estimates of its own uncertainty. In the present study, we discuss these two desiderata through the lens of how they shape the entropy of a model’s output probability distribution. We find that domain specificity and uncertainty awareness can often be successfully combined, but the exact task at hand weighs in much more strongly.

❓ The Questioner

🌉 Interdisciplinary Bridge — Machine Learning and Natural Language Processing

🧭 Keyword Pioneer — output entropy

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Aman Sinha , Timothee Mickus , Marianne Clausel , Mathieu Constant , Xavier Coubez

Topics

Machine Learning > Application Areas > Domain Adaptation Natural Language Processing > Applications > Text Classification

Keywords

domain adaptation uncertainty estimation pretrained language model biomedical text classification output entropy

Download PDF

Related papers

Reinforcement Learning-Driven LLM Agent for Automated Attacks on LLMs 2024

EtymoLink: A Structured English Etymology Dataset 2024

Turkish Delights: A Dataset on Turkish Euphemisms 2024

Subjectivity Detection in English News using Large Language Models 2024

Does DetectGPT Fully Utilize Perturbation? Bridging Selective Perturbation to Fine-tuned Contrastive Learning Detector would be Better 2024