A Continued Pretrained LLM Approach for Automatic Medical Note Generation

Dong Yuan; Eti Rastogi; Gautam Naik; Sree Prasanna Rajagopal; Sagar Goyal; Fen Zhao; Bharath Chintagunta; Jeffrey Ward

NAACL 2024

Abstract

LLMs are revolutionizing NLP tasks. However, the most advanced LLMs, such as GPT-4, are often prohibitively expensive to use in specialized fields. We introduce HEAL, the first continuously trained 13B LLaMA2-based LLM that is purpose-built for medical conversations and measured on automated scribing. Our results demonstrate that HEAL outperforms GPT-4 and PMC-LLaMA on PubMedQA, with an accuracy of 78.4%. It also achieves parity with GPT-4 in generating medical notes. Remarkably, HEAL identifies more correct medical concepts than GPT-4 and Med-PaLM 2, and exceeds the performance of human scribes and other comparable models in correctness and completeness.

Topics

Natural Language Processing > Generation > Text Generation

Natural Language Processing > Resources & Methods > Large Language Models

Keywords

continued pretraining, medical conversation, medical note generation, large language model, automated scribing

Related papers

Working Alliance Transformer for Psychotherapy Dialogue Classification 2024

Named Entity Recognition Under Domain Shift via Metric Learning for Life Sciences 2024

Assessing Logical Puzzle Solving in Large Language Models: Insights from a Minesweeper Case Study 2024

TelME: Teacher-leading Multimodal Fusion Network for Emotion Recognition in Conversation 2024

Extractive Summarization with Text Generator 2024