Taming the Real-world Complexities in CPT E/M Coding with Large Language Models

Islam Nassar; Yang Lin; Yuan Jin; Rongxin Zhu; Chang Wei Tan; Zenan Zhai; Nitika Mathur; Thanh Tien Vu; Xu Zhong; Long Duong; Yuan-Fang Li

EMNLP 2025


Abstract

Evaluation and Management (E/M) coding, under the Current Procedural Terminology (CPT) taxonomy, documents medical services provided to patients by physicians. Because these codes are used primarily for billing, it is in physicians’ best interest to provide accurate CPT E/M codes. Automating this coding task would help alleviate physicians’ documentation burden, improve billing efficiency, and ultimately enable better patient care. However, a number of real-world complexities make E/M coding automation a challenging task. In this paper, we elaborate on some of the key complexities and present ProFees, our LLM-based framework that tackles them, followed by a systematic evaluation. On an expert-curated real-world dataset, ProFees achieves an increase in coding accuracy of more than 36% over a commercial CPT E/M coding system and almost 5% over our strongest single-prompt baseline, demonstrating its effectiveness in addressing these complexities.



Topics

Artificial Intelligence > Core AI > Foundation Models
Artificial Intelligence > Core AI > Large Language Models
Machine Learning > Application Areas > Risk Management
Machine Learning > Learning Types > Few-Shot Learning
Natural Language Processing > Applications > Information Extraction
Natural Language Processing > Applications > Text Classification
Healthcare & Medicine > Clinical > Clinical NLP
Healthcare & Medicine > Clinical > Medical AI
Healthcare & Medicine > Clinical > Medical NLP

Keywords

few-shot learning, text classification, clinical natural language processing, medical coding, clinical documentation, large language model, healthcare nlp, cpt e/m coding, automated coding, billing automation, cpt coding
