Generative or Discriminative? Revisiting Text Classification in the Era of Transformers

Siva Rajesh Kasa; Karan Gupta; Sumegh Roychowdhury; Ashutosh Kumar; Yaswanth Biruduraju; SANTHOSH KUMAR KASA; Pattisapu Nikhil Priyatam; Arindam Bhattacharya; Shailendra Agarwal; Vijay Huddar

2025 EMNLP EMNLP 2025

Generative or Discriminative? Revisiting Text Classification in the Era of Transformers

Abstract

Abstract*The comparison between discriminative and generative classifiers has intrigued researchers since [Efron (1975)’s](https://www.jstor.org/stable/2285453) seminal analysis of logistic regression versus discriminant analysis. While early theoretical work established that generative classifiers exhibit lower sample complexity but higher asymptotic error in simple linear settings, these trade-offs remain unexplored in the transformer era. We present the first comprehensive evaluation of modern generative and discriminative architectures—Auto-regressive, Masked Language Modeling, Discrete Diffusion, and Encoders for text classification. Our study reveals that the classical “two regimes” phenomenon manifests distinctly across different architectures and training paradigms. Beyond accuracy, we analyze sample efficiency, calibration, noise robustness, and ordinality across diverse scenarios. Our findings offer practical guidance for selecting the most suitable modeling approach based on real-world constraints such as latency and data limitations.*

❓ The Questioner

🌉 Interdisciplinary Bridge — Deep Learning and Machine Learning and Natural Language Processing

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Siva Rajesh Kasa , Karan Gupta , Sumegh Roychowdhury , Ashutosh Kumar , Yaswanth Biruduraju , SANTHOSH KUMAR KASA , Pattisapu Nikhil Priyatam , Arindam Bhattacharya , Shailendra Agarwal , Vijay Huddar

Topics

Machine Learning > Core Methods > Classification Deep Learning > Architectures > Transformers Deep Learning > Techniques > Pretraining Natural Language Processing > Applications > Text Classification Machine Learning > Learning Types > Classification Deep Learning > Models > Transformers

Keywords

text classification model calibration sample complexity generative model discrete diffusion masked language modeling discriminative model discriminative classifier generative classifier

Download PDF

Related papers

Bit-Flip Error Resilience in LLMs: A Comprehensive Analysis and Defense Framework 2025

VoiceCraft-X: Unifying Multilingual, Voice-Cloning Speech Synthesis and Speech Editing 2025

Model-based Large Language Model Customization as Service 2025

ZoomEye: Enhancing Multimodal LLMs with Human-Like Zooming Capabilities through Tree-Based Image Exploration 2025

SlideCoder: Layout-aware RAG-enhanced Hierarchical Slide Generation from Design 2025