RegNLI: Detecting Online Product Misbranding through Legal and Linguistic Alignment

Diya Saha; Abhishek Bharadwaj Varanasi; Tirthankar Dasgupta; Manjira Sinha

2026 EACL EACL 2026

RegNLI: Detecting Online Product Misbranding through Legal and Linguistic Alignment

Abstract

AbstractMisbranding of health-related products poses significant risks to public safety and regulatory compliance. Existing approaches to claim verification largely rely on keyword matching or generic text classification, failing to capture the nuanced reasoning required to align product claims with legal statutes. In this work, we introduce RegNLI, a novel framework that formulates misbranding detection as a inference task between product claims and regulatory provisions. Leveraging a curated dataset of FDA warning letters, we construct structured representations of claims and statutes. Our model integrates a regulation-aware gating mechanism with a contrastive alignment objective to jointly optimize misbranding classification and statute mapping. Experiments on the FDA-Misbrand dataset demonstrate that RegNLI significantly outperforms strong baselines across accuracy, F1-score, and regulation alignment metrics, while providing interpretable attention patterns that highlight critical linguistic cues. This work establishes a foundation for compliance-aware NLP systems and opens new directions for integrating formal reasoning with neural architectures in regulatory domains.

🧭 Keyword Pioneer — misbranding detection

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Security & Privacy, Speech & Audio

Authors

Diya Saha , Abhishek Bharadwaj Varanasi , Tirthankar Dasgupta , Manjira Sinha

Topics

Natural Language Processing > Understanding > Semantic Analysis Natural Language Processing > Applications > Text Classification

Keywords

semantic analysis contrastive alignment attention pattern regulatory compliance misbranding detection statute mapping

Download PDF

Related papers

Investigating Gender Stereotypes in Large Language Models via Social Determinants of Health 2026

A Benchmark for Audio Reasoning Capabilities of Multimodal Large Language Models 2026

InfiGUIAgent: A Multimodal Generalist GUI Agent with Native Reasoning and Reflection 2026

Generative Personality Simulation via Theory-Informed Structured Interview 2026

Word Surprisal Correlates with Sentential Contradiction in LLMs 2026