← Applications

Natural Language Processing › Applications ›

Natural Language Inference

918 directly classified papers

Papers per year

Papers

Large Language Models Put to the Test on Chinese Noun Compounds: Experiments on Natural Language Inference and Compound Semantics EACL 2026

Serbian SuperGLUE: Towards an Evaluation Benchmark for South Slavic Language Models EACL 2026

UTER: Capturing the Human Touch in Evaluating Morphologically Rich and Low-Resource Languages NAACL 2025

Improving Dialectal Slot and Intent Detection with Auxiliary Tasks: A Multi-Dialectal Bavarian Case Study COLING 2025

On Reference (In-)Determinacy in Natural Language Inference NAACL 2025

Optimizing Cost-Efficiency with LLM-Generated Training Data for Conversational Semantic Frame Analysis NAACL 2025

Deep-change at CoMeDi: the Cross-Entropy Loss is not All You Need COLING 2025

Pragmatic Theories Enhance Understanding of Implied Meanings in LLMs IJCNLP 2025

Exploiting Task Reversibility of DRS Parsing and Generation: Challenges and Insights from a Multi-lingual Perspective COLING 2025

Thai Winograd Schemas: A Benchmark for Thai Commonsense Reasoning COLING 2025

On A Scale From 1 to 5: Quantifying Hallucination in Faithfulness Evaluation NAACL 2025

Towards Long Context Hallucination Detection NAACL 2025

Machine Translation Metrics for Indigenous Languages Using Fine-tuned Semantic Embeddings NAACL 2025

Am I eligible? Natural Language Inference for Clinical Trial Patient Recruitment: the Patient’s Point of View NAACL 2025

Can AI Validate Science? Benchmarking LLMs on Claim →Evidence Reasoning in AI Papers IJCNLP 2025

Not Just a Piece of Cake: Cross-Lingual Fine-Tuning for Idiom Identification IJCNLP 2025

Zero-shot Slot Filling in the Age of LLMs for Dialogue Systems COLING 2025

CoMeDi Shared Task: Median Judgment Classification & Mean Disagreement Ranking with Ordinal Word-in-Context Judgments COLING 2025

Funzac at CoMeDi Shared Task: Modeling Annotator Disagreement from Word-In-Context Perspectives COLING 2025

MMLabUIT at CoMeDiShared Task: Text Embedding Techniques versus Generation-Based NLI for Median Judgment Classification COLING 2025

Detecting Inconsistencies in Narrative Elements of Cross Lingual Nakba Texts COLING 2025

Linking language model predictions to human behaviour on scalar implicatures COLING 2025

FinNLI: Novel Dataset for Multi-Genre Financial Natural Language Inference Benchmarking NAACL 2025

MorphNLI: A Stepwise Approach to Natural Language Inference Using Text Morphing NAACL 2025

Predicting Median, Disagreement and Noise Label in Ordinal Word-in-Context Data COLING 2025