Research Explorer
Papers
Conferences
Authors
Topics
Keywords
Trends
Achievements
Explore
← Applications
Natural Language Processing
›
Applications
›
Natural Language Inference
918 directly classified papers
Papers per year
2012: 1
2013: 1
2014: 1
2015: 2
2016: 12
2017: 40
2018: 55
2019: 103
2020: 87
2021: 73
2022: 155
2023: 121
2024: 116
2025: 149
2026: 2
Papers
Large Language Models Put to the Test on Chinese Noun Compounds: Experiments on Natural Language Inference and Compound Semantics
EACL 2026
Serbian SuperGLUE: Towards an Evaluation Benchmark for South Slavic Language Models
EACL 2026
UTER: Capturing the Human Touch in Evaluating Morphologically Rich and Low-Resource Languages
NAACL 2025
Improving Dialectal Slot and Intent Detection with Auxiliary Tasks: A Multi-Dialectal Bavarian Case Study
COLING 2025
On Reference (In-)Determinacy in Natural Language Inference
NAACL 2025
Optimizing Cost-Efficiency with LLM-Generated Training Data for Conversational Semantic Frame Analysis
NAACL 2025
Deep-change at CoMeDi: the Cross-Entropy Loss is not All You Need
COLING 2025
Pragmatic Theories Enhance Understanding of Implied Meanings in LLMs
IJCNLP 2025
Exploiting Task Reversibility of DRS Parsing and Generation: Challenges and Insights from a Multi-lingual Perspective
COLING 2025
Thai Winograd Schemas: A Benchmark for Thai Commonsense Reasoning
COLING 2025
On A Scale From 1 to 5: Quantifying Hallucination in Faithfulness Evaluation
NAACL 2025
Towards Long Context Hallucination Detection
NAACL 2025
Machine Translation Metrics for Indigenous Languages Using Fine-tuned Semantic Embeddings
NAACL 2025
Am I eligible? Natural Language Inference for Clinical Trial Patient Recruitment: the Patient’s Point of View
NAACL 2025
Can AI Validate Science? Benchmarking LLMs on Claim →Evidence Reasoning in AI Papers
IJCNLP 2025
Not Just a Piece of Cake: Cross-Lingual Fine-Tuning for Idiom Identification
IJCNLP 2025
Zero-shot Slot Filling in the Age of LLMs for Dialogue Systems
COLING 2025
CoMeDi Shared Task: Median Judgment Classification & Mean Disagreement Ranking with Ordinal Word-in-Context Judgments
COLING 2025
Funzac at CoMeDi Shared Task: Modeling Annotator Disagreement from Word-In-Context Perspectives
COLING 2025
MMLabUIT at CoMeDiShared Task: Text Embedding Techniques versus Generation-Based NLI for Median Judgment Classification
COLING 2025
Detecting Inconsistencies in Narrative Elements of Cross Lingual Nakba Texts
COLING 2025
Linking language model predictions to human behaviour on scalar implicatures
COLING 2025
FinNLI: Novel Dataset for Multi-Genre Financial Natural Language Inference Benchmarking
NAACL 2025
MorphNLI: A Stepwise Approach to Natural Language Inference Using Text Morphing
NAACL 2025
Predicting Median, Disagreement and Noise Label in Ordinal Word-in-Context Data
COLING 2025
<
1
2
3
4
5
…
37
>