Research Explorer
Papers
Conferences
Authors
Topics
Keywords
Trends
Achievements
Explore
← Resources & Methods
Natural Language Processing
›
Resources & Methods
›
Natural Language Inference
680 directly classified papers
Papers per year
2006: 1
2012: 1
2016: 2
2017: 12
2018: 30
2019: 73
2020: 63
2021: 73
2022: 81
2023: 105
2024: 125
2025: 93
2026: 21
Papers
Socrates or Smartypants: Testing Logic Reasoning Capabilities of Large Language Models with Logic Programming-Based Test Oracles
AAAI 2026
A Position Paper on Toxic Reasoning: Grounding Categories of Toxic Language in Implications and Attitudes
EACL 2026
Knowing What’s Missing: Assessing Information Sufficiency in Question Answering
EACL 2026
Sample-Size Scaling of the African Languages NLI Evaluation
EACL 2026
Learning to Disentangle Latent Reasoning Rules with Language VAEs: A Systematic Study
AAAI 2026
AdaMCoT: Rethinking Cross-Lingual Factual Reasoning Through Adaptive Multilingual Chain-of-Thought
AAAI 2026
Mary, the Cheeseburger-Eating Vegetarian: Do LLMs Recognize Incoherence in Narratives?
EACL 2026
FOL-Traces: Verified First-Order Logic Reasoning Traces at Scale
EACL 2026
When LLMs Annotate: Reliability Challenges in Low-Resource NLI
EACL 2026
PTEB: Towards Robust Text Embedding Evaluation via Stochastic Paraphrasing at Evaluation Time with LLMs
EACL 2026
Is Word Sense Disambiguation Dead in the LLM Era?
AAAI 2026
In-Situ Eval: A Modular Framework for Custom and Real-Time RAG Benchmarking
AAAI 2026
A Novel Retrieve-Read-Group Paradigm for Open Knowledge Base Canonicalization
AAAI 2026
Judging by the Rules: Compliance-Aligned Framework for Modern Slavery Statement Monitoring
AAAI 2026
Can Reasoning Help Large Language Models Capture Human Annotator Disagreement?
EACL 2026
DETECT: Determining Ease and Textual Clarity of German Text Simplifications
EACL 2026
LLMs as Cultural Archives: Cultural Commonsense Knowledge Graph Extraction
EACL 2026
User-Centric Evidence Ranking for Attribution and Fact Verification
EACL 2026
Thunder-NUBench: A Benchmark for LLMs’ Sentence-Level Negation Understanding
EACL 2026
Improving the OOD Performance of Closed-Source LLMs on NLI Through Strategic Data Selection
EACL 2026
One Language, Three of Its Voices: Evaluating Multilingual LLMs Across Persian, Dari, and Tajiki on Translation and Understanding Tasks
EACL 2026
Co-Eval: Augmenting LLM-based Evaluation with Machine Metrics
EMNLP 2025
Less for More: Enhanced Feedback-aligned Mixed LLMs for Molecule Caption Generation and Fine-Grained NLI Evaluation
ACL 2025
Rule Discovery for Natural Language Inference Data Generation Using Out-of-Distribution Detection
EMNLP 2025
Beyond Accuracy: Revisiting Out-of-Distribution Generalization in NLI Models
ACL 2025
<
1
2
3
4
5
…
28
>