Research Explorer

A Practical Method for Generating String Counterfactuals

Matan Avitan, Ryan Cotterell, Yoav Goldberg et al.

2025 NAACL

A Preliminary Study on NLP-Based Personalized Support for Type 1 Diabetes Management

Sandra Mitrović, Federico Fontana, Andrea Zignoli et al.

2025 NAACL

A Probabilistic Framework for LLM Hallucination Detection via Belief Tree Propagation

Bairu Hou, Yang Zhang, Jacob Andreas et al.

2025 NAACL

Arabic Dataset for LLM Safeguard Evaluation

Yasser Ashraf, Yuxia Wang, Bin Gu et al.

2025 NAACL

A Recipe of Parallel Corpora Exploitation for Multilingual Large Language Models

Peiqin Lin, Andre Martins, Hinrich Schuetze

2025 NAACL

Are explicit belief representations necessary? A comparison between Large Language Models and Bayesian probabilistic models

Dingyi Pan, Ben Bergen

2025 NAACL

Are Language Models Agnostic to Linguistically Grounded Perturbations? A Case Study of Indic Languages

Poulami Ghosh, Raj Dabre, Pushpak Bhattacharyya

2025 NAACL

Are Larger Language Models Better at Disambiguation?

Ziyuan Cao, William Schuler

2025 NAACL

Are LLM-Judges Robust to Expressions of Uncertainty? Investigating the effect of Epistemic Markers on LLM-based Evaluation

Dongryeol Lee, Yerin Hwang, Yongil Kim et al.

2025 NAACL

Are Multimodal LLMs Robust Against Adversarial Perturbations? RoMMath: A Systematic Evaluation on Multimodal Math Reasoning

Yilun Zhao, Guo Gan, Chengye Wang et al.

2025 NAACL

Are Small Language Models Ready to Compete with Large Language Models for Practical Applications?

Neelabh Sinha, Vinija Jain, Aman Chadha

2025 NAACL

Are We Done with MMLU?

Aryo Pradipta Gema, Joshua Ong Jun Leang, Giwon Hong et al.

2025 NAACL

Argumentation in political empowerment on Instagram

Aenne Knierim, Ulrich Heid

2025 NAACL

ARISE: Iterative Rule Induction and Synthetic Data Generation for Text Classification

Yaswanth M, Vaibhav Singh, Ayush Maheshwari et al.

2025 NAACL

Artificial Relationships in Fiction: A Dataset for Advancing NLP in Literary Domains

Despina Christou, Grigorios Tsoumakas

2025 NAACL

ARWI: Arabic Write and Improve

Kirill Chirkunov, Bashar Alhafni, Chatrine Qwaider et al.

2025 NAACL

As easy as PIE: understanding when pruning causes language models to disagree

Pietro Tropeano, Maria Maistro, Tuukka Ruotsalo et al.

2025 NAACL

A Sentence-Level Visualization of Attention in Large Language Models

Seongbum Seo, Sangbong Yoo, Hyelim Lee et al.

2025 NAACL

Ask Optimal Questions: Aligning Large Language Models with Retriever’s Preference in Conversation

Chanwoong Yoon, Gangwoo Kim, Byeongguk Jeon et al.

2025 NAACL

ASRank: Zero-Shot Re-Ranking with Answer Scent for Document Retrieval

Abdelrahman Abdallah, Jamshid Mozafari, Bhawna Piryani et al.

2025 NAACL

AssertionBench: A Benchmark to Evaluate Large-Language Models for Assertion Generation

Vaishnavi Pulavarthi, Deeksha Nandal, Soham Dan et al.

2025 NAACL

Assessing Crowdsourced Annotations with LLMs: Linguistic Certainty as a Proxy for Trustworthiness

Tianyi Li, Divya Sree, Tatiana Ringenberg

2025 NAACL

Assessing LLMs for Zero-shot Abstractive Summarization Through the Lens of Relevance Paraphrasing

Hadi Askari, Anshuman Chhabra, Muhao Chen et al.

2025 NAACL

Assessing the Reliability and Validity of GPT-4 in Annotating Emotion Appraisal Ratings

Deniss Ruder, Andero Uusberg, Kairit Sirts

2025 NAACL

Assessing the State of the Art in Scene Segmentation

Albin Zehe, Elisabeth Fischer, Andreas Hotho

2025 NAACL

Papers