2025 NAACL NAACL 2025

RAD-Bench: Evaluating Large Language Models’ Capabilities in Retrieval Augmented Dialogues