EKRAG: Benchmark RAG for Enterprise Knowledge Question Answering

Tan Yu; Wenfei Zhou; Lei Yang; Aaditya Shukla; Meenakshi Madugula; Pritam Gundecha; Nick Burnett; Anbang Xu; Vishal Seth; Tamar Bar; Rama Akkiraju; Vivienne Zhang

2025 NAACL NAACL 2025

EKRAG: Benchmark RAG for Enterprise Knowledge Question Answering

Abstract

AbstractRetrieval-augmented generation (RAG) offers a robust solution for developing enterprise internal virtual assistants by leveraging domain-specific knowledge and utilizing information from frequently updated corporate document repositories. In this work, we introduce the Enterprise-Knowledge RAG (EKRAG) dataset to benchmark RAG for enterprise knowledge question-answering (QA) across a diverse range of corporate documents, such as product releases, technical blogs, and financial reports. Using EKRAG, we systematically evaluate various retrieval models and strategies tailored for corporate content. We propose novel embedding-model (EM)-as-judge and ranking-model (RM)-as-judge approaches to assess answer quality in the context of enterprise information. Combining these with the existing LLM-as-judge method, we then comprehensively evaluate the correctness, relevance, and faithfulness of generated answers to corporate queries. Our extensive experiments shed light on optimizing RAG pipelines for enterprise knowledge QA, providing valuable guidance for practitioners. This work contributes to enhancing information retrieval and question-answering capabilities in corporate environments that demand high degrees of factuality and context-awareness.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Computer Science and Natural Language Processing

🧭 Keyword Pioneer — enterprise knowledge

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Tan Yu , Wenfei Zhou , Lei Yang , Aaditya Shukla , Meenakshi Madugula , Pritam Gundecha , Nick Burnett , Anbang Xu , Vishal Seth , Tamar Bar , Rama Akkiraju , Vivienne Zhang

Topics

Natural Language Processing > Applications > Question Answering Computer Science > Applications > Information Retrieval Artificial Intelligence > Core AI > Large Language Models

Keywords

question answering retrieval-augmented generation embedding model answer faithfulness enterprise knowledge

Download PDF

Few-shot Personalization of LLMs with Mis-aligned Responses 2025

NLI under the Microscope: What Atomic Hypothesis Decomposition Reveals 2025

Understanding Figurative Meaning through Explainable Visual Entailment 2025

CogLM: Tracking Cognitive Development of Large Language Models 2025

EKRAG: Benchmark RAG for Enterprise Knowledge Question Answering

Abstract

Authors

Topics

Keywords

Related papers