2025
ACL
ACL 2025
HalluSearch at SemEval-2025 Task 3: A Search-Enhanced RAG Pipeline for Hallucination Detection
Abstract
AbstractWe present HalluSearch, a multilingual pipeline designed to detect fabricated text spans in Large Language Model (LLM) outputs as part of Mu-SHROOM. HalluSearch couples retrieval-augmented verification with fine-grained factual splitting to identify and localize hallucinations in 14 different languages. Empirical evaluations show that HalluSearch performs competitively, placing fourth in both English (within the top 10%) and Czech. While the system’s retrieval-based strategy generally proves robust, it faces challenges in languages with limited online coverage, underscoring the need for further research to ensure consistent hallucination detection across diverse linguistic contexts.
🐝
Cross-Pollinator
— Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Speech & Audio
🌉
Interdisciplinary Bridge
— Artificial Intelligence and Machine Learning and Natural Language Processing
🧭
Keyword Pioneer
— factual splitting
Authors
Topics
Artificial Intelligence > Core AI > Foundation Models
Artificial Intelligence > Core AI > Interpretability
Artificial Intelligence > Core AI > Responsible AI
Machine Learning > Application Areas > Domain Adaptation
Machine Learning > Application Areas > Privacy
Natural Language Processing > Applications > Fact-Checking
Natural Language Processing > Resources & Methods > Knowledge Editing
Natural Language Processing > Resources & Methods > Large Language Models
Natural Language Processing > Resources & Methods > Multilingual NLP
Keywords
retrieval augmented generation
fact verification
factual accuracy
multilingual nlp
information retrieval
retrieval-augmented generation
hallucination detection
factual verification
multilingual system
span detection
large language model
retrieval-augmented verification
factual splitting
fabricated text detection