RaFe: Ranking Feedback Improves Query Rewriting for RAG

Shengyu Mao; Yong Jiang; Boli Chen; Xiao Li; Peng Wang; Xinyu Wang; Pengjun Xie; Fei Huang; Huajun Chen; Ningyu Zhang

2024 EMNLP EMNLP 2024

RaFe: Ranking Feedback Improves Query Rewriting for RAG

Abstract

AbstractAs Large Language Models (LLMs) and Retrieval Augmentation Generation (RAG) techniques have evolved, query rewriting has been widely incorporated into the RAG system for downstream tasks like open-domain QA to enhance document retrieval by reformulating queries. Many works have attempted to improve query rewriting in smaller models to avoid rewriting with costly LLMs, and the most common method is to employ reinforcement learning for feedback training. However, current methods require annotations (labeled relevant documents or downstream answers) or predesigned rewards for feedback, lack generalization, and fail to utilize signals tailored for query rewriting. In this paper, we propose RaFe, a framework for training query rewriting models. By leveraging reranker, RaFe provides ranking feedback aligned well with the rewriting objectives without needing signals from annotations and supports both online and offline training models. Experimental results demonstrate that with a general and publicly available reranker, RaFe can effectively steer the training for rewrite models.

🌉 Interdisciplinary Bridge — Machine Learning and Natural Language Processing

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Shengyu Mao , Yong Jiang , Boli Chen , Xiao Li , Peng Wang , Xinyu Wang , Pengjun Xie , Fei Huang , Huajun Chen , Ningyu Zhang

Topics

Machine Learning > Application Areas > Knowledge Distillation Natural Language Processing > Applications > Information Retrieval Machine Learning > Learning Types > Reinforcement Learning

Keywords

reinforcement learning retrieval augmentation query rewriting feedback training

Download PDF

Related papers

EmbodiedBERT: Cognitively Informed Metaphor Detection Incorporating Sensorimotor Information 2024

Mitigating Matthew Effect: Multi-Hypergraph Boosted Multi-Interest Self-Supervised Learning for Conversational Recommendation 2024

Learning to Extract Structured Entities Using Language Models 2024

Towards Understanding Jailbreak Attacks in LLMs: A Representation Space Analysis 2024

CSSL: Contrastive Self-Supervised Learning for Dependency Parsing on Relatively Free Word Ordered and Morphologically Rich Low Resource Languages 2024