2024
EMNLP
EMNLP 2024
Query Routing for Homogeneous Tools: An Instantiation in the RAG Scenario
Abstract
AbstractCurrent research on tool learning primarily focuses on selecting the most effective tool from a wide array of options, often overlooking cost-effectiveness, a crucial factor in human problem-solving. In this paper, we address query routing for homogeneous tools by predicting both their performance and the associated cost required to accomplish a given task. We then assign queries to the optimal tools in a cost-effective manner. Our experimental results demonstrate that our method achieves higher performance at a lower cost compared to strong baseline approaches.
🌉
Interdisciplinary Bridge
— Artificial Intelligence and Deep Learning and Machine Learning and Natural Language Processing
🧭
Keyword Pioneer
— cost-effective routing
🐝
Cross-Pollinator
— Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio
Authors
Topics
Artificial Intelligence > Core AI > Agent Systems
Artificial Intelligence > Core AI > Planning
Artificial Intelligence > Learning Paradigms > Meta-Learning
Machine Learning > Optimization & Theory > Optimization
Machine Learning > Optimization & Theory > Online Algorithms
Artificial Intelligence > Core AI > Reasoning
Machine Learning > Learning Types > Multi-Armed Bandits
Machine Learning > Learning Types > Retrieval-Augmented Generation
Natural Language Processing > Generation > Retrieval-Augmented Generation
Natural Language Processing > Resources & Methods > Retrieval-Augmented Generation
Deep Learning > Learning Types > Retrieval-Augmented Generation