Granite-Function Calling Model: Introducing Function Calling Abilities via Multi-task Learning of Granular Tasks

Ibrahim Abdelaziz; Kinjal Basu; Mayank Agarwal; Sadhana Kumaravel; Matthew Stallone; Rameswar Panda; Yara Rizk; G P Shrivatsa Bhargav; Maxwell Crouse; Chulaka Gunasekara; Shajith Ikbal; Sachindra Joshi; Hima Karanam; Vineet Kumar; Asim Munawar; Sumit Neelam; Dinesh Raghu; Udit Sharma; Adriana Meza Soria; Dheeraj Sreedhar; Praveen Venkateswaran; Merve Unuvar; David Daniel Cox; Salim Roukos; Luis A. Lastras; Pavan Kapanipathi

EMNLP 2024

Abstract

An emergent research trend explores the use of Large Language Models (LLMs) as the backbone of agentic systems (e.g., SWE-Bench, Agent-Bench). To fulfill LLMs’ potential as autonomous agents, they must be able to identify, call, and interact with a variety of external tools and application program interfaces (APIs). This capability of LLMs, commonly termed function calling, leads to a myriad of advantages such as access to current and domain-specific information in databases and the outsourcing of tasks that can be reliably performed by tools. In this work, we introduce Granite-20B-FunctionCalling, a model trained using a multi-task training approach on seven fundamental tasks encompassed in function calling. Our comprehensive evaluation on multiple out-of-domain datasets, which compares Granite-20B-FunctionCalling to more than 15 of the best proprietary and open models, shows that Granite-20B-FunctionCalling has better generalizability on multiple tasks across seven different evaluation benchmarks. Moreover, Granite-20B-FunctionCalling shows the best performance among all open models and ranks among the top on the Berkeley Function Calling Leaderboard (BFCL).
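
To make the term "function calling" concrete, the sketch below shows the general pattern the abstract describes: a model emits a structured call naming a tool and its arguments, and a host program looks up and runs that tool. This is only an illustration under stated assumptions. The JSON layout, the `TOOLS` registry, and the `get_weather`, `add`, and `dispatch` helpers are hypothetical and are not the prompt or output format used by Granite-20B-FunctionCalling.

```python
import json

# Hypothetical tools; names and signatures are illustrative only.
def get_weather(city: str) -> str:
    """Stand-in for an external API call."""
    return f"Sunny in {city}"

def add(a: float, b: float) -> float:
    """Stand-in for a task that a tool performs reliably."""
    return a + b

# Registry mapping tool names to callables (an assumed design, not from the paper).
TOOLS = {"get_weather": get_weather, "add": add}

def dispatch(model_output: str):
    """Parse a JSON function call and run the matching tool.

    Assumes the model emits {"name": ..., "arguments": {...}}.
    """
    call = json.loads(model_output)
    name = call["name"]
    if name not in TOOLS:
        raise KeyError(f"unknown tool: {name}")
    return TOOLS[name](**call.get("arguments", {}))

# A string such as a model might produce when asked to add two numbers.
result = dispatch('{"name": "add", "arguments": {"a": 2, "b": 3}}')
print(result)
```

In a real system the tool descriptions would also be passed to the model in the prompt so that it can choose among them; that step is omitted here.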

Topics

Artificial Intelligence > Core AI > Agent Systems

Natural Language Processing > Resources & Methods > Large Language Models

Artificial Intelligence > Core AI > Large Language Models

Machine Learning > Learning Types > Multi-Agent Systems

Deep Learning > Learning Types > Multi-Task Learning

Keywords

multi-task learning; tool use; agent system; llm agent; function calling; external tool; large language model; api interaction
