AutoDSPy: Automating Modular Prompt Design with Reinforcement Learning for Small and Large Language Models

Nafew Azim; Abrar Ur Alam; Hasan Bin Omar; Abdullah Mohammad Muntasir Adnan Jami; Jawad Ibn Ahad; Muhammad Rafsan Kabir; Md. Ismail Hossain; Fuad Rahman; Mohammad Ruhul Amin; Shafin Rahman; Nabeel Mohammed

2025 EMNLP EMNLP 2025

AutoDSPy: Automating Modular Prompt Design with Reinforcement Learning for Small and Large Language Models

Abstract

AbstractLarge Language Models (LLMs) excel at complexreasoning tasks, yet their performance hinges on the quality of their prompts and pipeline structures. Manual promptdesign, as used in frameworks like DSPy, poses significantlimitations: it is time-intensive, demands substantial expertise,and lacks scalability, restricting the widespread use of LLMsacross diverse applications. To overcome these challenges, weintroduce AutoDSPy, the first framework to fully automateDSPy pipeline construction using reinforcement learning (RL).AutoDSPy leverages an RL-tuned policy network to dynamicallyselect optimal reasoning modules—such as Chain-of-Thought forlogical tasks or ReAct for tool integration—along with inputoutput signatures and execution strategies, entirely eliminatingthe need for manual configuration. Experimental results on theGSM8K and HotPotQA benchmarks demonstrate that AutoDSPyoutperforms traditional DSPy baselines, achieving accuracy gainsof up to 4.3% while reducing inference time, even with smallermodels like GPT-2 (127M). By integrating RL-based automation,AutoDSPy enhances both efficiency and accessibility, simplifyingthe development of structured, high-performing LLM solutionsand enabling scalability across a wide range of tasks

🌉 Interdisciplinary Bridge — Artificial Intelligence and Machine Learning and Natural Language Processing

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Nafew Azim , Abrar Ur Alam , Hasan Bin Omar , Abdullah Mohammad Muntasir Adnan Jami , Jawad Ibn Ahad , Muhammad Rafsan Kabir , Md. Ismail Hossain , Fuad Rahman , Mohammad Ruhul Amin , Shafin Rahman , Nabeel Mohammed

Topics

Artificial Intelligence > Core AI > Planning Machine Learning > Optimization & Theory > Optimization Machine Learning > Learning Types > Reinforcement Learning Artificial Intelligence > Core AI > Large Language Models Natural Language Processing > Resources & Methods > Language Modeling Machine Learning > Learning Types > Prompt Engineering Natural Language Processing > Resources & Methods > Prompt Engineering

Keywords

reinforcement learning prompt engineering prompt design chain of thought automated pipeline modular design modular prompt large language model reasoning module modular reasoning automated prompt

Download PDF

Related papers

Bit-Flip Error Resilience in LLMs: A Comprehensive Analysis and Defense Framework 2025

VoiceCraft-X: Unifying Multilingual, Voice-Cloning Speech Synthesis and Speech Editing 2025

Model-based Large Language Model Customization as Service 2025

ZoomEye: Enhancing Multimodal LLMs with Human-Like Zooming Capabilities through Tree-Based Image Exploration 2025

SlideCoder: Layout-aware RAG-enhanced Hierarchical Slide Generation from Design 2025