Papers
1,286 papers found
MemGuide: Intent-Driven Memory Selection for Goal-Oriented Multi-Session LLM Agents
Yiming Du, Bingbing Wang, Yang He et al.
When Instinct Guides and Insight Grounds: Staged RL Training for LLM Agents
Zijing Zhang, Boning Zhang
SARA: Leveraging LLM Agents and Jurisprudential Ontologies for Automated Legal Reasoning
Francisco C J Bonfim, Sara Pessoa Silva, Alicia S Neves et al.
Physics-Informed Autonomous LLM Agents for Explainable Power Electronics Modulation Design
Junhua Liu, Fanfan Lin, Xinze Li et al.
RefLens: End-to-End Evidence-Grounded Citation Verification with LLM Agents
SeungHoo Lee, JuneHyoung Kwon, Jooweon Choi et al.
AriGraph: Learning Knowledge Graph World Models with Episodic Memory for LLM Agents
Petr Anokhin, Nikita Semenov, Artyom Sorokin et al.
Can Graph Learning Improve Planning in LLM-based Agents?
Xixi Wu, Yifei Shen, Caihua Shan et al.
Richelieu: Self-Evolving LLM-Based Agents for AI Diplomacy
Zhenyu Guan, Xiangyu Kong, Fangwei Zhong et al.
OPEx: A Component-Wise Analysis of LLM-Centric Agents in Embodied Instruction Following
Haochen Shi, Zhiyuan Sun, Xingdi Yuan et al.
AXIS: Efficient Human-Agent-Computer Interaction with API-First LLM-Based Agents
Junting Lu, Zhiyang Zhang, Fangkai Yang et al.
Embracing Imperfection: Simulating Students with Diverse Cognitive Levels Using LLM-based Agents
Tao Wu, Jingyuan Chen, Wang Lin et al.
ChatSOP: An SOP-Guided MCTS Planning Framework for Controllable LLM Dialogue Agents
Zhigen Li, Jianxiang Peng, Yanmeng Wang et al.
Benchmarking LLMs and LLM-based Agents in Practical Vulnerability Detection for Code Repositories
Alperen Yildiz, Sin G Teo, Yiling Lou et al.
Can a Large Language Model Keep My Secrets? A Study on LLM-Controlled Agents
Niklas Hemken, Sai Koneru, Florian Jacob et al.
A Survey of LLM-based Agents in Medicine: How far are we from Baymax?
Wenxuan Wang, Zizhan Ma, Zheng Wang et al.
MemBench: Towards More Comprehensive Evaluation on the Memory of LLM-based Agents
Haoran Tan, Zeyu Zhang, Chen Ma et al.
StateAct: Enhancing LLM Base Agents via Self-prompting and State-tracking
Nikolai Rozanov, Marek Rei
ComfyBench: Benchmarking LLM-based Agents in ComfyUI for Autonomously Designing Collaborative AI Systems
Xiangyuan Xue, Zeyu Lu, Di Huang et al.
An Evaluation Mechanism of LLM-based Agents on Manipulating APIs
Bing Liu, Zhou Jianxiang, Dan Meng et al.
TrustAgent: Towards Safe and Trustworthy LLM-based Agents
Wenyue Hua, Xianjun Yang, Mingyu Jin et al.
FlowBench: Revisiting and Benchmarking Workflow-Guided Planning for LLM-based Agents
Ruixuan Xiao, Wentao Ma, Ke Wang et al.
Beyond Demographics: Aligning Role-playing LLM-based Agents Using Human Belief Networks
Yun-Shiuan Chuang, Krirk Nirunwiroj, Zach Studdiford et al.
SPARK: Simulating the Co-evolution of Stance and Topic Dynamics in Online Discourse with LLM-based Agents
Bowen Zhang, Yi Yang, Fuqiang Niu et al.