Papers
1,286 papers found
On Evaluating the Integration of Reasoning and Action in LLM Agents with Database Question Answering
Linyong Nan, Ellen Zhang, Weijin Zou et al.
CRMArena: Understanding the Capacity of LLM Agents to Perform Professional CRM Tasks in Realistic Environments
Kung-Hsiang Huang, Akshara Prabhakar, Sidharth Dhawan et al.
AI-LieDar: Examine the Trade-off Between Utility and Truthfulness in LLM Agents
Zhe Su, Xuhui Zhou, Sanketh Rangreji et al.
CSR-Bench: Benchmarking LLM Agents in Deployment of Computer Science Research Repositories
Yijia Xiao, Runhui Wang, Luyang Kong et al.
Adapting LLM Agents with Universal Communication Feedback
Kuan Wang, Yadong Lu, Michael Santacroce et al.
Adaptive Attacks Break Defenses Against Indirect Prompt Injection Attacks on LLM Agents
Qiusi Zhan, Richard Fang, Henil Shalin Panchal et al.
Hypothesis Generation for Materials Discovery and Design Using Goal-Driven and Constraint-Guided LLM Agents
Shrinidhi Kumbhar, Venkatesh Mishra, Kevin Coutinho et al.
Self Knowledge-Tracing for Tool Use (SKT-Tool): Helping LLM Agents Understand Their Capabilities in Tool Use
Joshua Vigel, Renpei Cai, Eleanor Chen et al.
TableWise at SemEval-2025 Task 8: LLM Agents for TabQA
Harsh Bansal, Aman Raj, Akshit Sharma et al.
QleverAnswering-PUCRS at SemEval-2025 Task 8: Exploring LLM agents, code generation and correction for Table Question Answering
André Bergmann Lisboa, Lucas Cardoso Azevedo, Lucas Rafael Costella Pessutto
Teams of LLM Agents can Exploit Zero-Day Vulnerabilities
Yuxuan Zhu, Antony Kellermann, Akul Gupta et al.
H-MEM: Hierarchical Memory for High-Efficiency Long-Term Reasoning in LLM Agents
Haoran Sun, Shaoning Zeng, Bob Zhang
Automating Android Build Repair: Bridging the Reasoning-Execution Gap in LLM Agents with Domain-Specific Tools
Ha Min Son, Huan Ren, Xin Liu et al.
Beyond Blind Following: Evaluating Robustness of LLM Agents under Imperfect Guidance
Yao Fu, Ran Qiu, Xinhe Wang et al.
Communication Enables Cooperation in LLM Agents: A Comparison with Curriculum-Based Approaches
Hachem Madmoun, Salem Lahlou
PersonaTrace: Synthesizing Realistic Digital Footprints with LLM Agents
Minjia Wang, Yunfeng Wang, Xiao Ma et al.
Beyond IVR: Benchmarking Customer Support LLM Agents for Business-Adherence
Sumanth Balaji, Piyush Mishra, Aashraya Sachdeva et al.
SIRAJ: Diverse and Efficient Red-Teaming for LLM Agents via Distilled Structured Reasoning
Kaiwen Zhou, Ahmed Elgohary, A S M Iftekhar et al.
Label-Consistent Data Generation for Aspect-Based Sentiment Analysis Using LLM Agents
Mohammad Hossein Akbari Monfared, Lucie Flek, Akbar Karimi
RAG-Enhanced Collaborative LLM Agents for Drug Discovery
Namkyeong Lee, Edward De Brouwer, Ehsan Hajiramezanali et al.
AgentSense: Virtual Sensor Data Generation Using LLM Agents in Simulated Home Environments
Zikang Leng, Megha Thukral, Yaqi Liu et al.
Investigating Prosocial Behavior Theory in LLM Agents Under Policy-Induced Inequities
Yujia Zhou, Hexi Wang, Qingyao Ai et al.
MAGIC: Mastering Physical Adversarial Generation in Context Through Collaborative LLM Agents
Yun Xing, Nhat Chung, Jie Zhang et al.
Conformal Constrained Policy Optimization for Cost-Effective LLM Agents
Wenwen Si, Sooyong Jang, Insup Lee et al.
DEPO: Dual-Efficiency Preference Optimization for LLM Agents
Sirui Chen, Mengshi Zhao, Lei Xu et al.