Research Explorer

MultiAgentBench : Evaluating the Collaboration and Competition of LLM agents

Kunlun Zhu, Hongyi Du, Zhaochen Hong et al.

2025 ACL

LocAgent: Graph-Guided LLM Agents for Code Localization

Zhaoling Chen, Robert Tang, Gangda Deng et al.

2025 ACL

NexusSum: Hierarchical LLM Agents for Long-Form Narrative Summarization

Hyuntak Kim, Byung-Hak Kim

2025 ACL

GuideBench: Benchmarking Domain-Oriented Guideline Following for LLM Agents

Lingxiao Diao, Xinyue Xu, Wanxuan Sun et al.

2025 ACL

Enhancing Interpretable Image Classification Through LLM Agents and Conditional Concept Bottleneck Models

Yiwen Jiang, Deval Mehta, Wei Feng et al.

2025 ACL

Caution for the Environment: Multimodal LLM Agents are Susceptible to Environmental Distractions

Xinbei Ma, Yiting Wang, Yao Yao et al.

2025 ACL

Multiple LLM Agents Debate for Equitable Cultural Alignment

Dayeon Ki, Rachel Rudinger, Tianyi Zhou et al.

2025 ACL

LLM Agents Making Agent Tools

Georg Wölflein, Dyke Ferber, Daniel Truhn et al.

2025 ACL

The Task Shield: Enforcing Task Alignment to Defend Against Indirect Prompt Injection in LLM Agents

Feiran Jia, Tong Wu, Xin Qin et al.

2025 ACL

LEAP & LEAN: Look-ahead Planning and Agile Navigation for LLM Agents

Nikhil Verma, Manasa Bharadwaj

2025 ACL

A Parallelized Framework for Simulating Large-Scale LLM Agents with Realistic Environments and Interactions

Jun Zhang, Yuwei Yan, Junbo Yan et al.

2025 ACL

Exploring Multi-Modal Data with Tool-Augmented LLM Agents for Precise Causal Discovery

ChengAo Shen, Zhengzhang Chen, Dongsheng Luo et al.

2025 ACL

Nuclear Deployed!: Analyzing Catastrophic Risks in Decision-making of Autonomous LLM Agents

Rongwu Xu, Xiaojian Li, Shuo Chen et al.

2025 ACL

Beyond Numeric Rewards: In-Context Dueling Bandits with LLM Agents

Fanzeng Xia, Hao Liu, Yisong Yue et al.

2025 ACL

LLM Agents for Coordinating Multi-User Information Gathering

Harsh Jhamtani, Jacob Andreas, Benjamin Van Durme

2025 ACL

TReMu: Towards Neuro-Symbolic Temporal Reasoning for LLM-Agents with Memory in Multi-Session Dialogues

Yubin Ge, Salvatore Romeo, Jason Cai et al.

2025 ACL

A Joint Optimization Framework for Enhancing Efficiency of Tool Utilization in LLM Agents

Bin Wu, Edgar Meij, Emine Yilmaz

2025 ACL

Dynamic Personality in LLM Agents: A Framework for Evolutionary Modeling and Behavioral Analysis in the Prisoner’s Dilemma

Weiqi Zeng, Bo Wang, Dongming Zhao et al.

2025 ACL

The Behavior Gap: Evaluating Zero-shot LLM Agents in Complex Task-Oriented Dialogs

Avinash Baidya, Kamalika Das, Xiang Gao

2025 ACL

PLAY2PROMPT: Zero-shot Tool Instruction Optimization for LLM Agents via Tool Play

Wei Fang, Yang Zhang, Kaizhi Qian et al.

2025 ACL

TableWise at SemEval-2025 Task 8: LLM Agents for TabQA

Harsh Bansal, Aman Raj, Akshit Sharma et al.

2025 ACL

QleverAnswering-PUCRS at SemEval-2025 Task 8: Exploring LLM agents, code generation and correction for Table Question Answering

André Bergmann Lisboa, Lucas Cardoso Azevedo, Lucas Rafael Costella Pessutto

2025 ACL

ALYMPICS: LLM Agents Meet Game Theory

Shaoguang Mao, Yuzhe Cai, Yan Xia et al.

2025 COLING

Editable Scene Simulation for Autonomous Driving via Collaborative LLM-Agents

Yuxi Wei, Zi Wang, Yifan Lu et al.

2024 CVPR

LLM Agents in Interaction: Measuring Personality Consistency and Linguistic Alignment in Interacting Populations of Large Language Models

Ivar Frisch, Mario Giulianelli

2024 EACL

Papers