Papers

1,286 papers found
MPO: Boosting LLM Agents with Meta Plan Optimization
Weimin Xiong, Yifan Song, Qingxiu Dong et al.
2025 EMNLP
Agent Laboratory: Using LLM Agents as Research Assistants
Samuel Schmidgall, Yusheng Su, Ze Wang et al.
2025 EMNLP
2025 EMNLP
LLM Agents for Education: Advances and Applications
Zhendong Chu, Shen Wang, Jian Xie et al.
2025 EMNLP
Do LLM Agents Have Regret? A Case Study in Online Learning and Games
Chanwoo Park, Xiangyu Liu, Asuman E. Ozdaglar et al.
2025 ICLR
Robotouille: An Asynchronous Planning Benchmark for LLM Agents
Gonzalo Gonzalez-Pumariega, Leong Su Yean, Neha Sunkara et al.
2025 ICLR
Moral Alignment for LLM Agents
Elizaveta Tennant, Stephen Hailes, Mirco Musolesi
2025 ICLR
AgentHarm: A Benchmark for Measuring Harmfulness of LLM Agents
Maksym Andriushchenko, Alexandra Souly, Mateusz Dziemian et al.
2025 ICLR
2025 ICLR
2025 ICLR
BadRobot: Jailbreaking Embodied LLM Agents in the Physical World
Hangtao Zhang, Chenyu Zhu, Xianlong Wang et al.
2025 ICLR
2025 ICLR
EDGE: Efficient Data Selection for LLM Agents via Guideline Effectiveness
Yunxiao Zhang, Guanming Xiong, Haochen Li et al.
2025 IJCAI
AgentQuest: A Modular Benchmark Framework to Measure Progress and Improve LLM Agents
Luca Gioacchini, Giuseppe Siracusano, Davide Sanvito et al.
2024 NAACL