Papers
1,286 papers found
DERA: Enhancing Large Language Model Completions with Dialog-Enabled Resolving Agents
Varun Nair, Elliot Schumacher, Geoffrey Tso et al.
Properties and Challenges of LLM-Generated Explanations
Jenny Kunz, Marco Kuhlmann
Improving Retrospective Language Agents via Joint Policy Gradient Optimization
Xueyang Feng, Bo Lan, Quanyu Dai et al.
SELFGOAL: Your Language Agents Already Know How to Achieve High-level Goals
Ruihan Yang, Jiangjie Chen, Yikai Zhang et al.
LLM-Based Explicit Models of Opponents for Multi-Agent Games
XiaoPeng Yu, Wanpeng Zhang, Zongqing Lu
Revealing the Barriers of Language Agents in Planning
Jian Xie, Kexun Zhang, Jiangjie Chen et al.
Can Unconfident LLM Annotations Be Used for Confident Conclusions?
Kristina Gligoric, Tijana Zrnic, Cinoo Lee et al.
Towards Rationality in Language and Multimodal Agents: A Survey
Bowen Jiang, Yangxinyu Xie, Xiaomeng Wang et al.
TurkingBench: A Challenge Benchmark for Web Agents
Kevin Xu, Yeganeh Kordi, Tanay Nayak et al.
MeNTi: Bridging Medical Calculator and LLM Agent with Nested Tool Calling
Yakun Zhu, Shaohang Wei, Xu Wang et al.
Hello Again! LLM-powered Personalized Agent for Long-term Dialogue
Hao Li, Chenghao Yang, An Zhang et al.
My LLM might Mimic AAE - But When Should It?
Sandra Camille Sandoval, Christabel Acquaye, Kwesi Adu Cobbina et al.
Arabic Dataset for LLM Safeguard Evaluation
Yasser Ashraf, Yuxia Wang, Bin Gu et al.
SPeCtrum: A Grounded Framework for Multidimensional Identity Representation in LLM-Based Agent
Keyeun Lee, Seo Hyeong Kim, Seolhee Lee et al.
LLMs as Meta-Reviewers’ Assistants: A Case Study
Eftekhar Hossain, Sanjeev Kumar Sinha, Naman Bansal et al.
Towards Lifelong Dialogue Agents via Timeline-based Memory Management
Kai Tzu-iunn Ong, Namyoung Kim, Minju Gwak et al.
SLM-Mod: Small Language Models Surpass LLMs at Content Moderation
Xianyang Zhan, Agam Goyal, Yilun Chen et al.
MASTER: A Multi-Agent System with LLM Specialized MCTS
Bingzheng Gan, Yufan Zhao, Tianyi Zhang et al.
WorkTeam: Constructing Workflows from Natural Language with Multi-Agents
Hanchao Liu, Rongjun Li, Weimin Xiong et al.
LLM Safety for Children
Prasanjit Rath, Hari Shrawgi, Parag Agrawal et al.
RxLens: Multi-Agent LLM-powered Scan and Order for Pharmacy
Akshay Jagatap, Srujana Merugu, Prakash Mandayam Comar
Foundation Models Meet Embodied Agents
Manling Li, Yunzhu Li, Jiayuan Mao et al.
Social Intelligence in the Age of LLMs
Hao Zhu, Bodhisattwa Prasad Majumder, Dirk Hovy et al.
GenSim: A General Social Simulation Platform with Large Language Model based Agents
Jiakai Tang, Heyang Gao, Xuchen Pan et al.
LiteWebAgent: The Open-Source Suite for VLM-Based Web-Agent Applications
Danqing Zhang, Balaji Rama, Jingyi Ni et al.