Papers
638 papers found
Fair Algorithms for Multi-Agent Multi-Armed Bandits
Safwan Hossain, Evi Micha, Nisarg Shah
Asynchronous Actor-Critic for Multi-Agent Reinforcement Learning
Yuchen Xiao, Weihao Tan, Christopher Amato
ELIGN: Expectation Alignment as a Multi-Agent Intrinsic Reward
Zixian Ma, Rose Wang, Fei-Fei Li et al.
A Closer Look at Offline RL Agents
Yuwei Fu, Di Wu, Benoit Boulet
Task-Agnostic Graph Explanations
Yaochen Xie, Sumeet Katariya, Xianfeng Tang et al.
Envy-free Policy Teaching to Multiple Agents
Jiarui Gan, R Majumdar, Adish Singla et al.
Peer Prediction for Learning Agents
Shi Feng, Fang-Yi Yu, Yiling Chen
Multi-agent Dynamic Algorithm Configuration
Ke Xue, Jiacheng Xu, Lei Yuan et al.
Online Agnostic Multiclass Boosting
Vinod Raman, Ambuj Tewari
Cross-Episodic Curriculum for Transformer Agents
Lucy Xiaoyang Shi, Yunfan Jiang, Jake Grigsby et al.
Agnostic Multi-Group Active Learning
Nicholas Rittler, Kamalika Chaudhuri
Language Model Alignment with Elastic Reset
Michael Noukhovitch, Samuel Lavoie, Florian Strub et al.
How2comm: Communication-Efficient and Collaboration-Pragmatic Multi-Agent Perception
Dingkang Yang, Kun Yang, Yuzheng Wang et al.
Mind2Web: Towards a Generalist Agent for the Web
Xiang Deng, Yu Gu, Boyuan Zheng et al.
Latent Space Translation via Semantic Alignment
Valentino Maiorca, Luca Moschella, Antonio Norelli et al.
The Waymo Open Sim Agents Challenge
Nico Montali, John Lambert, Paul Mougin et al.
Hierarchical Multi-Agent Skill Discovery
Mingyu Yang, Yaodong Yang, Zhenbo Lu et al.
Star-Agents: Automatic Data Optimization with LLM Agents for Instruction Tuning
Hang Zhou, Yehui Tang, Haochen Qin et al.
Periodic agent-state based Q-learning for POMDPs
Amit Sinha, Matthieu Geist, Aditya Mahajan
AgentBoard: An Analytical Evaluation Board of Multi-turn LLM Agents
Chang Ma, Junlei Zhang, Zhihao Zhu et al.
GTA: A Benchmark for General Tool Agents
Jize Wang, Zerun Ma, Yining Li et al.
Contracting with a Learning Agent
Guru Guruganesh, Yoav Kolumbus, Jon Schneider et al.
On the Effects of Data Scale on UI Control Agents
Wei Li, William Bishop, Alice Li et al.
Sample-Efficient Agnostic Boosting
Udaya Ghai, Karan Singh
Elliptical Attention
Stefan K. Nielsen, Laziz U. Abdullaev, Rachel S.Y. Teo et al.