Papers
357 papers found
Bandit Linear Control
Asaf Cassel, Tomer Koren
Inference for Batched Bandits
Kelly Zhang, Lucas Janson, Susan Murphy
Optimal Best-arm Identification in Linear Bandits
Yassir Jedra, Alexandre Proutiere
Model Selection in Contextual Stochastic Bandit Problems
Aldo Pacchiano, My Phan, Yasin Abbasi Yadkori et al.
Adapting to Misspecification in Contextual Bandits
Dylan J Foster, Claudio Gentile, Mehryar Mohri et al.
Supermasks in Superposition
Mitchell Wortsman, Vivek Ramanujan, Rosanne Liu et al.
Batched Coarse Ranking in Multi-Armed Bandits
Nikolai Karpov, Qin Zhang
Pruning Filter in Filter
Fanxu Meng, Hao Cheng, Ke Li et al.
Choice Bandits
Arpit Agarwal, Nicholas Johnson, Shivani Agarwal
Finding All $\epsilon$-Good Arms in Stochastic Bandits
Blake Mason, Lalit Jain, Ardhendu Tripathy et al.
A Gang of Adversarial Bandits
Mark Herbster, Stephen Pasteris, Fabio Vitale et al.
Learning Equilibria in Matching Markets from Bandit Feedback
Meena Jagadeesan, Alexander Wei, Yixin Wang et al.
Pure Exploration in Kernel and Neural Bandits
Yinglun Zhu, Dongruo Zhou, Ruoxi Jiang et al.
Beyond Bandit Feedback in Online Multiclass Classification
Dirk van der Hoeven, Federico Fusco, Nicolò Cesa-Bianchi
Label Disentanglement in Partition-based Extreme Multilabel Classification
Xuanqing Liu, Wei-Cheng Chang, Hsiang-Fu Yu et al.
Transformer in Transformer
Kai Han, An Xiao, Enhua Wu et al.
Bandit Phase Retrieval
Tor Lattimore, Botao Hao
Off-Policy Risk Assessment in Contextual Bandits
Audrey Huang, Liu Leqi, Zachary Lipton et al.
No Regrets for Learning the Prior in Bandits
Soumya Basu, Branislav Kveton, Manzil Zaheer et al.
Post-Contextual-Bandit Inference
Aurelien Bibaut, Maria Dimakopoulou, Nathan Kallus et al.
Effective Dimension in Bandit Problems under Censorship
Gauthier Guinet, Saurabh Amin, Patrick Jaillet
Handcrafted Backdoors in Deep Neural Networks
Sanghyun Hong, Nicholas Carlini, Alexey Kurakin
Matching in Multi-arm Bandit with Collision
YiRui Zhang, Siwei Wang, Zhixuan Fang
Learning in Congestion Games with Bandit Feedback
Qiwen Cui, Zhihan Xiong, Maryam Fazel et al.
Near-Optimal Collaborative Learning in Bandits
Clémence Réda, Sattar Vakili, Emilie Kaufmann