Papers
357 papers found
Bandit Convex Optimization in Non-stationary Environments
Peng Zhao, Guanghui Wang, Lijun Zhang et al.
Bandit Learning in Decentralized Matching Markets
Lydia T. Liu, Feng Ruan, Horia Mania et al.
Adaptation to the Range in K-Armed Bandits
Hédi Hadiji, Gilles Stoltz
Continuous-in-time Limit for Bayesian Bandits
Yuhua Zhu, Zachary Izzo, Lexing Ying
Fast Rates in Pool-Based Batch Active Learning
Claudio Gentile, Zhilei Wang, Tong Zhang
Score-Based Diffusion Models in Function Space
Jae Hyun Lim, Nikola B. Kovachki, Ricardo Baptista et al.
Competing Bandits in Time Varying Matching Markets
Deepan Muthirayan, Chinmay Maheshwari, Pramod Khargonekar et al.
Extrapolation in NLP
Jeff Mitchell, Pontus Stenetorp, Pasquale Minervini et al.
Do Multilingual Language Models Think Better in English?
Julen Etxaniz, Gorka Azkune, Aitor Soroa et al.
Revealing the Barriers of Language Agents in Planning
Jian Xie, Kexun Zhang, Jiangjie Chen et al.
Reverse Modeling in Large Language Models
Sicheng Yu, Xu Yuanchen, Cunxiao Du et al.
Exploring Backward Reasoning in Large Language Models
Leonardo Ranaldi, Giulia Pucci
Blind Swarms for Coverage in 2-D
Vin de Silva, Robert Ghrist, Abubakr Muhammad
Shape-Based Compliance in Locomotion
Matt Travers, Julian Whitman, Perrin Schiebel et al.
GPU-Based Max Flow Maps in the Plane
Renato Farias, Marcelo Kallmann
Reducing Exploration of Dying Arms in Mortal Bandits
Stefano Tracà, Cynthia Rudin, Weiyu Yan
Semi-bandit Optimization in the Dispersed Setting
Maria-Florina Balcan, Travis Dick, Wesley Pegden
Testification of Condorcet Winners in Dueling Bandits
Björn Haddenhorst, Viktor Bengs, Jasmin Brandt et al.
Class Balancing GAN with a Classifier in the Loop
Harsh Rangwani, Konda Reddy Mopuri, R. Venkatesh Babu
CORe: Capitalizing On Rewards in Bandit Exploration
Nan Wang, Branislav Kveton, Maryam Karimzadehgan
Face Identity-Aware Disentanglement in StyleGAN
Adrian Suwała, Bartosz Wójcik, Magdalena Proszewska et al.
Bandit Based Attention Mechanism in Vision Transformers
Amartya Roy Chowdhury, Raghuram Bharadwaj Diddigi, Prabuchandran K J et al.
Using Subword-Embeddings for Bilingual Lexicon Induction in Bantu Languages
Adrian Breiding, Alan Akbik
BanglaLlama: LLaMA for Bangla Language
Abdullah Khan Zehady, Shubhashis Roy Dipta, Naymul Islam et al.
BanglaIPA: Towards Robust Text-to-IPA Transcription with Contextual Rewriting in Bengali
Jakir Hasan, Shrestha Datta, Md Saiful Islam et al.