Papers
938 papers found
Learning in online MDPs: is there a price for handling the communicating case?
Gautam Chandrasekaran, Ambuj Tewari
Learning Nonlinear Causal Effect via Kernel Anchor Regression
Wenqi Shi, Wenkai Xu
Learning robust representation for reinforcement learning with distractions by reward sequence prediction
Qi Zhou, Jie Wang, Qiyuan Liu et al.
Learning To Invert: Simple Adaptive Attacks for Gradient Inversion in Federated Learning
Ruihan Wu, Xiangyu Chen, Chuan Guo et al.
Learning to reason about contextual knowledge for planning under uncertainty
Cheng Cui, Saeid Amiri, Yan Ding et al.
Lifelong bandit optimization: no prior and no regret
Felix Schur, Parnian Kassraie, Jonas Rothfuss et al.
Locally Regularized Sparse Graph by Fast Proximal Gradient Descent
Dongfang Sun, Yingzhen Yang
Local Message Passing on Frustrated Systems
Luca Schmid, Joshua Brenk, Laurent Schmalen
Logit-based ensemble distribution distillation for robust autoregressive sequence uncertainties
Yassir Fathullah, Guoxuan Xia, Mark J. F. Gales
Loosely consistent emphatic temporal-difference learning
Jiamin He, Fengdi Che, Yi Wan et al.
Low-rank matrix recovery with unknown correspondence
Zhiwei Tang, Tsung-Hui Chang, Xiaojing Ye et al.
Massively parallel reweighted wake-sleep
Thomas Heap, Gavin Leech, Laurence Aitchison
Maximizing submodular functions under submodular constraints
Madhavan R. Padmanabhan, Yanhui Zhu, Samik Basu et al.
MDPose: real-time multi-person pose estimation via mixture density model
Seunghyeon Seo, Jaeyoung Yoo, Jihye Hwang et al.
Memory Mechanism for Unsupervised Anomaly Detection
Jiahao Li, Yiqiang Chen, Yunbing Xing
Meta-learning Control Variates: Variance Reduction with Limited Data
Zhuo Sun, Chris J Oates, François-Xavier Briol
MFA: Multi-layer Feature-aware Attack for Object Detection
Wen Chen, Yushan Zhang, Zhiheng Li et al.
Mitigating Transformer Overconfidence via Lipschitz Regularization
Wenqian Ye, Yunsheng Ma, Xu Cao et al.
Mixture of Normalizing Flows for European Option Pricing
Yongxin Yang, Timothy M. Hospedales
MixupE: Understanding and improving Mixup from directional derivative perspective
Yingtian Zou, Vikas Verma, Sarthak Mittal et al.
MMEL: A Joint Learning Framework for Multi-Mention Entity Linking
Chengmei Yang, Bowei He, Yimeng Wu et al.
Mnemonist: Locating Model Parameters that Memorize Training Examples
Ali Shahin Shamsabadi, Jamie Hayes, Borja Balle et al.
Modified Retrace for Off-Policy Temporal Difference Learning
Xingguo Chen, Xingzhou Ma, Yang Li et al.
Molecule Design by Latent Space Energy-Based Modeling and Gradual Distribution Shifting
Deqian Kong, Bo Pang, Tian Han et al.
Monte-Carlo Search for an Equilibrium in Dec-POMDPs
Yang You, Vincent Thomas, Francis Colas et al.