Papers
Stable Policy Optimization via Off-Policy Divergence Regularization
Ahmed Touati, Amy Zhang, Joelle Pineau et al.
Static and Dynamic Values of Computation in MCTS
Eren Sezener, Peter Dayan
Statistically Efficient Greedy Equivalence Search
Max Chickering
Stochastic Variational Inference for Dynamic Correlated Topic Models
Federico Tomasi, Praveen Chandar, Gal Levy-Fix et al.
Streaming Nonlinear Bayesian Tensor Decomposition
Zhimeng Pan, Zheng Wang, Shandian Zhe
Structure Learning for Cyclic Linear Causal Models
Carlos Amendola, Philipp Dettling, Mathias Drton et al.
Submodular Bandit Problem Under Multiple Constraints
Sho Takemori, Masahiro Sato, Takashi Sonoda et al.
Symbolic Querying of Vector Spaces: Probabilistic Databases Meets Relational Embeddings
Tal Friedman, Guy Van den Broeck
Testing Goodness of Fit of Conditional Density Models with Kernels
Wittawat Jitkrittum, Heishiro Kanagawa, Bernhard Schölkopf
The Hawkes Edge Partition Model for Continuous-time Event-based Temporal Networks
Sikun Yang, Heinz Koeppl
The Indian Chefs Process
Patrick Dallaire, Luca Ambrogioni, Ludovic Trottier et al.
Time Series Analysis using a Kernel based Multi-Modal Uncertainty Decomposition Framework
Rishabh Singh, Jose Principe
Towards Threshold Invariant Fair Classification
Mingliang Chen, Min Wu
TX-Ray: Quantifying and Explaining Model-Knowledge Transfer in (Un-)Supervised NLP
Nils Rethmeier, Vageesh Kumar Saxena, Isabelle Augenstein
Unknown mixing times in apprenticeship and reinforcement learning
Tom Zahavy, Alon Cohen, Haim Kaplan et al.
Verifying Individual Fairness in Machine Learning Models
Philips George John, Deepak Vijaykeerthy, Diptikalyan Saha
Walking on Two Legs: Learning Image Segmentation with Noisy Labels
Guohua Cheng, Hongli Ji, Yan Tian
What You See May Not Be What You Get: UCB Bandit Algorithms Robust to $\varepsilon$-Contamination
Laura Niss, Ambuj Tewari
Zeroth Order Non-convex optimization with Dueling-Choice Bandits
Yichong Xu, Aparna Joshi, Aarti Singh et al.
A Bayesian Approach to Robust Reinforcement Learning
Esther Derman, Daniel Mankowitz, Timothy Mann et al.
Active Multi-Information Source Bayesian Quadrature
Alexandra Gessner, Javier Gonzalez, Maren Mahsereci
Adaptive Hashing for Model Counting
Jonathan Kuck, Tri Dao, Shengjia Zhao et al.
Adaptively Truncating Backpropagation Through Time to Control Gradient Bias
Christopher Aicher, Nicholas J. Foti, Emily B. Fox
Adaptivity and Optimality: A Universal Algorithm for Online Convex Optimization
Guanghui Wang, Shiyin Lu, Lijun Zhang
A Fast Proximal Point Method for Computing Exact Wasserstein Distance
Yujia Xie, Xiangfeng Wang, Ruijia Wang et al.