Sample Complexity of Kernel-Based Q-Learning

Sing-Yuan Yeh; Fu-Chieh Chang; Chang-Wei Yueh; Pei-Yuan Wu; Alberto Bernacchia; Sattar Vakili

2023 AISTATS AISTATS 2023

Sample Complexity of Kernel-Based Q-Learning

Abstract

Modern reinforcement learning (RL) often faces an enormous state-action space. Existing analytical results are typically for settings with a small number of state-actions, or simple models such as linearly modeled Q functions. To derive statistically efficient RL policies handling large state-action spaces, with more general Q functions, some recent works have considered nonlinear function approximation using kernel ridge regression. In this work, we derive sample complexities for kernel based Q-learning when a generative model exists. We propose a non-parametric Q-learning algorithm which finds an $\varepsilon$-optimal policy in an arbitrarily large scale discounted MDP. The sample complexity of the proposed algorithm is order optimal with respect to $\varepsilon$ and the complexity of the kernel (in terms of its information gain). To the best of our knowledge, this is the first result showing a finite sample complexity under such a general model.

🌉 Interdisciplinary Bridge — Machine Learning and Reinforcement Learning

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Sing-Yuan Yeh , Fu-Chieh Chang , Chang-Wei Yueh , Pei-Yuan Wu , Alberto Bernacchia , Sattar Vakili

Topics

Machine Learning > Optimization & Theory > Statistical Learning Reinforcement Learning > Applications > Value Iteration Machine Learning > Learning Types > Reinforcement Learning Machine Learning > Core Methods > Kernel Methods

Keywords

reinforcement learning function approximation sample complexity markov decision process kernel ridge regression non-parametric method kernel methods

Download PDF

Related papers

Safe Sequential Testing and Effect Estimation in Stratified Count Data 2023

Who Should Predict? Exact Algorithms For Learning to Defer to Humans 2023

An Online and Unified Algorithm for Projection Matrix Vector Multiplication with Application to Empirical Risk Minimization 2023

Stochastic Gradient Descent-Ascent: Unified Theory and New Efficient Methods 2023

The Ordered Matrix Dirichlet for State-Space Models 2023