Learning Rates for Q-learning

Eyal Even Dar; Yishay Mansour

JMLR 2003

Abstract

In this paper we derive convergence rates for Q-learning. We show an interesting relationship between the convergence rate and the learning rate used in Q-learning. For a polynomial learning rate, one which is 1/t^ω at time t where ω ∈ (1/2, 1), we show that the convergence rate is polynomial in 1/(1-γ), where γ is the discount factor. In contrast, we show that for a linear learning rate, one which is 1/t at time t, the convergence rate has an exponential dependence on 1/(1-γ). In addition, we show a simple example that proves this exponential behavior is inherent for linear learning rates.
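The two learning-rate schedules contrasted in the abstract can be illustrated with a minimal Python sketch. The toy problem here (one state, one action, a deterministic reward of 1) is an assumption chosen for illustration and is not claimed to be the example used in the paper; the function names `learning_rate` and `run_single_state_q_learning`, and the values γ = 0.9 and ω = 0.7, are likewise hypothetical. With a single action the max over actions in the Q-learning update reduces to the current estimate, and the fixed point is Q* = 1/(1-γ).

```python
def learning_rate(t, omega):
    """Step size 1 / t**omega for the t-th update (t >= 1).

    omega = 1 gives the linear schedule 1/t;
    omega in (1/2, 1) gives a polynomial schedule.
    """
    if t < 1:
        raise ValueError("t must be a positive integer")
    return 1.0 / (t ** omega)


def run_single_state_q_learning(gamma, omega, num_steps, reward=1.0, q_init=0.0):
    """Q-learning on a hypothetical one-state, one-action toy MDP.

    Update: Q <- (1 - alpha_t) * Q + alpha_t * (reward + gamma * Q),
    where the max over next actions is just Q because there is one action.
    """
    q = q_init
    for t in range(1, num_steps + 1):
        alpha = learning_rate(t, omega)
        q = (1.0 - alpha) * q + alpha * (reward + gamma * q)
    return q


if __name__ == "__main__":
    gamma = 0.9                   # assumed discount factor
    q_star = 1.0 / (1.0 - gamma)  # fixed point when the reward is 1
    for omega in (1.0, 0.7):      # linear vs. an assumed polynomial exponent
        q = run_single_state_q_learning(gamma, omega, num_steps=10_000)
        print(f"omega={omega}: |Q* - Q| = {abs(q_star - q):.4f}")
```

In this toy setting the error shrinks by a factor of (1 - α_t(1-γ)) at every step, so the remaining error after the same number of updates is noticeably larger under the linear schedule than under the polynomial one, which is in line with the qualitative contrast the abstract describes.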

Authors

Eyal Even Dar, Yishay Mansour

Topics

Machine Learning > Optimization & Theory > Learning Theory
Machine Learning > Optimization & Theory > Optimization
Mathematics & Optimization > Optimization > Stochastic Methods
Machine Learning > Learning Types > Reinforcement Learning

Keywords

reinforcement learning, discount factor, learning rate, convergence rate
