Bellman Error Based Feature Generation using Random Projections on Sparse Spaces

Mahdi Milani Fard; Yuri Grinberg; Amir-massoud Farahmand; Joelle Pineau; Doina Precup

2013 NIPS NeurIPS 2013

Bellman Error Based Feature Generation using Random Projections on Sparse Spaces

Abstract

This paper addresses the problem of automatic generation of features for value function approximation in reinforcement learning. Bellman Error Basis Functions (BEBFs) have been shown to improve the error of policy evaluation with function approximation, with a convergence rate similar to that of value iteration. We propose a simple, fast and robust algorithm based on random projections, which generates BEBFs for sparse feature spaces. We provide a finite sample analysis of the proposed method, and prove that projections logarithmic in the dimension of the original space guarantee a contraction in the error. Empirical results demonstrate the strength of this method in domains in which choosing a good state representation is challenging.

🌉 Interdisciplinary Bridge — Machine Learning and Reinforcement Learning

🧭 Keyword Pioneer — feature generation

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

📈 Trend Setter — Value Iteration

🐣 Hot Topic Early Bird — reinforcement learning

Authors

Mahdi Milani Fard , Yuri Grinberg , Amir-massoud Farahmand , Joelle Pineau , Doina Precup

Topics

Machine Learning > Core Methods > Representation Learning Machine Learning > Optimization & Theory > Optimization Reinforcement Learning > Methods > Deep RL Reinforcement Learning > Applications > Robotics Machine Learning > Learning Types > Reinforcement Learning Machine Learning > Core Methods > Feature Learning Reinforcement Learning > Methods > Value Iteration Deep Learning > Learning Types > Reinforcement Learning

Keywords

reinforcement learning feature learning policy evaluation sparse coding value function approximation bellman error feature generation sparse feature spaces random projection

Download PDF

Related papers

Latent Structured Active Learning 2013

On Flat versus Hierarchical Classification in Large-Scale Taxonomies 2013

Generalized Method-of-Moments for Rank Aggregation 2013

Third-Order Edge Statistics: Contour Continuation, Curvature, and Cortical Connections 2013

Accelerated Mini-Batch Stochastic Dual Coordinate Ascent 2013