2008 NIPS NeurIPS 2008

On the asymptotic equivalence between differential Hebbian and temporal difference learning using a local third factor

Abstract

In this theoretical contribution we provide mathematical proof that two of the most important classes of network learning - correlation-based differential Hebbian learning and reward-based temporal difference learning - are asymptotically equivalent when timing the learning with a local modulatory signal. This opens the opportunity to consistently reformulate most of the abstract reinforcement learning framework from a correlation based perspective that is more closely related to the biophysics of neurons.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Interdisciplinary and Machine Learning and Reinforcement Learning
📈 Trend Setter — Agent Systems
🧭 Keyword Pioneer — differential hebbian learning
🐝 Cross-Pollinator — Artificial Intelligence, Data Science & Analytics, Interdisciplinary, Machine Learning, Mathematics & Optimization, Reinforcement Learning, Robotics
🐣 Hot Topic Early Bird — reinforcement learning