Rémi Munos
117 papers
· 2006–2025
· 7 conferences
· across top CS/AI conferences
Achievements
🗺️
Taxonomy Completionist
(43)
🌈
Renaissance Researcher
(9)
🧭
Keyword Pioneer
🐣
Hot Topic Early Bird
🌍
Conference Polyglot
(7)
🏃
Academic Marathon
(19)
🌉
Interdisciplinary Bridge
🐝
Cross-Pollinator
(13)
🌟
Keyword Trendsetter Combo
(6)
🏠
Conference Loyalist
(46)
🐺
Lone Wolf
(3)
🤝
Dynamic Duo
(35)
👑
Triple Crown
🌱
Topic Pioneer
🔬
Deep Specialist
(15)
🏆
Keyword Champion
💎
Century Club
(117)
🔥
Unstoppable
(20)
🗃️
Keyword Collector
(212)
📈
Trend Setter
🚀
Conference Pioneer
⚡
Prolific Year
(5)
Conferences
NIPS (46)
ICML (41)
JMLR (11)
AISTATS (10)
ICLR (7)
ACML (1)
COLT (1)
Top co-authors
Research topics
Keywords
reinforcement learning
(23)
multi-armed bandit
(17)
regret bound
(17)
markov decision process
(12)
value function
(11)
stochastic optimization
(10)
variance reduction
(9)
deep reinforcement learning
(9)
distributional reinforcement learning
(9)
sample complexity
(9)
value iteration
(7)
policy gradient
(7)
off-policy learning
(7)
online algorithm
(7)
policy optimization
(7)
representation learning
(6)
online learning
(6)
stratified sampling
(5)
game theory
(5)
nash equilibrium
(5)
Papers
Temporal Difference Flows
ICML 2025
Nash Learning from Human Feedback
ICML 2024
Quantile Credit Assignment
ICML 2023
Taylor Expansion of Discount Factors
ICML 2021
Taylor Expansion Policy Optimization
ICML 2020
Adaptive Trade-Offs in Off-Policy Learning
AISTATS 2020
Spectral bandits
JMLR 2020
Hindsight Credit Assignment
NIPS 2019
The Termination Critic
AISTATS 2019
Maximum a Posteriori Policy Optimisation
ICLR 2018
Optimistic optimization of a Brownian
NIPS 2018
Noisy Networks For Exploration
ICLR 2018
Learning to search with MCTSnets
ICML 2018
Cheap Bandits
ICML 2015
Toward Minimax Off-policy Value Estimation
AISTATS 2015
Active Regression by Stratification
NIPS 2014
Risk-Aversion in Multi-armed Bandits
NIPS 2012
Optimistic planning for Markov decision processes
AISTATS 2012
Speedy Q-Learning
NIPS 2011
Sparse Recovery with Brownian Sensing
NIPS 2011
-Armed Bandits
JMLR 2011
LSTD with Random Projections
NIPS 2010
Compressed Least-Squares Regression
NIPS 2009
Online Optimization in X-Armed Bandits
NIPS 2008
Policy Gradient in Continuous Time
JMLR 2006