Csaba Szepesvári
158 papers
· 2007–2025
· 11 conferences
· across top CS/AI conferences
Achievements
🌍
Conference Polyglot
(11)
🗺️
Taxonomy Completionist
(48)
🧭
Keyword Pioneer
🐝
Cross-Pollinator
(10)
🏃
Academic Marathon
(18)
🐣
Hot Topic Early Bird
🌈
Renaissance Researcher
(8)
🌉
Interdisciplinary Bridge
🏠
Conference Loyalist
(56)
🌟
Keyword Trendsetter Combo
(3)
🌱
Topic Pioneer
👑
Triple Crown
🔬
Deep Specialist
(11)
🏆
Keyword Champion
(3)
🧬
Topic Evolution
🏆
Grand Slam
🤝
Dynamic Duo
(33)
❓
The Questioner
(3)
📈
Trend Setter
🚀
Conference Pioneer
🔥
Unstoppable
(19)
⚡
Prolific Year
(12)
💎
Century Club
(158)
🗃️
Keyword Collector
(201)
Conferences
NIPS (56)
ICML (37)
AISTATS (26)
COLT (14)
ALT (7)
JMLR (6)
UAI (4)
ICLR (3)
IJCAI (3)
AAAI (1)
L4DC (1)
Top co-authors
Keywords
regret bound
(49)
online learning
(31)
multi-armed bandit
(25)
markov decision process
(20)
stochastic optimization
(20)
reinforcement learning
(16)
sample complexity
(13)
linear function approximation
(12)
policy iteration
(9)
regret analysis
(8)
partial monitoring
(8)
function approximation
(8)
regret minimization
(7)
online algorithm
(7)
thompson sampling
(7)
value function
(7)
contextual bandit
(7)
learning to rank
(6)
policy optimization
(6)
stochastic bandit
(6)
Papers
Exploration via linearly perturbed loss minimisation
AISTATS 2024
Context-lumpable stochastic bandits
NIPS 2023
Stochastic Gradient Succeeds for Bandits
ICML 2023
Online Sparse Reinforcement Learning
AISTATS 2021
Adaptive Approximate Policy Iteration
AISTATS 2021
Meta-Thompson Sampling
ICML 2021
Randomized Exploration in Generalized Linear Bandits
AISTATS 2020
Adaptive Exploration in Linear Contextual Bandit
AISTATS 2020
Online Algorithm for Unsupervised Sensor Selection
AISTATS 2019
Online Learning to Rank with Features
ICML 2019
Stochastic Rank-1 Bandits
AISTATS 2017
Unsupervised Sequential Sensor Acquisition
AISTATS 2017
Bernoulli Rank-1 Bandits for Click Feedback
IJCAI 2017
Conservative Bandits
ICML 2016
Combinatorial Cascading Bandits
NIPS 2015
Toward Minimax Off-policy Value Estimation
AISTATS 2015
Universal Option Models
NIPS 2014
Online Learning under Delayed Feedback
ICML 2013
Characterizing the Representer Theorem
ICML 2013
-Armed Bandits
JMLR 2011
Regularized Policy Iteration
NIPS 2008
Online Optimization in X-Armed Bandits
NIPS 2008