Research Explorer
Papers
Conferences
Authors
Topics
Keywords
Trends
Achievements
Explore
← Applications
Reinforcement Learning
›
Applications
›
Robotics
2069 directly classified papers
Papers per year
2005: 4
2006: 13
2007: 13
2008: 14
2009: 4
2010: 14
2011: 15
2012: 20
2013: 20
2014: 18
2015: 14
2016: 22
2017: 53
2018: 84
2019: 166
2020: 215
2021: 282
2022: 268
2023: 342
2024: 254
2025: 193
2026: 41
Papers
Safe Exploration in Finite Markov Decision Processes with Gaussian Processes
NIPS 2016
Adaptive optimal training of animal behavior
NIPS 2016
A forward model at Purkinje cell synapses facilitates cerebellar anticipatory control
NIPS 2016
Hierarchical Relative Entropy Policy Search
JMLR 2016
Hierarchical Decision Making In Electricity Grid Management
ICML 2016
Continuous Deep Q-Learning with Model-based Acceleration
ICML 2016
Why Most Decisions Are Easy in Tetris—And Perhaps in Other Sequential Decision Problems, As Well
ICML 2016
Control of Memory, Active Perception, and Action in Minecraft
ICML 2016
Benchmarking Deep Reinforcement Learning for Continuous Control
ICML 2016
Guided Cost Learning: Deep Inverse Optimal Control via Policy Optimization
ICML 2016
End-to-End Training of Deep Visuomotor Policies
JMLR 2016
Time and Energy Optimal Path Planning in General Flows
RSS 2016
Routing Autonomous Vehicles in Congested Transportation Networks: Structural Properties and Coordination Algorithms
RSS 2016
Robust Phase-Space Planning for Agile Legged Locomotion over Various Terrain Topologies
RSS 2016
Combined Optimization and Reinforcement Learning for Manipulation Skills
RSS 2016
Representing and Learning Complex Object Interactions
RSS 2016
Geometric Swimming on a Granular Surface
RSS 2016
Design and Implementation of a Novel Robot Fish with Active and Compliant Propulsion Mechanism
RSS 2016
Shape-Based Compliance in Locomotion
RSS 2016
Closed Loop Control of a Tethered Magnetic Capsule Endoscope
RSS 2016
Action-Conditional Video Prediction using Deep Networks in Atari Games
NIPS 2015
Learning Continuous Control Policies by Stochastic Value Gradients
NIPS 2015
Learning of Non-Parametric Control Policies with High-Dimensional State Features
AISTATS 2015
Modelling Policies in MDPs in Reproducing Kernel Hilbert Space
AISTATS 2015
Safe Policy Search for Lifelong Reinforcement Learning with Sublinear Regret
ICML 2015
<
1
…
76
77
78
…
83
>