← Applications

Reinforcement Learning › Applications ›

Game AI

511 directly classified papers

Papers per year

Papers

Player Movement Models for Video Game Level Generation IJCAI 2017

Emergent Behaviors in Mixed-Autonomy Traffic CORL 2017

End-to-end optimization of goal-driven and visually grounded dialogue systems IJCAI 2017

Stratified Strategy Selection for Unit Control in Real-Time Strategy Games IJCAI 2017

Safe and Nested Subgame Solving for Imperfect-Information Games NIPS 2017

A multi-agent reinforcement learning model of common-pool resource appropriation NIPS 2017

Multi-Agent Systems of Inverse Reinforcement Learners in Complex Games IJCAI 2017

A Unified Game-Theoretic Approach to Multiagent Reinforcement Learning NIPS 2017

Why Most Decisions Are Easy in Tetris—And Perhaps in Other Sequential Decision Problems, As Well ICML 2016

Deep Exploration via Bootstrapped DQN NIPS 2016

On the Analysis of Complex Backup Strategies in Monte Carlo Tree Search ICML 2016

On the Use of Non-Stationary Strategies for Solving Two-Player Zero-Sum Markov Games AISTATS 2016

Kernel Estimation and Model Combination in A Bandit Problem with Covariates JMLR 2016

Trust Region Policy Optimization ICML 2015

Multipolicy Decision-Making for Autonomous Driving via Changepoint-based Behavior Prediction RSS 2015

Approximate Modified Policy Iteration and its Application to the Game of Tetris JMLR 2015

Universal Option Models NIPS 2014

Deep Learning for Real-Time Atari Game Play Using Offline Monte-Carlo Tree Search Planning NIPS 2014

Reinforcement learning with value advice ACML 2014

Performance Bounds for λ Policy Iteration and Application to the Game of Tetris JMLR 2013

Approximate Dynamic Programming Finally Performs Well in the Game of Tetris NIPS 2013

Convergence of Monte Carlo Tree Search in Simultaneous Move Games NIPS 2013

Efficient Monte Carlo Counterfactual Regret Minimization in Games with Many Player Actions NIPS 2012

Predicting Dynamic Difficulty NIPS 2011

A reinterpretation of the policy oscillation phenomenon in approximate policy iteration NIPS 2011