Research Explorer
Papers
Conferences
Authors
Topics
Keywords
Trends
Achievements
Explore
← Learning Types
Deep Learning
›
Learning Types
›
Reinforcement Learning
1263 directly classified papers
Papers per year
2006: 1
2007: 2
2008: 3
2009: 2
2010: 1
2011: 2
2012: 3
2013: 2
2014: 3
2015: 2
2016: 8
2017: 44
2018: 95
2019: 134
2020: 123
2021: 131
2022: 143
2023: 127
2024: 194
2025: 240
2026: 3
Papers
Deep Communicating Agents for Abstractive Summarization
NAACL 2018
Improving Reinforcement Learning Based Image Captioning with Natural Language Prior
EMNLP 2018
Learning a Policy for Opportunistic Active Learning
EMNLP 2018
Improving Neural Abstractive Document Summarization with Explicit Information Selection Modeling
EMNLP 2018
Prediction Improves Simultaneous Neural Machine Translation
EMNLP 2018
Thread Popularity Prediction and Tracking with a Permutation-invariant Model
EMNLP 2018
CaLcs: Continuously Approximating Longest Common Subsequence for Sequence Level Optimization
EMNLP 2018
Discriminative Deep Dyna-Q: Robust Planning for Dialogue Policy Learning
EMNLP 2018
Learning End-to-End Goal-Oriented Dialog with Multiple Answers
EMNLP 2018
AirDialogue: An Environment for Goal-Oriented Dialogue Research
EMNLP 2018
Paraphrase Generation with Deep Reinforcement Learning
EMNLP 2018
Greedy Search with Probabilistic N-gram Matching for Neural Machine Translation
EMNLP 2018
A Reinforcement Learning-driven Translation Model for Search-Oriented Conversational Systems
EMNLP 2018
Curriculum Learning Based on Reward Sparseness for Deep Reinforcement Learning of Task Completion Dialogue Management
EMNLP 2018
Approximate Dynamic Oracle for Dependency Parsing with Reinforcement Learning
EMNLP 2018
Semi-Supervised QA with Generative Domain-Adaptive Nets
ACL 2017
From Language to Programs: Bridging Reinforcement Learning and Maximum Marginal Likelihood
ACL 2017
Search-based Neural Structured Learning for Sequential Question Answering
ACL 2017
Composite Task-Completion Dialogue Policy Learning via Hierarchical Deep Reinforcement Learning
EMNLP 2017
Maximum Margin Reward Networks for Learning from Explicit and Implicit Supervision
EMNLP 2017
A Joint Speaker-Listener-Reinforcer Model for Referring Expressions
CVPR 2017
Self-Critical Sequence Training for Image Captioning
CVPR 2017
PoseAgent: Budget-Constrained 6D Object Pose Estimation via Reinforcement Learning
CVPR 2017
Deep 360 Pilot: Learning a Deep Agent for Piloting Through 360deg Sports Videos
CVPR 2017
Reinforced Video Captioning with Entailment Rewards
EMNLP 2017
<
1
…
47
48
49
50
51
>