Reinforcement Learning

767 directly classified papers

Papers

PrefPaint: Aligning Image Inpainting Diffusion Model with Human Preference NIPS 2024

P2BPO: Permeable Penalty Barrier-Based Policy Optimization for Safe RL AAAI 2024

Interpretable Preferences via Multi-Objective Reward Modeling and Mixture-of-Experts EMNLP 2024

TRIP NEGOTIATOR: A Travel Persona-aware Reinforced Dialogue Generation Model for Personalized Integrative Negotiation in Tourism EMNLP 2024

Reward Modeling Requires Automatic Adjustment Based on Data Quality EMNLP 2024

π-Light: Programmatic Interpretable Reinforcement Learning for Resource-Limited Traffic Signal Control AAAI 2024

Balance Reward and Safety Optimization for Safe Reinforcement Learning: A Perspective of Gradient Manipulation AAAI 2024

Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint ACL 2024

Not All Preference Pairs Are Created Equal: A Recipe for Annotation-Efficient Iterative Preference Learning EMNLP 2024

ToolPlanner: A Tool Augmented LLM for Multi Granularity Instructions with Path Planning and Feedback EMNLP 2024

Exploration via linearly perturbed loss minimisation AISTATS 2024

Active Reinforcement Learning for Robust Building Control AAAI 2024

Coffee-Gym: An Environment for Evaluating and Improving Natural Language Feedback on Erroneous Code EMNLP 2024

Semi-Supervised Reward Modeling via Iterative Self-Training EMNLP 2024

Teaching Embodied Reinforcement Learning Agents: Informativeness and Diversity of Language Use EMNLP 2024

On the Statistical Efficiency of Mean-Field Reinforcement Learning with General Function Approximation AISTATS 2024

Enhancing Reinforcement Learning with Dense Rewards from Language Model Critic EMNLP 2024

Handling Long and Richly Constrained Tasks through Constrained Hierarchical Reinforcement Learning AAAI 2024

Virtual Action Actor-Critic Framework for Exploration (Student Abstract) AAAI 2024

AlignSAM: Aligning Segment Anything Model to Open Context via Reinforcement Learning CVPR 2024

StablePrompt: Automatic Prompt Tuning using Reinforcement Learning for Large Language Model EMNLP 2024

E2CL: Exploration-based Error Correction Learning for Embodied Agents EMNLP 2024

Transformers Learn Transition Dynamics when Trained to Predict Markov Decision Processes EMNLP 2024

Test Where Decisions Matter: Importance-driven Testing for Deep Reinforcement Learning NIPS 2024

MO-DDN: A Coarse-to-Fine Attribute-based Exploration Agent for Multi-Object Demand-driven Navigation NIPS 2024