Research Explorer
Papers
Conferences
Authors
Topics
Keywords
Trends
Achievements
Explore
← Learning Types
Deep Learning
›
Learning Types
›
Reinforcement Learning from Human Feedback
90 directly classified papers
Papers per year
2020: 1
2022: 1
2023: 2
2024: 40
2025: 46
Papers
Discovering Preference Optimization Algorithms with and for Large Language Models
NIPS 2024
DACO: Towards Application-Driven and Comprehensive Data Analysis via Code Generation
NIPS 2024
Aligner: Efficient Alignment by Learning to Correct
NIPS 2024
Automated Multi-level Preference for MLLMs
NIPS 2024
Perplexity-aware Correction for Robust Alignment with Noisy Preferences
NIPS 2024
A Critical Evaluation of AI Feedback for Aligning Large Language Models
NIPS 2024
Preference Alignment with Flow Matching
NIPS 2024
Decoding-Time Language Model Alignment with Multiple Objectives
NIPS 2024
SpeechAlign: Aligning Speech Generation to Human Preferences
NIPS 2024
Calibrated Self-Rewarding Vision Language Models
NIPS 2024
Spectral Editing of Activations for Large Language Model Alignment
NIPS 2024
Nano: Nested Human-in-the-Loop Reward Learning for Few-shot Language Model Control
ACL 2023
OpenAssistant Conversations - Democratizing Large Language Model Alignment
NIPS 2023
Training language models to follow instructions with human feedback
NIPS 2022
Reinforcing an Image Caption Generator Using Off-Line Human Feedback
AAAI 2020
<
1
2
3
4
>