ICML 2024

BRAIn: Bayesian Reward-conditioned Amortized Inference for natural language generation from feedback