Promptable Behaviors: Personalizing Multi-Objective Rewards from Human Preferences

Minyoung Hwang; Luca Weihs; Chanwoo Park; Kimin Lee; Aniruddha Kembhavi; Kiana Ehsani

CVPR 2024

Abstract

Customizing robotic behaviors to be aligned with diverse human preferences is an underexplored challenge in the field of embodied AI. In this paper, we present Promptable Behaviors, a novel framework that facilitates efficient personalization of robotic agents to diverse human preferences in complex environments. We use multi-objective reinforcement learning to train a single policy adaptable to a broad spectrum of preferences. We introduce three distinct methods to infer human preferences by leveraging different types of interactions: (1) human demonstrations, (2) preference feedback on trajectory comparisons, and (3) language instructions. We evaluate the proposed method in personalized object-goal navigation and flee navigation tasks in ProcTHOR and RoboTHOR, demonstrating the ability to prompt agent behaviors to satisfy human preferences in various scenarios.
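The abstract does not spell out the reward formulation, so the following is only a minimal sketch of one common way to set up the two ideas it names: a multi-objective reward collapsed by a preference weight vector, and preference feedback on a pair of trajectories. Both the linear scalarization and the Bradley-Terry comparison model are assumptions made here for illustration, not details confirmed by this page. The objective names (time efficiency, safety) and all function names are hypothetical.

```python
import math
from typing import Sequence

# One reward vector per step, one entry per objective.
# Hypothetical objectives for illustration: (time efficiency, safety).
Trajectory = Sequence[Sequence[float]]


def scalarize(reward_vec: Sequence[float], weights: Sequence[float]) -> float:
    """Weighted sum of per-objective rewards (linear scalarization, an assumption)."""
    if len(reward_vec) != len(weights):
        raise ValueError("reward vector and weight vector must have equal length")
    return sum(w * r for w, r in zip(weights, reward_vec))


def trajectory_return(trajectory: Trajectory, weights: Sequence[float]) -> float:
    """Undiscounted scalarized return of a trajectory under the given weights."""
    return sum(scalarize(step, weights) for step in trajectory)


def preference_probability(
    traj_a: Trajectory, traj_b: Trajectory, weights: Sequence[float]
) -> float:
    """Bradley-Terry probability that traj_a is preferred over traj_b (an assumption)."""
    diff = trajectory_return(traj_a, weights) - trajectory_return(traj_b, weights)
    # Numerically stable logistic function of the return difference.
    if diff >= 0:
        return 1.0 / (1.0 + math.exp(-diff))
    e = math.exp(diff)
    return e / (1.0 + e)


# Usage: a fast-but-risky trajectory versus a slow-but-safe one.
fast = [(1.0, 0.0), (1.0, 0.0)]
safe = [(0.0, 1.0), (0.0, 1.0)]
speed_lover = (0.8, 0.2)
safety_lover = (0.2, 0.8)
p_speed = preference_probability(fast, safe, speed_lover)
p_safety = preference_probability(fast, safe, safety_lover)
```

Under these assumptions, the same pair of trajectories is ranked differently depending on the weight vector, which is the sense in which a weight vector can act as a "prompt" for behavior. Inferring the weights from observed comparisons would amount to fitting `weights` so that this probability matches the human's choices.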

Topics

Artificial Intelligence > Core AI > Human-AI Interaction; Artificial Intelligence > Core AI > Multi-Agent Systems; Reinforcement Learning > Methods > Deep RL; Reinforcement Learning > Applications > Robotics; Deep Learning > Learning Types > Reinforcement Learning; Robotics > Applications > Robotics

Keywords

robot navigation; preference learning; embodied AI; preference feedback; multi-objective reinforcement learning; human preference; language instruction
