Robots That Ask For Help: Uncertainty Alignment for Large Language Model Planners

Allen Z. Ren; Anushri Dixit; Alexandra Bodrova; Sumeet Singh; Stephen Tu; Noah Brown; Peng Xu; Leila Takayama; Fei Xia; Jake Varley; Zhenjia Xu; Dorsa Sadigh; Andy Zeng; Anirudha Majumdar

CoRL 2023

Abstract

Large language models (LLMs) exhibit a wide range of promising capabilities — from step-by-step planning to commonsense reasoning — that may provide utility for robots, but remain prone to confidently hallucinated predictions. In this work, we present KnowNo, a framework for measuring and aligning the uncertainty of LLM-based planners, such that they know when they don’t know, and ask for help when needed. KnowNo builds on the theory of conformal prediction to provide statistical guarantees on task completion while minimizing human help in complex multi-step planning settings. Experiments across a variety of simulated and real robot setups that involve tasks with different modes of ambiguity (for example, from spatial to numeric uncertainties, from human preferences to Winograd schemas) show that KnowNo performs favorably over modern baselines (which may involve ensembles or extensive prompt tuning) in terms of improving efficiency and autonomy, while providing formal assurances. KnowNo can be used with LLMs out-of-the-box without model-finetuning, and suggests a promising lightweight approach to modeling uncertainty that can complement and scale with the growing capabilities of foundation models.
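The abstract says only that KnowNo builds on conformal prediction and asks for help when needed. As an illustration of that idea, and not the authors' implementation, here is a minimal sketch of standard split conformal prediction applied to a multiple-choice planning step. The function names (`conformal_quantile`, `prediction_set`, `plan_or_ask`), the nonconformity score `1 - p`, the rule "act only when the prediction set holds exactly one option", and all numbers are assumptions made for this example.

```python
import math

def conformal_quantile(cal_scores, epsilon):
    """Finite-sample-corrected (1 - epsilon) quantile of calibration scores.

    cal_scores: nonconformity scores, here assumed to be 1 - p(true option),
    collected on held-out calibration tasks.
    """
    n = len(cal_scores)
    k = math.ceil((n + 1) * (1 - epsilon))  # rank of the order statistic to use
    if k > n:
        # Too few calibration points for this epsilon: keep every option.
        return float("inf")
    return sorted(cal_scores)[k - 1]

def prediction_set(option_probs, qhat):
    """Keep every option whose score 1 - p does not exceed the threshold."""
    return [opt for opt, p in option_probs.items() if 1.0 - p <= qhat]

def plan_or_ask(option_probs, qhat):
    """Hypothetical decision rule: act on a single option, otherwise ask."""
    options = prediction_set(option_probs, qhat)
    if len(options) == 1:
        return ("act", options)
    return ("ask", options)

# Made-up calibration scores for nine held-out tasks.
cal_scores = [0.05, 0.10, 0.20, 0.30, 0.40, 0.50, 0.55, 0.60, 0.70]
qhat = conformal_quantile(cal_scores, epsilon=0.25)

# Made-up planner confidences over candidate next steps.
clear = {"A": 0.90, "B": 0.07, "C": 0.03}
ambiguous = {"A": 0.45, "B": 0.42, "C": 0.13}

print(plan_or_ask(clear, qhat))      # one option survives, so the robot acts
print(plan_or_ask(ambiguous, qhat))  # two options survive, so the robot asks
```

Under the usual exchangeability assumption between calibration and test tasks, a set built this way contains the correct option with probability at least 1 - epsilon. A lower epsilon enlarges the sets and so triggers more requests for help, which is the trade-off between assurance and autonomy that the abstract describes.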

Topics

Artificial Intelligence > Core AI > AI Safety; Artificial Intelligence > Core AI > Planning; Machine Learning > Optimization & Theory > Bayesian Inference

Keywords

conformal prediction; uncertainty quantification; human-robot interaction; robot planning; large language model

Related papers

Stochastic Occupancy Grid Map Prediction in Dynamic Scenes 2023

SayPlan: Grounding Large Language Models using 3D Scene Graphs for Scalable Robot Task Planning 2023

Robot Parkour Learning 2023

Task-Oriented Koopman-Based Control with Contrastive Encoder 2023

Language-Guided Traffic Simulation via Scene-Level Diffusion 2023