Can Foundation Models Perform Zero-Shot Task Specification For Robot Manipulation?

Yuchen Cui; Scott Niekum; Abhinav Gupta; Vikash Kumar; Aravind Rajeswaran

L4DC 2022

Abstract

Task specification is at the core of programming autonomous robots. A low-effort modality for task specification is critical for engaging non-expert end users and for the ultimate adoption of personalized robot agents. A widely studied approach to task specification is through goals, using either compact state-space vectors or goal images from the same robot scene. The former is often not easily human-interpretable and necessitates detailed state estimation and scene understanding. The latter requires generating a desired goal image, which often requires a human to complete the task, defeating the purpose of having autonomous robots. In this work, we explore alternative and more general forms of goal specification that are expected to be easier for humans to specify and use, such as images obtained from the internet, hand sketches that provide a visual description of the desired task, or simple language descriptions. As a first step towards this, we study the capabilities of large-scale pre-trained models (foundation models) for zero-shot goal specification, and find that they are surprisingly effective in a collection of simulated robot manipulation tasks and real-world datasets.

Topics

Artificial Intelligence > Core AI > Agent Systems
Artificial Intelligence > Core AI > Foundation Models
Artificial Intelligence > Learning Paradigms > Zero-Shot Learning

Keywords

zero-shot learning, robot manipulation, foundation model, goal specification, task specification
