Iterative Residual Policy for Goal-Conditioned Dynamic Manipulation of Deformable Objects

Cheng Chi; Benjamin Burchfiel; Eric Cousineau; Siyuan Feng; Shuran Song

RSS 2022

Abstract

This paper tackles the task of goal-conditioned dynamic manipulation of deformable objects. This task is highly challenging due to its complex dynamics (introduced by object deformation and high-speed action) and strict task requirements (defined by a precise goal specification). To address these challenges, we present Iterative Residual Policy (IRP), a general learning framework applicable to repeatable tasks with complex dynamics. IRP learns an implicit policy via delta dynamics -- instead of modeling the entire dynamical system and inferring actions from that model, IRP learns delta dynamics that predict the effects of delta action on the previously-observed trajectory. When combined with adaptive action sampling, the system can quickly optimize its actions online to reach a specified goal. We demonstrate the effectiveness of IRP on two tasks: whipping a rope to hit a target point and swinging a cloth to reach a target pose. Despite being trained only in simulation on a fixed robot setup, IRP is able to efficiently generalize to noisy real-world dynamics, new objects with unseen physical properties, and even different robot hardware embodiments, demonstrating its excellent generalization capability relative to alternative approaches.
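The loop described in the abstract (execute an action, observe the result, predict the effect of small action changes on that observation, pick the best change, repeat) can be illustrated with a toy sketch. This is not the authors' implementation: the scalar `true_dynamics`, the hand-written `delta_model` (standing in for the learned delta dynamics network), and the rule that scales the sampling range with the current error are all assumptions made for illustration. The paper works with observed trajectories rather than a single scalar outcome.

```python
import numpy as np


def true_dynamics(action):
    """Hypothetical stand-in for the real system; the policy never sees this formula."""
    return 2.0 * action + 0.3 * np.sin(3.0 * action)


def delta_model(observed, delta_action):
    """Hypothetical stand-in for learned delta dynamics.

    Predicts the new outcome from the last observed outcome and a change in
    action. It is deliberately imperfect: it ignores the sinusoidal term.
    """
    return observed + 2.0 * delta_action


def iterative_residual_policy(goal, action=0.0, steps=12, n_samples=256, seed=0):
    """Refine one scalar action over repeated trials of the same task."""
    rng = np.random.default_rng(seed)
    errors = []
    for _ in range(steps):
        observed = true_dynamics(action)  # run a trial and observe the outcome
        error = abs(goal - observed)
        errors.append(error)
        # Assumed adaptive sampling rule: shrink the search range as the error shrinks.
        sigma = max(error, 1e-6)
        # Include a zero delta so the model can choose to keep the current action.
        deltas = np.concatenate(([0.0], rng.normal(0.0, sigma, n_samples)))
        predicted = delta_model(observed, deltas)
        best = deltas[np.argmin(np.abs(goal - predicted))]
        action = action + best  # apply the residual to the previous action
    return action, errors


if __name__ == "__main__":
    final_action, errors = iterative_residual_policy(goal=1.5)
    print("first error:", errors[0], "last error:", errors[-1])
```

Because each prediction is anchored on the most recent observation, the model only needs to be roughly right about the effect of a small action change; in this toy setting the error keeps shrinking even though `delta_model` does not match `true_dynamics`.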

Authors

Cheng Chi, Benjamin Burchfiel, Eric Cousineau, Siyuan Feng, Shuran Song

Topics

Machine Learning > Core Methods > Representation Learning
Machine Learning > Optimization & Theory > Optimization
Reinforcement Learning > Applications > Robotics
Robotics > Capabilities > Manipulation
Artificial Intelligence > Core AI > Robotics

Keywords

sim-to-real transfer, policy learning, deformable object, dynamic manipulation, implicit policy, goal-conditioned manipulation, iterative residual policy, delta dynamics
