DexPBT: Scaling up Dexterous Manipulation for Hand-Arm Systems with Population Based Training

Aleksei Petrenko; Arthur Allshire; Gavriel State; Ankur Handa; Viktor Makoviychuk

2023 RSS RSS 2023

DexPBT: Scaling up Dexterous Manipulation for Hand-Arm Systems with Population Based Training

Abstract

In this work, we propose algorithms and methods that enable learning dexterous object manipulation using simulated one- or two-armed robots equipped with multi-fingered hand end-effectors. Using a parallel GPU-accelerated physics simulator (Isaac Gym), we implement challenging tasks for these robots, including regrasping, grasp-and-throw, and object reorientation. To solve these problems we introduce a decentralized Population-Based Training (PBT) algorithm that allows us to massively amplify the exploration capabilities of deep reinforcement learning. We find that this method significantly outperforms regular end-to-end learning and is able to discover robust control policies in challenging tasks. Video demonstrations of learned behaviors and the code can be found at https://sites.google.com/view/dexpbt

🌉 Interdisciplinary Bridge — Machine Learning and Reinforcement Learning

🧭 Keyword Pioneer — gpu-accelerated physics

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Deep Learning, Machine Learning, Mathematics & Optimization, Reinforcement Learning, Robotics

Authors

Aleksei Petrenko , Arthur Allshire , Gavriel State , Ankur Handa , Viktor Makoviychuk

Topics

Machine Learning > Optimization & Theory > Optimization Reinforcement Learning > Methods > Deep RL Reinforcement Learning > Applications > Robotics

Keywords

dexterous manipulation population-based training multi-fingered hand gpu-accelerated physics parallel physics simulator

Download PDF

Related papers

FurnitureBench: Reproducible Real-World Benchmark for Long-Horizon Complex Manipulation 2023

Uncertain Pose Estimation during Contact Tasks using Differentiable Contact Features 2023

Follow my Advice: Assume-Guarantee Approach to Task Planning with Human in the Loop 2023

Centralized Model Predictive Control for Collaborative Loco-Manipulation 2023

Robotic Table Tennis: A Case Study into a High Speed Learning System 2023