Semi-Supervised 3D Hand-Object Poses Estimation With Interactions in Time

Shaowei Liu; Hanwen Jiang; Jiarui Xu; Sifei Liu; Xiaolong Wang

2021 CVPR CVPR 2021

Semi-Supervised 3D Hand-Object Poses Estimation With Interactions in Time

Abstract

Estimating 3D hand and object pose from a single image is an extremely challenging problem: hands and objects are often self-occluded during interactions, and the 3D annotations are scarce as even humans cannot directly label the ground-truths from a single image perfectly. To tackle these challenges, we propose a unified framework for estimating the 3D hand and object poses with semi-supervised learning. We build a joint learning framework where we perform explicit contextual reasoning between hand and object representations. Going beyond limited 3D annotations in a single image, we leverage the spatial-temporal consistency in large-scale hand-object videos as a constraint for generating pseudo labels in semi-supervised learning. Our method not only improves hand pose estimation in challenging real-world dataset, but also substantially improve the object pose which has fewer ground-truths per instance. By training with large-scale diverse videos, our model also generalizes better across multiple out-of-domain datasets. Project page and code: https://stevenlsw.github.io/Semi-Hand-Object

🌉 Interdisciplinary Bridge — Artificial Intelligence and Computer Vision and Machine Learning

🐣 Hot Topic Early Bird — hand pose estimation

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Shaowei Liu , Hanwen Jiang , Jiarui Xu , Sifei Liu , Xiaolong Wang

Topics

Machine Learning > Learning Types > Semi-Supervised Learning Computer Vision > Analysis > 3D Vision Computer Vision > Analysis > Human Pose Estimation Computer Vision > Core AI > Computer Vision Machine Learning > Learning Paradigms > Semi-Supervised Learning Artificial Intelligence > Learning Paradigms > Semi-Supervised Learning Computer Vision > Analysis > Pose Estimation

Keywords

semi-supervised learning 3d vision hand pose estimation pseudo labeling 3d pose estimation pseudo label hand-object interaction object pose estimation object pose hand pose spatial-temporal consistency

Download PDF

Related papers

Learning To Reconstruct High Speed and High Dynamic Range Videos From Events 2021

DeFLOCNet: Deep Image Editing via Flexible Low-Level Controls 2021

Vx2Text: End-to-End Learning of Video-Based Text Generation From Multimodal Inputs 2021

Coming Down to Earth: Satellite-to-Street View Synthesis for Geo-Localization 2021

Pose-Guided Human Animation From a Single Image in the Wild 2021