Learning Video Representations of Human Motion From Synthetic Data

Xi Guo; Wei Wu; Dongliang Wang; Jing Su; Haisheng Su; Weihao Gan; Jian Huang; Qin Yang

CVPR 2022

Abstract

In this paper, we take an early step towards video representation learning of human actions with the help of large-scale synthetic videos, focusing on enhancing human motion representations. Specifically, we first introduce an automatic pipeline for synthesizing action-related videos, built on a photorealistic video game. Using this pipeline, we build a large-scale human action dataset named GATA (GTA Animation Transformed Actions), which includes 8.1 million action clips spanning more than 28K action classes. Based on this dataset, we design a contrastive learning framework for human motion representation learning, which yields significant performance improvements on several typical video datasets for action recognition, e.g., Charades, HAA 500 and NTU-RGB. In addition, we explore a domain adaptation method based on mining cross-domain positive pairs to reduce the domain gap between synthetic and real data. Extensive analyses of the properties of the learned representations demonstrate the effectiveness of the proposed dataset for enhancing human motion representation learning.
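The abstract does not give the loss or the mining rule, so the following is only a minimal sketch of the two ideas it names, under stated assumptions. It assumes a standard InfoNCE-style contrastive loss over clip embeddings and a simple nearest-neighbour rule for pairing synthetic and real clips. The function names (`info_nce_loss`, `mine_cross_domain_positives`), the temperature value, and the cosine-similarity criterion are all hypothetical and are not taken from the paper.

```python
import numpy as np

def l2_normalize(x, eps=1e-12):
    # Scale each row to unit length so dot products become cosine similarities.
    return x / np.maximum(np.linalg.norm(x, axis=1, keepdims=True), eps)

def info_nce_loss(anchors, positives, temperature=0.1):
    # Generic InfoNCE loss (assumption: not necessarily the paper's exact loss).
    # Row i of `positives` is the positive for row i of `anchors`;
    # every other row in the batch acts as a negative.
    a = l2_normalize(anchors)
    p = l2_normalize(positives)
    logits = a @ p.T / temperature                       # shape (N, N)
    logits = logits - logits.max(axis=1, keepdims=True)  # numerical stability
    log_prob = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
    return float(-np.mean(np.diag(log_prob)))

def mine_cross_domain_positives(synthetic, real):
    # Hypothetical mining rule: for each synthetic clip embedding, pick the
    # real clip embedding with the highest cosine similarity as its positive.
    sim = l2_normalize(synthetic) @ l2_normalize(real).T
    return sim.argmax(axis=1)

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    synthetic = rng.normal(size=(8, 16))                 # stand-in clip embeddings
    real = synthetic + 0.01 * rng.normal(size=(8, 16))   # near-duplicates
    idx = mine_cross_domain_positives(synthetic, real)
    print("mined pairs:", idx)
    print("loss on mined pairs:", info_nce_loss(synthetic, real[idx]))
```

In this toy setup the embeddings are random vectors standing in for the output of a video encoder. In practice the mined indices would come from features of an actual model, and mined pairs would typically be filtered by a similarity threshold before being used as positives.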

Topics

Computer Vision > Analysis > Action Recognition
Machine Learning > Learning Paradigms > Transfer Learning
Deep Learning > Techniques > Contrastive Learning
Deep Learning > Learning Types > Contrastive Learning
Deep Learning > Learning Types > Domain Adaptation

Keywords

representation learning, contrastive learning, action recognition, domain adaptation, synthetic data, video representation learning, human motion representation
