MetaFuse: A Pre-trained Fusion Model for Human Pose Estimation

Rongchang Xie; CHUNYU WANG; Yizhou Wang

2020 CVPR CVPR 2020

MetaFuse: A Pre-trained Fusion Model for Human Pose Estimation

Abstract

Cross view feature fusion is the key to address the occlusion problem in human pose estimation. The current fusion methods need to train a separate model for every pair of cameras making them difficult to scale. In this work, we introduce MetaFuse, a pre-trained fusion model learned from a large number of cameras in the Panoptic dataset. The model can be efficiently adapted or finetuned for a new pair of cameras using a small number of labeled images. The strong adaptation power of MetaFuse is due in large part to the proposed factorization of the original fusion model into two parts--(1) a generic fusion model shared by all cameras, and (2) lightweight camera-dependent transformations. Furthermore, the generic model is learned from many cameras by a meta-learning style algorithm to maximize its adaptation capability to various camera poses. We observe in experiments that MetaFuse finetuned on the public datasets outperforms the state-of-the-arts by a large margin which validates its value in practice.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Computer Vision and Deep Learning and Machine Learning

🧭 Keyword Pioneer — parameter efficient adaptation

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Rongchang Xie , CHUNYU WANG , Yizhou Wang

Topics

Artificial Intelligence > Learning Paradigms > Transfer Learning Artificial Intelligence > Learning Paradigms > Meta-Learning Computer Vision > Analysis > Human Pose Estimation Machine Learning > Learning Paradigms > Meta-Learning Artificial Intelligence > Core AI > Computer Vision Deep Learning > Learning Types > Transfer Learning Deep Learning > Learning Types > Few-Shot Learning

Keywords

few-shot learning transfer learning domain adaptation human pose estimation feature fusion parameter efficient adaptation cross-view fusion camera adaptation cross view fusion

Download PDF

Related papers

Deep Polarization Cues for Transparent Object Segmentation 2020

HRank: Filter Pruning Using High-Rank Feature Map 2020

Panoptic-Based Image Synthesis 2020

Select, Supplement and Focus for RGB-D Saliency Detection 2020

ClusterVO: Clustering Moving Instances and Estimating Visual Odometry for Self and Surroundings 2020