Joint-Relation Transformer for Multi-Person Motion Prediction

Qingyao Xu; Weibo Mao; Jingze Gong; Chenxin Xu; Siheng Chen; Weidi Xie; Ya Zhang; Yanfeng Wang

2023 ICCV ICCV 2023

Joint-Relation Transformer for Multi-Person Motion Prediction

Abstract

Multi-person motion prediction is a challenging problem due to the dependency of motion on both individual past movements and interactions with other people. Transformer-based methods have shown promising resultson this task, but they miss the explicit relation representation between joints, such as skeleton structure and pairwise distance, which is crucial for accurate interaction modeling. In this paper, we propose the Joint-Relation Transformer, which utilizes relation information to enhance interaction modeling and improve future motion prediction. Our relation information contains the relative distance and the intra/inter-person physical constraints. To fuse relation and joint information, we design a novel joint-relation fusion layer with relation-aware attention to update both features. Additionally, we supervise the relation information by forecasting future distance. Experiments show that our method achieves a 13.4% improvement of 900ms VIM on 3DPW-SoMoF/RC and 17.8%/12.0% improvement of 3s MPJPE on CMU-Mpcap/MuPoTS-3D dataset.

🧭 Keyword Pioneer — skeleton structure

🐣 Hot Topic Early Bird — human motion

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Qingyao Xu , Weibo Mao , Jingze Gong , Chenxin Xu , Siheng Chen , Weidi Xie , Ya Zhang , Yanfeng Wang

Topics

Deep Learning > Architectures > Transformers

Keywords

attention mechanism human motion motion prediction interaction modeling skeleton structure joint relation

Download PDF

Related papers

PVT++: A Simple End-to-End Latency-Aware Visual Tracking Framework 2023

Periodically Exchange Teacher-Student for Source-Free Object Detection 2023

Stable and Causal Inference for Discriminative Self-supervised Deep Visual Representations 2023

Minimal Solutions to Uncalibrated Two-view Geometry with Known Epipoles 2023

3D Neural Embedding Likelihood: Probabilistic Inverse Graphics for Robust 6D Pose Estimation 2023