CORE4D: A 4D Human-Object-Human Interaction Dataset for Collaborative Object REarrangement

Yun Liu; Chengwen Zhang; Ruofan Xing; Bingda Tang; Bowen Yang; Li Yi

CVPR 2025

Abstract

Understanding how humans cooperatively rearrange household objects is critical for VR/AR and human-robot interaction. However, modeling these behaviors remains under-researched due to the lack of relevant datasets. We fill this gap by presenting CORE4D, a novel large-scale 4D human-object-human interaction dataset focusing on collaborative object rearrangement, which encompasses diverse compositions of various object geometries, collaboration modes, and 3D scenes. Starting from 1K human-object-human motion sequences captured in the real world, we enrich CORE4D by contributing an iterative collaboration retargeting strategy that augments motions to a variety of novel objects. Leveraging this approach, CORE4D comprises a total of 11K collaboration sequences spanning 3K real and virtual object shapes. Benefiting from the extensive motion patterns provided by CORE4D, we benchmark two tasks aimed at generating human-object interaction: human-object motion forecasting and interaction synthesis. Extensive experiments demonstrate the effectiveness of our collaboration retargeting strategy and indicate that CORE4D poses new challenges to existing human-object interaction generation methodologies.

Topics

Artificial Intelligence > Core AI > Agent Systems
Artificial Intelligence > Core AI > Multi-Agent Systems
Artificial Intelligence > Core AI > Robotics
Computer Vision > Analysis > 3D Vision
Computer Vision > Analysis > Action Recognition
Computer Vision > Analysis > Human Analysis
Computer Vision > Domain-Specific > Robotics
Robotics > Applications > Robotics

Keywords

action recognition, motion forecasting, 3D vision, human-object interaction, human-robot interaction, motion capture, 4D dataset, collaborative object rearrangement, interaction synthesis, collaborative rearrangement
