ECCV 2022

D3Net: A Unified Speaker-Listener Architecture for 3D Dense Captioning and Visual Grounding

Authors

Zhenyu Chen, Qirui Wu, Matthias Nießner, Angel X. Chang

Related papers

Dynamically Transformed Instance Normalization Network for Generalizable Person Re-identification 2022

Synthesizing Light Field Video from Monocular Video 2022

MovieCuts: A New Dataset and Benchmark for Cut Type Recognition 2022

UC-OWOD: Unknown-Classified Open World Object Detection 2022

Contributions of Shape, Texture, and Color in Visual Recognition 2022