PhoCaL: A Multi-Modal Dataset for Category-Level Object Pose Estimation With Photometrically Challenging Objects

Pengyuan Wang; Hyunjun Jung; Yitong Li; Siyuan Shen; Rahul Parthasarathy Srikanth; Lorenzo Garattoni; Sven Meier; Nassir Navab; Benjamin Busam

2022 CVPR CVPR 2022

PhoCaL: A Multi-Modal Dataset for Category-Level Object Pose Estimation With Photometrically Challenging Objects

Abstract

Object pose estimation is crucial for robotic applications and augmented reality. Beyond instance level 6D object pose estimation methods, estimating category-level pose and shape has become a promising trend. As such, a new research field needs to be supported by well-designed datasets. To provide a benchmark with high-quality ground truth annotations to the community, we introduce a multimodal dataset for category-level object pose estimation with photometrically challenging objects termed PhoCaL. PhoCaL comprises 60 high quality 3D models of household objects over 8 categories including highly reflective, transparent and symmetric objects. We developed a novel robot-supported multi-modal (RGB, depth, polarisation) data acquisition and annotation process. It ensures sub-millimeter accuracy of the pose for opaque textured, shiny and transparent objects, no motion blur and perfect camera synchronisation. To set a benchmark for our dataset, state-of-the-art RGB-D and monocular RGB methods are evaluated on the challenging scenes of PhoCaL.

🌉 Interdisciplinary Bridge — Computer Vision and Robotics

🧭 Keyword Pioneer — category level

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy

Authors

Pengyuan Wang , Hyunjun Jung , Yitong Li , Siyuan Shen , Rahul Parthasarathy Srikanth , Lorenzo Garattoni , Sven Meier , Nassir Navab , Benjamin Busam

Topics

Computer Vision > Analysis > 3D Vision Robotics > Capabilities > Perception Computer Vision > Core AI > Computer Vision

Keywords

3d vision object pose estimation multi-modal dataset category-level pose category level photometrically challenging

Download PDF

Related papers

UniCoRN: A Unified Conditional Image Repainting Network 2022

Why Discard if You Can Recycle?: A Recycling Max Pooling Module for 3D Point Cloud Analysis 2022

All-in-One Image Restoration for Unknown Corruption 2022

Stability-Driven Contact Reconstruction From Monocular Color Images 2022

Forecasting Characteristic 3D Poses of Human Actions 2022