MAGE : Single Image to Material-Aware 3D via the Multi-View G-Buffer Estimation Model

Haoyuan Wang; Zhenwei Wang; Xiaoxiao Long; Cheng Lin; Gerhard Hancke; Rynson W.H. Lau

2025 CVPR CVPR 2025

MAGE : Single Image to Material-Aware 3D via the Multi-View G-Buffer Estimation Model

Abstract

With advances in deep learning models and the availability of large-scale 3D datasets, we have recently witnessed significant progress in single-view 3D reconstruction. However, existing methods often fail to reconstruct physically based material properties given a single image, limiting their applicability in complicated scenarios. This paper presents a novel approach (named MAGE) for generating 3D geometry with realistic decomposed material properties given a single image as input. Our method leverages inspiration from traditional computer graphics deferred rendering pipelines to introduce a multi-view G-buffer estimation model. The proposed model estimates G-buffers for various views as multi-domain images, including XYZ coordinates, normals, albedo, roughness, and metallic properties from a single-view RGB image. To address the inherent ambiguity and inconsistency in generating G-buffers simultaneously, we also formulate a deterministic network from the pretrained diffusion models and propose a lighting response loss that enforces consistency across these domains using PBR principles. Finally, we propose a large-scale synthetic dataset rich in material diversity for our model training. Experimental results demonstrate the effectiveness of our method in producing high-quality 3D meshes with rich material properties. Our code and dataset can be found at https://www.whyy.site/paper/mage.

🌉 Interdisciplinary Bridge — Computer Vision and Deep Learning

🧭 Keyword Pioneer — multi-view estimation

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Deep Learning, Interdisciplinary, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Speech & Audio

Authors

Haoyuan Wang , Zhenwei Wang , Xiaoxiao Long , Cheng Lin , Gerhard Hancke , Rynson W.H. Lau

Topics

Deep Learning > Models > Diffusion Models Computer Vision > Analysis > 3D Vision Computer Vision > Domain-Specific > Medical Imaging Computer Vision > Domain-Specific > Computer Graphics Computer Vision > Generation > 3D Generation Computer Vision > Processing > 3D Vision

Keywords

material estimation physically based rendering 3d geometry material property single-view 3d reconstruction g-buffer estimation deferred rendering multi-view estimation material properties

Download PDF

Related papers

AnyCam: Learning to Recover Camera Poses and Intrinsics from Casual Videos 2025

SeriesBench: A Benchmark for Narrative-Driven Drama Series Understanding 2025

FADE: Frequency-Aware Diffusion Model Factorization for Video Editing 2025

Fast and Accurate Gigapixel Pathological Image Classification with Hierarchical Distillation Multi-Instance Learning 2025

Reversible Decoupling Network for Single Image Reflection Removal 2025