Nautilus: Locality-aware Autoencoder for Scalable Mesh Generation

Yuxuan Wang; Xuanyu Yi; Haohan Weng; Qingshan Xu; Xiaokang Wei; Xianghui Yang; Chunchao Guo; Long Chen; Hanwang Zhang

2025 ICCV ICCV 2025

Nautilus: Locality-aware Autoencoder for Scalable Mesh Generation

Abstract

Triangle meshes are fundamental to 3D applications. Current automatic mesh generation methods typically rely on intermediate representations that lack the continuous surface quality inherent to meshes. Converting these representations into meshes produces dense, suboptimal outputs. Although recent autoregressive approaches demonstrate promise in directly modeling mesh vertices and faces, they are constrained by the limitation in face count, scalability, and structural fidelity.To address these challenges, we propose Nautilus, a locality-aware autoencoder for artist-like mesh generation that leverages the local properties of manifold meshes to achieve structural fidelity and efficient representation. Our approach introduces a novel tokenization algorithm that preserves face proximity relationships and compresses sequence length through locally shared vertices and edges, enabling the generation of meshes with an unprecedented scale of up to 5,000 faces. Furthermore, we develop a Dual-stream Point Conditioner that captures fine-grained geometry, ensuring global consistency and local structural fidelity. Our experiments demonstrate that Nautilus significantly outperforms existing methods in generation quality.

🌉 Interdisciplinary Bridge — Computer Science and Computer Vision and Deep Learning

🧭 Keyword Pioneer — local properties

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Deep Learning, Healthcare & Medicine, Interdisciplinary, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Speech & Audio

Authors

Yuxuan Wang , Xuanyu Yi , Haohan Weng , Qingshan Xu , Xiaokang Wei , Xianghui Yang , Chunchao Guo , Long Chen , Hanwang Zhang

Topics

Deep Learning > Architectures > Autoencoders Computer Vision > Generation > Image Generation Computer Science > Applications > Computer Graphics

Keywords

mesh generation autoencoder architecture 3d geometry tokenization algorithm structural fidelity local properties manifold mesh triangle mesh locality-aware tokenization

Download PDF

Related papers

MA-CIR: A Multimodal Arithmetic Benchmark for Composed Image Retrieval 2025

SimMLM: A Simple Framework for Multi-modal Learning with Missing Modality 2025

MonSTeR: a Unified Model for Motion, Scene, Text Retrieval 2025

ASGS: Single-Domain Generalizable Open-Set Object Detection via Adaptive Subgraph Searching 2025

Robust Dataset Condensation using Supervised Contrastive Learning 2025