2024
CVPR
CVPR 2024
WonderJourney: Going from Anywhere to Everywhere
Abstract
We introduce WonderJourney a modular framework for perpetual 3D scene generation. Unlike prior work on view generation that focuses on a single type of scenes we start at any user-provided location (by a text description or an image) and generate a journey through a long sequence of diverse yet coherently connected 3D scenes. We leverage an LLM to generate textual descriptions of the scenes in this journey a text-driven point cloud generation pipeline to make a compelling and coherent sequence of 3D scenes and a large VLM to verify the generated scenes. We show compelling diverse visual results across various scene types and styles forming imaginary "wonderjourneys". Project website: https://kovenyu.com/WonderJourney.
🌉
Interdisciplinary Bridge
— Artificial Intelligence and Computer Vision and Deep Learning
🧭
Keyword Pioneer
— perpetual generation
🐣
Hot Topic Early Bird
— scene generation
🐝
Cross-Pollinator
— Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio
Authors
Topics
Artificial Intelligence > Core AI > Multimodal Learning
Artificial Intelligence > Core AI > Procedural Generation
Deep Learning > Models > Generative Models
Computer Vision > Analysis > 3D Vision
Computer Vision > Generation > Video Generation
Artificial Intelligence > Core AI > Large Language Models
Artificial Intelligence > Core AI > Multi-Modal Learning
Deep Learning > Learning Types > Generative Models
Computer Vision > Generation > 3D Generation