Deformable Sprites for Unsupervised Video Decomposition

Vickie Ye; Zhengqi Li; Richard Tucker; Angjoo Kanazawa; Noah Snavely

2022 CVPR CVPR 2022

Deformable Sprites for Unsupervised Video Decomposition

Abstract

We describe a method to extract persistent elements of a dynamic scene from an input video. We represent each scene element as a Deformable Sprite consisting of three components: 1) a 2D texture image for the entire video, 2) per-frame masks for the element, and 3) non-rigid deformations that map the texture image into each video frame. The resulting decomposition allows for applications such as consistent video editing. Deformable Sprites are a type of video auto-encoder model that is optimized on individual videos, and does not require training on a large dataset, nor does it rely on pre-trained models. Moreover, our method does not require object masks or other user input, and discovers moving objects of a wider variety than previous work. We evaluate our approach on standard video datasets and show qualitative results on a diverse array of Internet videos.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Computer Science and Computer Vision and Deep Learning and Machine Learning

🐣 Hot Topic Early Bird — video editing

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Vickie Ye , Zhengqi Li , Richard Tucker , Angjoo Kanazawa , Noah Snavely

Topics

Machine Learning > Learning Types > Unsupervised Learning Computer Vision > Processing > Video Processing Computer Science > Systems > Computer Graphics Deep Learning > Learning Types > Self-Supervised Learning Artificial Intelligence > Core AI > Computer Vision Deep Learning > Learning Types > Representation Learning Deep Learning > Learning Types > Unsupervised Learning

Keywords

unsupervised learning video editing non-rigid deformation video decomposition auto-encoder model

Download PDF

Related papers

UniCoRN: A Unified Conditional Image Repainting Network 2022

Why Discard if You Can Recycle?: A Recycling Max Pooling Module for 3D Point Cloud Analysis 2022

All-in-One Image Restoration for Unknown Corruption 2022

Stability-Driven Contact Reconstruction From Monocular Color Images 2022

Forecasting Characteristic 3D Poses of Human Actions 2022