Research Explorer
Papers
Conferences
Authors
Topics
Keywords
Trends
Achievements
Explore
← Generation
Computer Vision
›
Generation
›
Video Generation
1433 directly classified papers
Papers per year
2006: 2
2007: 1
2013: 8
2014: 2
2015: 3
2016: 10
2017: 15
2018: 27
2019: 56
2020: 56
2021: 85
2022: 81
2023: 177
2024: 277
2025: 540
2026: 93
Papers
Cross-Modal Learning for Music-to-Music-Video Description Generation
NAACL 2025
CVT5: Using Compressed Video Encoder and UMT5 for Dense Video Captioning
COLING 2025
MVPortrait: Text-Guided Motion and Emotion Control for Multi-view Vivid Portrait Animation
CVPR 2025
ETVA: Evaluation of Text-to-Video Alignment via Fine-grained Question Generation and Answering
ICCV 2025
LongAnimation: Long Animation Generation with Dynamic Global-Local Memory
ICCV 2025
OSV: One Step is Enough for High-Quality Image to Video Generation
CVPR 2025
WF-VAE: Enhancing Video VAE by Wavelet-Driven Energy Flow for Latent Video Diffusion Model
CVPR 2025
AnimateAnything: Consistent and Controllable Animation for Video Generation
CVPR 2025
WorldScore: A Unified Evaluation Benchmark for World Generation
ICCV 2025
OCK: Unsupervised Dynamic Video Prediction with Object-Centric Kinematics
ICCV 2025
OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models
ICCV 2025
NoiseController: Towards Consistent Multi-view Video Generation via Noise Decomposition and Collaboration
ICCV 2025
Multi-identity Human Image Animation with Structural Video Diffusion
ICCV 2025
Accelerating Diffusion Sampling via Exploiting Local Transition Coherence
ICCV 2025
Learning 4D Embodied World Models
ICCV 2025
ReCamMaster: Camera-Controlled Generative Rendering from A Single Video
ICCV 2025
FreeDance: Towards Harmonic Free-Number Group Dance Generation via a Unified Framework
ICCV 2025
Beyond the Frame: Generating 360deg Panoramic Videos from Perspective Videos
ICCV 2025
Bitrate-Controlled Diffusion for Disentangling Motion and Content in Video
ICCV 2025
InstaDrive: Instance-Aware Driving World Models for Realistic and Consistent Video Generation
ICCV 2025
DIVE: Taming DINO for Subject-Driven Video Editing
ICCV 2025
GReg: Geometry-Aware Region Refinement for Sign Language Video Generation
ICCV 2025
DiffVSR: Revealing an Effective Recipe for Taming Robust Video Super-Resolution Against Complex Degradations
ICCV 2025
Large-scale Pre-training for Grounded Video Caption Generation
ICCV 2025
Teller: Real-Time Streaming Audio-Driven Portrait Animation with Autoregressive Motion Generation
CVPR 2025
<
1
…
4
5
6
…
58
>