Video Generation
1433 directly classified papers
Papers per year
2006: 2
2007: 1
2013: 8
2014: 2
2015: 3
2016: 10
2017: 15
2018: 27
2019: 56
2020: 56
2021: 85
2022: 81
2023: 177
2024: 277
2025: 540
2026: 93
Papers
Timestep Embedding Tells: It's Time to Cache for Video Diffusion Model
CVPR 2025
Multi-identity Human Image Animation with Structural Video Diffusion
ICCV 2025
VideoGuide: Improving Video Diffusion Models without Training Through a Teacher's Guide
CVPR 2025
Accelerating Diffusion Sampling via Exploiting Local Transition Coherence
ICCV 2025
AC3D: Analyzing and Improving 3D Camera Control in Video Diffusion Transformers
CVPR 2025
Learning 4D Embodied World Models
ICCV 2025
CVT5: Using Compressed Video Encoder and UMT5 for Dense Video Captioning
COLING 2025
ReCamMaster: Camera-Controlled Generative Rendering from A Single Video
ICCV 2025
MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling
CVPR 2025
FreeDance: Towards Harmonic Free-Number Group Dance Generation via a Unified Framework
ICCV 2025
Cross-Modal Learning for Music-to-Music-Video Description Generation
NAACL 2025
Beyond the Frame: Generating 360° Panoramic Videos from Perspective Videos
ICCV 2025
TASTE-Rob: Advancing Video Generation of Task-Oriented Hand-Object Interaction for Generalizable Robotic Manipulation
CVPR 2025
Bitrate-Controlled Diffusion for Disentangling Motion and Content in Video
ICCV 2025
ViBe: A Text-to-Video Benchmark for Evaluating Hallucination in Large Multimodal Models
NAACL 2025
InstaDrive: Instance-Aware Driving World Models for Realistic and Consistent Video Generation
ICCV 2025
Can Generative Video Models Help Pose Estimation?
CVPR 2025
DIVE: Taming DINO for Subject-Driven Video Editing
ICCV 2025
FADA: Fast Diffusion Avatar Synthesis with Mixed-Supervised Multi-CFG Distillation
CVPR 2025
GReg: Geometry-Aware Region Refinement for Sign Language Video Generation
ICCV 2025
CAT4D: Create Anything in 4D with Multi-View Video Diffusion Models
CVPR 2025
DiffVSR: Revealing an Effective Recipe for Taming Robust Video Super-Resolution Against Complex Degradations
ICCV 2025
T2Bs: Text-to-Character Blendshapes via Video Generation
ICCV 2025
Large-scale Pre-training for Grounded Video Caption Generation
ICCV 2025
ReMask-Animate: Refined Character Image Animation Using Mask-Guided Adapters
AAAI 2025