Computer Vision › Generation ›

Video Generation

1433 directly classified papers

Papers per year

Papers

Timestep Embedding Tells: It's Time to Cache for Video Diffusion Model CVPR 2025

Multi-identity Human Image Animation with Structural Video Diffusion ICCV 2025

VideoGuide: Improving Video Diffusion Models without Training Through a Teacher's Guide CVPR 2025

Accelerating Diffusion Sampling via Exploiting Local Transition Coherence ICCV 2025

AC3D: Analyzing and Improving 3D Camera Control in Video Diffusion Transformers CVPR 2025

Learning 4D Embodied World Models ICCV 2025

CVT5: Using Compressed Video Encoder and UMT5 for Dense Video Captioning COLING 2025

ReCamMaster: Camera-Controlled Generative Rendering from A Single Video ICCV 2025

MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling CVPR 2025

FreeDance: Towards Harmonic Free-Number Group Dance Generation via a Unified Framework ICCV 2025

Cross-Modal Learning for Music-to-Music-Video Description Generation NAACL 2025

Beyond the Frame: Generating 360deg Panoramic Videos from Perspective Videos ICCV 2025

TASTE-Rob: Advancing Video Generation of Task-Oriented Hand-Object Interaction for Generalizable Robotic Manipulation CVPR 2025

Bitrate-Controlled Diffusion for Disentangling Motion and Content in Video ICCV 2025

ViBe: A Text-to-Video Benchmark for Evaluating Hallucination in Large Multimodal Models NAACL 2025

InstaDrive: Instance-Aware Driving World Models for Realistic and Consistent Video Generation ICCV 2025

Can Generative Video Models Help Pose Estimation? CVPR 2025

DIVE: Taming DINO for Subject-Driven Video Editing ICCV 2025

FADA: Fast Diffusion Avatar Synthesis with Mixed-Supervised Multi-CFG Distillation CVPR 2025

GReg: Geometry-Aware Region Refinement for Sign Language Video Generation ICCV 2025

CAT4D: Create Anything in 4D with Multi-View Video Diffusion Models CVPR 2025

DiffVSR: Revealing an Effective Recipe for Taming Robust Video Super-Resolution Against Complex Degradations ICCV 2025

T2Bs: Text-to-Character Blendshapes via Video Generation ICCV 2025

Large-scale Pre-training for Grounded Video Caption Generation ICCV 2025

ReMask-Animate: Refined Character Image Animation Using Mask-Guided Adapters AAAI 2025