2025 ICML ICML 2025

Scaling Video-Language Models to 10K Frames via Hierarchical Differential Distillation