Research Explorer
Papers
Conferences
Authors
Topics
Keywords
Trends
Achievements
Explore
← Back to papers
2024
ECCV
ECCV 2024
LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models
Authors
Yanwei Li
,
Chengyao Wang
,
Jiaya Jia
Download PDF
Related papers
Ponymation: Learning Articulated 3D Animal Motions from Unlabeled Online Videos
2024
Learning Camouflaged Object Detection from Noisy Pseudo Label
2024
ScaleDreamer: Scalable Text-to-3D Synthesis with Asynchronous Score Distillation
2024
FinePseudo: Improving Pseudo-Labelling through Temporal-Alignablity for Semi-Supervised Fine-Grained Action Recognition
2024
UniCode : Learning a Unified Codebook for Multimodal Large Language Models
2024