Research Explorer
Papers
Conferences
Authors
Topics
Keywords
Trends
Achievements
Explore
← Back to papers
2024
ECCV
ECCV 2024
Groma: Localized Visual Tokenization for Grounding Multimodal Large Language Models
Authors
Chuofan Ma
,
Yi Jiang
,
Jiannan Wu
,
Zehuan Yuan
,
Xiaojuan Qi
Download PDF
Related papers
Ponymation: Learning Articulated 3D Animal Motions from Unlabeled Online Videos
2024
Learning Camouflaged Object Detection from Noisy Pseudo Label
2024
ScaleDreamer: Scalable Text-to-3D Synthesis with Asynchronous Score Distillation
2024
FinePseudo: Improving Pseudo-Labelling through Temporal-Alignablity for Semi-Supervised Fine-Grained Action Recognition
2024
UniCode : Learning a Unified Codebook for Multimodal Large Language Models
2024