Shan Guo
3 papers · 2024–2025 · 2 conferences · across top CS/AI conferences
Achievements
Conference Polyglot (2)
Interdisciplinary Bridge
Taxonomy Completionist (18)
Keyword Pioneer
Cross-Pollinator (15)
Conferences
CVPR (2)
AAAI (1)
Keywords
visual question answering (2)
multimodal learning (1)
document understanding (1)
scene text detection (1)
instruction tuning (1)
vision language model (1)
multi-modal large language model (1)
multimodal large language model (1)
multimodal representation (1)
visual-language alignment (1)
mask generation (1)
optical character recognition (1)
visual-text alignment (1)
text attribute (1)
text recognition (1)
text-image alignment (1)
text spotting (1)
scene text spotting (1)
visual language alignment (1)
pre-training method (1)