Arsha Nagrani
40 papers
· 2017–2025
· 9 conferences
· across top CS/AI conferences
Achievements
π£
Hot Topic Early Bird
π
Renaissance Researcher
(5)
π
Interdisciplinary Bridge
π§
Keyword Pioneer
π
Conference Polyglot
(9)
π
Academic Marathon
(8)
π
Cross-Pollinator
(10)
πΊοΈ
Taxonomy Completionist
(69)
π₯
Mega-Team
(43)
π¬
Deep Specialist
(17)
π€
Dynamic Duo
(20)
π§¬
Topic Evolution
π
Conference Pioneer
ποΈ
Keyword Collector
(170)
π
Trend Setter
π
Century Club
(40)
π₯
Unstoppable
(9)
β
The Questioner
β‘
Prolific Year
(7)
Conferences
CVPR (15)
ICCV (8)
INTERSPEECH (5)
ECCV (4)
NIPS (3)
ACL (2)
EMNLP (1)
IJCNLP (1)
WACV (1)
Top co-authors
Research topics
Keywords
multimodal learning
(16)
video understanding
(12)
video captioning
(5)
contrastive learning
(4)
temporal localization
(4)
large language model
(4)
visual question answering
(3)
audio description
(3)
vision-language model
(3)
video question answering
(3)
self-supervised learning
(3)
automatic speech recognition
(3)
speaker verification
(3)
cross-modal learning
(3)
video-language model
(3)
dense video captioning
(3)
efficient computing
(2)
transformer architecture
(2)
zero-shot learning
(2)
semantic alignment
(2)
Papers
VIEWS: Entity-Aware News Video Captioning
EMNLP 2024
Streaming Dense Video Captioning
CVPR 2024
AutoAD: Movie Description in Context
CVPR 2023
LanSER: Language-Model Supported Speech Emotion Recognition
INTERSPEECH 2023
VidChapters-7M: Video Chapters at Scale
NIPS 2023
AVATAR: Unconstrained Audiovisual Speech Recognition
INTERSPEECH 2022
Recognizing Multimodal Entailment
ACL 2021
Localizing Visual Sounds the Hard Way
CVPR 2021
Recognizing Multimodal Entailment
IJCNLP 2021
Spot the Conversation: Speaker Diarisation in the Wild
INTERSPEECH 2020
VoxCeleb2: Deep Speaker Recognition
INTERSPEECH 2018
VoxCeleb: A Large-Scale Speaker Identification Dataset
INTERSPEECH 2017