Research Explorer
Papers
Conferences
Authors
Topics
Keywords
Trends
Achievements
Explore
← Learning Types
Deep Learning
›
Learning Types
›
Multi-Modal Learning
3194 directly classified papers
Papers per year
2003: 1
2010: 1
2011: 1
2013: 5
2014: 3
2015: 9
2016: 23
2017: 49
2018: 78
2019: 158
2020: 223
2021: 261
2022: 354
2023: 471
2024: 705
2025: 835
2026: 17
Papers
Identifying Visible Actions in Lifestyle Vlogs
ACL 2019
Dense Procedure Captioning in Narrated Instructional Videos
ACL 2019
Like a Baby: Visually Situated Neural Language Acquisition
ACL 2019
Improving Referring Expression Grounding With Cross-Modal Attention-Guided Erasing
CVPR 2019
Learning to Detect Human-Object Interactions With Knowledge
CVPR 2019
Learning Words by Drawing Images
CVPR 2019
ContextDesc: Local Descriptor Augmentation With Cross-Modality Context
CVPR 2019
OK-VQA: A Visual Question Answering Benchmark Requiring External Knowledge
CVPR 2019
LAEO-Net: Revisiting People Looking at Each Other in Videos
CVPR 2019
Text Guided Person Image Synthesis
CVPR 2019
Neural Sequential Phrase Grounding (SeqGROUND)
CVPR 2019
CLEVR-Ref+: Diagnosing Visual Reasoning With Referring Expressions
CVPR 2019
MSCap: Multi-Style Image Captioning With Unpaired Stylized Text
CVPR 2019
3D-SIS: 3D Semantic Instance Segmentation of RGB-D Scans
CVPR 2019
Fast User-Guided Video Object Segmentation by Interaction-And-Propagation Networks
CVPR 2019
CapSal: Leveraging Captioning to Boost Semantics for Salient Object Detection
CVPR 2019
JuICe: A Large Scale Distantly Supervised Dataset for Open Domain Context-based Code Generation
EMNLP 2019
Robust Navigation with Language Pretraining and Stochastic Sampling
EMNLP 2019
Recommendation as a Communication Game: Self-Supervised Bot-Play for Goal-oriented Dialogue
EMNLP 2019
From the Token to the Review: A Hierarchical Multimodal approach to Opinion Mining
EMNLP 2019
Sunny and Dark Outside?! Improving Answer Consistency in VQA through Entailed Question Generation
EMNLP 2019
MICRON: Multigranular Interaction for Contextualizing RepresentatiON in Non-factoid Question Answering
EMNLP 2019
English to Hindi Multi-modal Neural Machine Translation and Hindi Image Captioning
EMNLP 2019
Aligning Multilingual Word Embeddings for Cross-Modal Retrieval Task
EMNLP 2019
Understanding the Effect of Textual Adversaries in Multimodal Machine Translation
EMNLP 2019
<
1
…
119
120
121
…
128
>