Research Explorer
Papers
Conferences
Authors
Topics
Keywords
Trends
Achievements
Explore
← Learning Types
Deep Learning
›
Learning Types
›
Multi-Modal Learning
3194 directly classified papers
Papers per year
2003: 1
2010: 1
2011: 1
2013: 5
2014: 3
2015: 9
2016: 23
2017: 49
2018: 78
2019: 158
2020: 223
2021: 261
2022: 354
2023: 471
2024: 705
2025: 835
2026: 17
Papers
Unsupervised Cross-Modal Alignment of Speech and Text Embedding Spaces
NIPS 2018
Hybrid Retrieval-Generation Reinforced Agent for Medical Image Report Generation
NIPS 2018
Chain of Reasoning for Visual Question Answering
NIPS 2018
Cooperative Learning of Audio and Video Models from Self-Supervised Synchronization
NIPS 2018
Multimodal Generative Models for Scalable Weakly-Supervised Learning
NIPS 2018
Self-Supervised Generation of Spatial Audio for 360° Video
NIPS 2018
Multimodal Language Analysis with Recurrent Multistage Fusion
EMNLP 2018
Temporally Grounding Natural Sentence in Video
EMNLP 2018
Improving Reinforcement Learning Based Image Captioning with Natural Language Prior
EMNLP 2018
RecipeQA: A Challenge Dataset for Multimodal Comprehension of Cooking Recipes
EMNLP 2018
TVQA: Localized, Compositional Video Question Answering
EMNLP 2018
Localizing Moments in Video with Temporal Language
EMNLP 2018
ICON: Interactive Conversational Memory Network for Multimodal Emotion Detection
EMNLP 2018
A Visual Attention Grounding Neural Model for Multimodal Machine Translation
EMNLP 2018
Evaluating Textual Representations through Image Generation
EMNLP 2018
End-to-end Image Captioning Exploits Distributional Similarity in Multimodal Space
EMNLP 2018
The MeMAD Submission to the WMT18 Multimodal Translation Task
EMNLP 2018
CUNI System for the WMT18 Multimodal Translation Task
EMNLP 2018
Ensemble Sequence Level Training for Multimodal MT: OSU-Baidu WMT18 Multimodal Machine Translation System Report
EMNLP 2018
Combining Character and Word Information in Neural Machine Translation Using a Multi-Level Attention
NAACL 2018
Multimodal Frame Identification with Multilingual Evaluation
NAACL 2018
Deep Models of Interactions Across Sets
ICML 2018
A Probabilistic Model for Joint Learning of Word Embeddings from Texts and Images
EMNLP 2018
Visual Attention Model for Name Tagging in Multimodal Social Media
ACL 2018
Stock Movement Prediction from Tweets and Historical Prices
ACL 2018
<
1
…
121
122
123
…
128
>