Research Explorer
Papers
Conferences
Authors
Topics
Keywords
Trends
Achievements
Explore
← Learning Types
Deep Learning
›
Learning Types
›
Multi-Modal Learning
3194 directly classified papers
Papers per year
2003: 1
2010: 1
2011: 1
2013: 5
2014: 3
2015: 9
2016: 23
2017: 49
2018: 78
2019: 158
2020: 223
2021: 261
2022: 354
2023: 471
2024: 705
2025: 835
2026: 17
Papers
Spatio-Temporal Dynamics and Semantic Attribute Enriched Visual Encoding for Video Captioning
CVPR 2019
Pointing Novel Objects in Image Captioning
CVPR 2019
Engaging Image Captioning via Personality
CVPR 2019
Multi-Level Multimodal Common Semantic Space for Image-Phrase Grounding
CVPR 2019
Vision-Based Navigation With Language-Based Assistance via Imitation Learning With Indirect Intervention
CVPR 2019
RUBi: Reducing Unimodal Biases for Visual Question Answering
NIPS 2019
Semantic Conditioned Dynamic Modulation for Temporal Sentence Grounding in Videos
NIPS 2019
Heterogeneous Graph Learning for Visual Commonsense Reasoning
NIPS 2019
Learn, Imagine and Create: Text-to-Image Generation from Prior Knowledge
NIPS 2019
Reflection Separation using a Pair of Unpolarized and Polarized Images
NIPS 2019
Adaptive Cross-Modal Few-shot Learning
NIPS 2019
Deep RGB-D Canonical Correlation Analysis For Sparse Depth Completion
NIPS 2019
Multimodal Model-Agnostic Meta-Learning via Task-Aware Modulation
NIPS 2019
Deep Multimodal Multilinear Fusion with High-order Polynomial Pooling
NIPS 2019
Self-Critical Reasoning for Robust Visual Question Answering
NIPS 2019
A coupled autoencoder approach for multi-modal analysis of cell types
NIPS 2019
Large Scale High-Resolution Land Cover Mapping With Multi-Resolution Data
CVPR 2019
A Simple Baseline for Audio-Visual Scene-Aware Dialog
CVPR 2019
Low-Rank HOCA: Efficient High-Order Cross-Modal Attention for Video Captioning
EMNLP 2019
Talk2Car: Taking Control of Your Self-Driving Car
EMNLP 2019
Fact-Checking Meets Fauxtography: Verifying Claims About Images
EMNLP 2019
Synchronously Generating Two Languages with Interactive Decoding
EMNLP 2019
Integrating Text and Image: Determining Multimodal Document Intent in Instagram Posts
EMNLP 2019
Reviews Meet Graphs: Enhancing User and Item Representations for Recommendation with Hierarchical Attentive Graph Neural Network
EMNLP 2019
Bilinear Attention Networks
NIPS 2018
<
1
…
120
121
122
…
128
>