Research Explorer
Papers
Conferences
Authors
Topics
Keywords
Trends
Achievements
Explore
← Core AI
Artificial Intelligence
›
Core AI
›
Multi-Modal Learning
1457 directly classified papers
Papers per year
2011: 1
2013: 4
2014: 3
2015: 3
2016: 9
2017: 11
2018: 27
2019: 61
2020: 109
2021: 87
2022: 153
2023: 213
2024: 391
2025: 384
2026: 1
Papers
MAAS: Multi-Modal Assignation for Active Speaker Detection
ICCV 2021
Curriculum Learning for Vision-and-Language Navigation
NIPS 2021
Multimodal Virtual Point 3D Detection
NIPS 2021
Catch-A-Waveform: Learning to Generate Audio from a Single Short Example
NIPS 2021
MERLOT: Multimodal Neural Script Knowledge Models
NIPS 2021
CSECU-DSG at SemEval-2021 Task 6: Orchestrating Multimodal Neural Architectures for Identifying Persuasion Techniques in Texts and Images
ACL 2021
LT3 at SemEval-2021 Task 6: Using Multi-Modal Compact Bilinear Pooling to Combine Visual and Textual Understanding in Memes
ACL 2021
YNU-HPCC at SemEval-2021 Task 6: Combining ALBERT and Text-CNN for Persuasion Detection in Texts and Images
ACL 2021
Multi-Scale Progressive Attention Network for Video Question Answering
ACL 2021
Constructing Multi-Modal Dialogue Dataset by Replacing Text with Semantically Relevant Images
ACL 2021
Mind Your Outliers! Investigating the Negative Impact of Outliers on Active Learning for Visual Question Answering
ACL 2021
Verb Knowledge Injection for Multilingual Event Processing
ACL 2021
Multimodal Multi-Speaker Merger & Acquisition Financial Modeling: A New Task, Dataset, and Neural Baselines
ACL 2021
Detecting Propaganda Techniques in Memes
ACL 2021
Multi-stage Pre-training over Simplified Multimodal Pre-training Models
ACL 2021
Hierarchical Context-aware Network for Dense Video Event Captioning
ACL 2021
Adaptive Fusion Techniques for Multimodal Data
EACL 2021
MERL: Multimodal Event Representation Learning in Heterogeneous Embedding Spaces
AAAI 2021
UWSpeech: Speech to Speech Translation for Unwritten Languages
AAAI 2021
RpBERT: A Text-image Relation Propagation-based BERT Model for Multimodal NER
AAAI 2021
An Unsupervised Sampling Approach for Image-Sentence Matching Using Document-level Structural Information
AAAI 2021
Humor Knowledge Enriched Transformer for Understanding Multimodal Humor
AAAI 2021
MUFASA: Multimodal Fusion Architecture Search for Electronic Health Records
AAAI 2021
Learning Intuitive Physics with Multimodal Generative Models
AAAI 2021
Dual Adversarial Graph Neural Networks for Multi-label Cross-modal Retrieval
AAAI 2021
<
1
…
48
49
50
…
59
>