Multimodal Learning
323 directly classified papers
Papers per year
2014: 1
2015: 1
2017: 8
2018: 11
2019: 11
2020: 27
2021: 23
2022: 46
2023: 35
2024: 53
2025: 104
2026: 3
Papers
KM-BART: Knowledge Enhanced Multimodal BART for Visual Commonsense Generation
ACL 2021
A Large-Scale Chinese Multimodal NER Dataset with Speech Clues
ACL 2021
Check It Again: Progressive Visual Question Answering via Visual Entailment
ACL 2021
Joint Multi-modal Aspect-Sentiment Analysis with Auxiliary Cross-modal Relation Detection
EMNLP 2021
Vision Matters When It Should: Sanity Checking Multimodal Machine Translation Models
EMNLP 2021
Unimodal and Crossmodal Refinement Network for Multimodal Sequence Fusion
EMNLP 2021
Improving Multimodal Fusion with Hierarchical Mutual Information Maximization for Multimodal Sentiment Analysis
EMNLP 2021
Towards Developing a Multilingual and Code-Mixed Visual Question Answering System by Knowledge Distillation
EMNLP 2021
An animated picture says at least a thousand words: Selecting Gif-based Replies in Multimodal Dialog
EMNLP 2021
Does Vision-and-Language Pretraining Improve Lexical Grounding?
EMNLP 2021
Progressive Modality Reinforcement for Human Multimodal Emotion Recognition From Unaligned Multimodal Sequences
CVPR 2021
Defending Multimodal Fusion Models Against Single-Source Adversaries
CVPR 2021
Domain-Robust VQA With Diverse Datasets and Methods but No Target Labels
CVPR 2021
Iconary: A Pictionary-Based Game for Testing Multimodal Communication with Drawings and Text
EMNLP 2021
DMRM: A Dual-Channel Multi-Hop Reasoning Model for Visual Dialog
AAAI 2020
SSCR: Iterative Language-Based Image Editing via Self-Supervised Counterfactual Reasoning
EMNLP 2020
Modeling Intra and Inter-modality Incongruity for Multi-Modal Sarcasm Detection
EMNLP 2020
No Gestures Left Behind: Learning Relationships between Spoken Language and Freeform Gestures
EMNLP 2020
Utilizing Multimodal Feature Consistency to Detect Adversarial Examples on Clinical Summaries
EMNLP 2020
IESTAC: English-Italian Parallel Corpus for End-to-End Speech-to-Text Machine Translation
EMNLP 2020
MAST: Multimodal Abstractive Summarization with Trimodal Hierarchical Attention
EMNLP 2020
Using Speaker-Aligned Graph Memory Block in Multimodally Attentive Emotion Recognition Network
INTERSPEECH 2020
Multi-Modal Embeddings Using Multi-Task Learning for Emotion Recognition
INTERSPEECH 2020
Group Gated Fusion on Attention-Based Bidirectional Alignment for Multimodal Emotion Recognition
INTERSPEECH 2020
A Multi-Scale Fusion Framework for Bimodal Speech Emotion Recognition
INTERSPEECH 2020