Research Explorer
Papers
Conferences
Authors
Topics
Keywords
Trends
Achievements
Explore
← Learning Types
Machine Learning
›
Learning Types
›
Multi-Modal Learning
1213 directly classified papers
Papers per year
2007: 2
2008: 1
2009: 1
2011: 2
2012: 5
2013: 5
2014: 1
2015: 5
2016: 8
2017: 21
2018: 42
2019: 42
2020: 69
2021: 72
2022: 149
2023: 143
2024: 258
2025: 370
2026: 17
Papers
Cross-Modal Cross-Domain Moment Alignment Network for Person Search
CVPR 2020
Cross-Lingual Low-Resource Set-to-Description Retrieval for Global E-Commerce
AAAI 2020
Asymmetrical Hierarchical Networks with Attentive Interactions for Interpretable Review-Based Recommendation
AAAI 2020
Discriminative Sentence Modeling for Story Ending Prediction
AAAI 2020
Hearing Lips: Improving Lip Reading by Distilling Speech Recognizers
AAAI 2020
Multi-Modality Cross Attention Network for Image and Sentence Matching
CVPR 2020
Social Influence Does Matter: User Action Prediction for In-Feed Advertising
AAAI 2020
TemPEST: Soft Template-Based Personalized EDM Subject Generation through Collaborative Summarization
AAAI 2020
PHASEN: A Phase-and-Harmonics-Aware Speech Enhancement Network
AAAI 2020
A Multi-Scale Approach for Graph Link Prediction
AAAI 2020
Reinforcement-Learning Based Portfolio Management with Augmented Asset Movement Prediction States
AAAI 2020
DeepDualMapper: A Gated Fusion Network for Automatic Map Extraction Using Aerial Images and Trajectories
AAAI 2020
Learning Multi-Modal Biomarker Representations via Globally Aligned Longitudinal Enrichments
AAAI 2020
M3ER: Multiplicative Multimodal Emotion Recognition using Facial, Textual, and Speech Cues
AAAI 2020
Universal Weighting Metric Learning for Cross-Modal Matching
CVPR 2020
EmotiCon: Context-Aware Multimodal Emotion Recognition Using Frege's Principle
CVPR 2020
Language Generation with Multi-Hop Reasoning on Commonsense Knowledge Graph
EMNLP 2020
Exploring Hate Speech Detection in Multimodal Publications
WACV 2020
Audio-Visual Model Distillation Using Acoustic Images
WACV 2020
Fine-grained Image Classification and Retrieval by Combining Visual and Locally Pooled Textual Features
WACV 2020
Incorporating Multimodal Information in Open-Domain Web Keyphrase Extraction
EMNLP 2020
Multistage Fusion with Forget Gate for Multimodal Summarization in Open-Domain Videos
EMNLP 2020
Dynamic Context Selection for Document-level Neural Machine Translation via Reinforcement Learning
EMNLP 2020
XCOPA: A Multilingual Dataset for Causal Commonsense Reasoning
EMNLP 2020
HENIN: Learning Heterogeneous Neural Interaction Networks for Explainable Cyberbullying Detection on Social Media
EMNLP 2020
<
1
…
42
43
44
…
49
>