Research Explorer
Papers
Conferences
Authors
Topics
Keywords
Trends
Achievements
Explore
← Learning Types
Deep Learning
›
Learning Types
›
Multi-Modal Learning
3194 directly classified papers
Papers per year
2003: 1
2010: 1
2011: 1
2013: 5
2014: 3
2015: 9
2016: 23
2017: 49
2018: 78
2019: 158
2020: 223
2021: 261
2022: 354
2023: 471
2024: 705
2025: 835
2026: 17
Papers
Combating Human Trafficking with Multimodal Deep Models
ACL 2017
A Corpus of Natural Language for Visual Reasoning
ACL 2017
Demographic Inference on Twitter using Recursive Neural Networks
ACL 2017
Twitter Demographic Classification Using Deep Multi-modal Multi-task Learning
ACL 2017
AliMe Chat: A Sequence to Sequence and Rerank based Chatbot Engine
ACL 2017
Combining Models from Multiple Sources for RGB-D Scene Recognition
IJCAI 2017
MAT: A Multimodal Attentive Translator for Image Captioning
IJCAI 2017
Video Highlight Prediction Using Audience Chat Reactions
EMNLP 2017
Multi-modal Summarization for Asynchronous Collection of Text, Image, Audio and Video
EMNLP 2017
Tensor Fusion Network for Multimodal Sentiment Analysis
EMNLP 2017
Extracting Visual Knowledge from the Web with Multimodal Learning
IJCAI 2017
openXBOW -- Introducing the Passau Open-Source Crossmodal Bag-of-Words Toolkit
JMLR 2017
Graph-Structured Representations for Visual Question Answering
CVPR 2017
Deep Reinforcement Learning-Based Image Captioning With Embedding Reward
CVPR 2017
Missing Modalities Imputation via Cascaded Residual Autoencoder
CVPR 2017
SCC: Semantic Context Cascade for Efficient Action Detection
CVPR 2017
Weakly Supervised Dense Video Captioning
CVPR 2017
Person Search With Natural Language Description
CVPR 2017
Instance-Aware Image and Sentence Matching With Selective Multimodal LSTM
CVPR 2017
An empirical study on the effectiveness of images in Multimodal Neural Machine Translation
EMNLP 2017
Sound-Word2Vec: Learning Word Representations Grounded in Sounds
EMNLP 2017
Deriving continous grounded meaning representations from referentially structured multimodal contexts
EMNLP 2017
Image Pivoting for Learning Multilingual Multimodal Representations
EMNLP 2017
Multimodal Learning and Reasoning for Visual Question Answering
NIPS 2017
Best of Both Worlds: Transferring Knowledge from Discriminative Learning to a Generative Visual Dialog Model
NIPS 2017
<
1
…
124
125
126
127
128
>