← Learning Types

Deep Learning › Learning Types ›

Multi-Modal Learning

3194 directly classified papers

Papers per year

Papers

Identifying Visible Actions in Lifestyle Vlogs ACL 2019

Dense Procedure Captioning in Narrated Instructional Videos ACL 2019

Like a Baby: Visually Situated Neural Language Acquisition ACL 2019

Improving Referring Expression Grounding With Cross-Modal Attention-Guided Erasing CVPR 2019

Learning to Detect Human-Object Interactions With Knowledge CVPR 2019

Learning Words by Drawing Images CVPR 2019

ContextDesc: Local Descriptor Augmentation With Cross-Modality Context CVPR 2019

OK-VQA: A Visual Question Answering Benchmark Requiring External Knowledge CVPR 2019

LAEO-Net: Revisiting People Looking at Each Other in Videos CVPR 2019

Text Guided Person Image Synthesis CVPR 2019

Neural Sequential Phrase Grounding (SeqGROUND) CVPR 2019

CLEVR-Ref+: Diagnosing Visual Reasoning With Referring Expressions CVPR 2019

MSCap: Multi-Style Image Captioning With Unpaired Stylized Text CVPR 2019

3D-SIS: 3D Semantic Instance Segmentation of RGB-D Scans CVPR 2019

Fast User-Guided Video Object Segmentation by Interaction-And-Propagation Networks CVPR 2019

CapSal: Leveraging Captioning to Boost Semantics for Salient Object Detection CVPR 2019

JuICe: A Large Scale Distantly Supervised Dataset for Open Domain Context-based Code Generation EMNLP 2019

Robust Navigation with Language Pretraining and Stochastic Sampling EMNLP 2019

Recommendation as a Communication Game: Self-Supervised Bot-Play for Goal-oriented Dialogue EMNLP 2019

From the Token to the Review: A Hierarchical Multimodal approach to Opinion Mining EMNLP 2019

Sunny and Dark Outside?! Improving Answer Consistency in VQA through Entailed Question Generation EMNLP 2019

MICRON: Multigranular Interaction for Contextualizing RepresentatiON in Non-factoid Question Answering EMNLP 2019

English to Hindi Multi-modal Neural Machine Translation and Hindi Image Captioning EMNLP 2019

Aligning Multilingual Word Embeddings for Cross-Modal Retrieval Task EMNLP 2019

Understanding the Effect of Textual Adversaries in Multimodal Machine Translation EMNLP 2019