Multimodal Learning

13057 directly classified papers

Papers

Convolutional Neural Network Architectures for Matching Natural Language Sentences NIPS 2014

Self-Calibration and Visual SLAM with a Multi-Camera System on a Micro Aerial Vehicle RSS 2014

Bayesian Co-Boosting for Multi-modal Gesture Recognition JMLR 2014

Multimodal Neural Language Models ICML 2014

Multimodal Learning with Deep Boltzmann Machines JMLR 2014

Heterogeneous Visual Features Fusion via Sparse Multimodal Machine CVPR 2013

Topical Video Object Discovery from Key Frames by Modeling Word Co-occurrence Prior CVPR 2013

MAGIC Summoning: Towards Automatic Suggesting and Testing of Gestures With Low Probability of False Positives During Use JMLR 2013

Bringing Semantics into Focus Using Visual Abstraction CVPR 2013

From Actemes to Action: A Strongly-Supervised Representation for Detailed Action Understanding ICCV 2013

MODEC: Multimodal Decomposable Models for Human Pose Estimation CVPR 2013

Joint Detection, Tracking and Mapping by Semantic Bundle Adjustment CVPR 2013

A Thousand Frames in Just a Few Words: Lingual Description of Videos through Latent Topics and Sparse Object Stitching CVPR 2013

Bayesian Structure Learning for Functional Neuroimaging AISTATS 2013

Multisensory Encoding, Decoding, and Identification NIPS 2013

Learning to Predict Gaze in Egocentric Video ICCV 2013

The Multi-Task Learning View of Multimodal Data ACML 2013

A Bayesian Approach to Multimodal Visual Dictionary Learning CVPR 2013

Cross-View Image Geolocalization CVPR 2013

Toward Interactive Grounded Language Acquisition RSS 2013

Video Event Understanding Using Natural Language Descriptions ICCV 2013

Predicting Primary Gaze Behavior Using Social Saliency Fields ICCV 2013

Keyframe-Based Visual-Inertial SLAM using Nonlinear Optimization RSS 2013

Perspective Motion Segmentation via Collaborative Clustering ICCV 2013

Real-Time Body Tracking with One Depth Camera and Inertial Sensors ICCV 2013