Research Explorer
Papers
Conferences
Authors
Topics
Keywords
Trends
Achievements
Explore
← Core AI
Artificial Intelligence
›
Core AI
›
Multimodal Learning
13057 directly classified papers
Papers per year
2003: 1
2006: 3
2007: 6
2008: 2
2009: 5
2010: 2
2011: 3
2012: 6
2013: 24
2014: 20
2015: 46
2016: 109
2017: 205
2018: 299
2019: 622
2020: 675
2021: 987
2022: 1084
2023: 1697
2024: 2500
2025: 3654
2026: 1107
Papers
Convolutional Neural Network Architectures for Matching Natural Language Sentences
NIPS 2014
Self-Calibration and Visual SLAM with a Multi-Camera System on a Micro Aerial Vehicle
RSS 2014
Bayesian Co-Boosting for Multi-modal Gesture Recognition
JMLR 2014
Multimodal Neural Language Models
ICML 2014
Multimodal Learning with Deep Boltzmann Machines
JMLR 2014
Heterogeneous Visual Features Fusion via Sparse Multimodal Machine
CVPR 2013
Topical Video Object Discovery from Key Frames by Modeling Word Co-occurrence Prior
CVPR 2013
MAGIC Summoning: Towards Automatic Suggesting and Testing of Gestures With Low Probability of False Positives During Use
JMLR 2013
Bringing Semantics into Focus Using Visual Abstraction
CVPR 2013
From Actemes to Action: A Strongly-Supervised Representation for Detailed Action Understanding
ICCV 2013
MODEC: Multimodal Decomposable Models for Human Pose Estimation
CVPR 2013
Joint Detection, Tracking and Mapping by Semantic Bundle Adjustment
CVPR 2013
A Thousand Frames in Just a Few Words: Lingual Description of Videos through Latent Topics and Sparse Object Stitching
CVPR 2013
Bayesian Structure Learning for Functional Neuroimaging
AISTATS 2013
Multisensory Encoding, Decoding, and Identification
NIPS 2013
Learning to Predict Gaze in Egocentric Video
ICCV 2013
The Multi-Task Learning View of Multimodal Data
ACML 2013
A Bayesian Approach to Multimodal Visual Dictionary Learning
CVPR 2013
Cross-View Image Geolocalization
CVPR 2013
Toward Interactive Grounded Language Acqusition
RSS 2013
Video Event Understanding Using Natural Language Descriptions
ICCV 2013
Predicting Primary Gaze Behavior Using Social Saliency Fields
ICCV 2013
Keyframe-Based Visual-Inertial SLAM using Nonlinear Optimization
RSS 2013
Perspective Motion Segmentation via Collaborative Clustering
ICCV 2013
Real-Time Body Tracking with One Depth Camera and Inertial Sensors
ICCV 2013
<
1
…
519
520
521
522
523
>