Research Explorer
Papers
Conferences
Authors
Topics
Keywords
Trends
Achievements
Explore
← Learning Types
Machine Learning
›
Learning Types
›
Multi-Modal Learning
1213 directly classified papers
Papers per year
2007: 2
2008: 1
2009: 1
2011: 2
2012: 5
2013: 5
2014: 1
2015: 5
2016: 8
2017: 21
2018: 42
2019: 42
2020: 69
2021: 72
2022: 149
2023: 143
2024: 258
2025: 370
2026: 17
Papers
Spatial-aware Speaker Diarizaiton for Multi-channel Multi-party Meeting
INTERSPEECH 2022
End-to-End Audio-Visual Neural Speaker Diarization
INTERSPEECH 2022
Dialogue Acts Aided Important Utterance Detection Based on Multiparty and Multimodal Information
INTERSPEECH 2022
Cross-Modal Decision Regularization for Simultaneous Speech Translation
INTERSPEECH 2022
Dual-Channel Evidence Fusion for Fact Verification over Texts and Tables
NAACL 2022
DUCK: Rumour Detection on Social Media by Modelling User and Comment Propagation Networks
NAACL 2022
Connecting the Dots between Audio and Text without Parallel Data through Visual Knowledge Transfer
NAACL 2022
Bilingual Tabular Inference: A Case Study on Indic Languages
NAACL 2022
GMN: Generative Multi-modal Network for Practical Document Information Extraction
NAACL 2022
VGNMN: Video-grounded Neural Module Networks for Video-Grounded Dialogue Systems
NAACL 2022
Dynamic Gazetteer Integration in Multilingual Models for Cross-Lingual and Cross-Domain Named Entity Recognition
NAACL 2022
Multi-Relational Graph Transformer for Automatic Short Answer Grading
NAACL 2022
Hate Speech and Counter Speech Detection: Conversational Context Does Matter
NAACL 2022
KCD: Knowledge Walks and Textual Cues Enhanced Political Perspective Detection in News Media
NAACL 2022
Dynamic Multistep Reasoning based on Video Scene Graph for Video Question Answering
NAACL 2022
Persona or Context? Towards Building Context adaptive Personalized Persuasive Virtual Sales Assistant
IJCNLP 2022
Missing Modality meets Meta Sampling (M3S): An Efficient Universal Approach for Multimodal Sentiment Analysis with Missing Modality
IJCNLP 2022
Exposing the Limits of Video-Text Models through Contrast Sets
NAACL 2022
Co-promotion Predictions of Financing Market and Sales Market: A Cooperative-Competitive Attention Approach
AAAI 2022
TWEETSPIN: Fine-grained Propaganda Detection in Social Media Using Multi-View Representations
NAACL 2022
ITA: Image-Text Alignments for Multi-Modal Named Entity Recognition
NAACL 2022
TIE: Topological Information Enhanced Structural Reading Comprehension on Web Pages
NAACL 2022
Natural Language Inference with Self-Attention for Veracity Assessment of Pandemic Claims
NAACL 2022
ReSTR: Convolution-Free Referring Image Segmentation Using Transformers
CVPR 2022
How to Represent Context Better? An Empirical Study on Context Modeling for Multi-turn Response Selection
EMNLP 2022
<
1
…
36
37
38
…
49
>