← Learning Types

Machine Learning › Learning Types ›

Multi-Modal Learning

1213 directly classified papers

Papers per year

Papers

Spatial-aware Speaker Diarizaiton for Multi-channel Multi-party Meeting INTERSPEECH 2022

End-to-End Audio-Visual Neural Speaker Diarization INTERSPEECH 2022

Dialogue Acts Aided Important Utterance Detection Based on Multiparty and Multimodal Information INTERSPEECH 2022

Cross-Modal Decision Regularization for Simultaneous Speech Translation INTERSPEECH 2022

Dual-Channel Evidence Fusion for Fact Verification over Texts and Tables NAACL 2022

DUCK: Rumour Detection on Social Media by Modelling User and Comment Propagation Networks NAACL 2022

Connecting the Dots between Audio and Text without Parallel Data through Visual Knowledge Transfer NAACL 2022

Bilingual Tabular Inference: A Case Study on Indic Languages NAACL 2022

GMN: Generative Multi-modal Network for Practical Document Information Extraction NAACL 2022

VGNMN: Video-grounded Neural Module Networks for Video-Grounded Dialogue Systems NAACL 2022

Dynamic Gazetteer Integration in Multilingual Models for Cross-Lingual and Cross-Domain Named Entity Recognition NAACL 2022

Multi-Relational Graph Transformer for Automatic Short Answer Grading NAACL 2022

Hate Speech and Counter Speech Detection: Conversational Context Does Matter NAACL 2022

KCD: Knowledge Walks and Textual Cues Enhanced Political Perspective Detection in News Media NAACL 2022

Dynamic Multistep Reasoning based on Video Scene Graph for Video Question Answering NAACL 2022

Persona or Context? Towards Building Context adaptive Personalized Persuasive Virtual Sales Assistant IJCNLP 2022

Missing Modality meets Meta Sampling (M3S): An Efficient Universal Approach for Multimodal Sentiment Analysis with Missing Modality IJCNLP 2022

Exposing the Limits of Video-Text Models through Contrast Sets NAACL 2022

Co-promotion Predictions of Financing Market and Sales Market: A Cooperative-Competitive Attention Approach AAAI 2022

TWEETSPIN: Fine-grained Propaganda Detection in Social Media Using Multi-View Representations NAACL 2022

ITA: Image-Text Alignments for Multi-Modal Named Entity Recognition NAACL 2022

TIE: Topological Information Enhanced Structural Reading Comprehension on Web Pages NAACL 2022

Natural Language Inference with Self-Attention for Veracity Assessment of Pandemic Claims NAACL 2022

ReSTR: Convolution-Free Referring Image Segmentation Using Transformers CVPR 2022

How to Represent Context Better? An Empirical Study on Context Modeling for Multi-turn Response Selection EMNLP 2022