Multimodal Categorization of Crisis Events in Social Media

Mahdi Abavisani; Liwei Wu; Shengli Hu; Joel Tetreault; Alejandro Jaimes

2020 CVPR CVPR 2020

Multimodal Categorization of Crisis Events in Social Media

Abstract

Recent developments in image classification and natural language processing, coupled with the rapid growth in social media usage, have enabled fundamental advances in detecting breaking events around the world in real-time. Emergency response is one such area that stands to gain from these advances. By processing billions of texts and images a minute, events can be automatically detected to enable emergency response workers to better assess rapidly evolving situations and deploy resources accordingly. To date, most event detection techniques in this area have focused on image-only or text-only approaches, limiting detection performance and impacting the quality of information delivered to crisis response teams. In this paper, we present a new multimodal fusion method that leverages both images and texts as input. In particular, we introduce a cross-attention module that can filter uninformative and misleading components from weak modalities on a sample by sample basis. In addition, we employ a multimodal graph-based approach to stochastically transition between embeddings of different multimodal pairs during training to better regularize the learning process as well as dealing with limited training data by constructing new matched pairs from different samples. We show that our method outperforms the unimodal approaches and strong multimodal baselines by a large margin on three crisis-related tasks.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Computer Vision and Deep Learning and Interdisciplinary and Machine Learning

🧭 Keyword Pioneer — cross-attention module

🐣 Hot Topic Early Bird — multimodal fusion

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Mahdi Abavisani , Liwei Wu , Shengli Hu , Joel Tetreault , Alejandro Jaimes

Topics

Machine Learning > Learning Types > Semi-Supervised Learning Deep Learning > Architectures > Transformers Computer Vision > Analysis > Anomaly Detection Interdisciplinary > Social > Social Media Analysis Machine Learning > Learning Types > Multi-Modal Learning Deep Learning > Learning Types > Multi-Modal Learning Deep Learning > Learning Types > Multimodal Learning Artificial Intelligence > Core AI > Multi-Modal Learning Artificial Intelligence > Core AI > Information Extraction

Keywords

image classification text classification social media analysis event detection multimodal fusion cross-attention module social media crisis detection graph-based approach crisis event detection multimodal graph

Download PDF

Related papers

Deep Polarization Cues for Transparent Object Segmentation 2020

HRank: Filter Pruning Using High-Rank Feature Map 2020

Panoptic-Based Image Synthesis 2020

Select, Supplement and Focus for RGB-D Saliency Detection 2020

ClusterVO: Clustering Moving Instances and Estimating Visual Odometry for Self and Surroundings 2020