Multimodal Named Entity Disambiguation for Noisy Social Media Posts

Seungwhan Moon; Leonardo Neves; Vitor Carvalho

2018 ACL ACL 2018

Multimodal Named Entity Disambiguation for Noisy Social Media Posts

Abstract

AbstractWe introduce the new Multimodal Named Entity Disambiguation (MNED) task for multimodal social media posts such as Snapchat or Instagram captions, which are composed of short captions with accompanying images. Social media posts bring significant challenges for disambiguation tasks because 1) ambiguity not only comes from polysemous entities, but also from inconsistent or incomplete notations, 2) very limited context is provided with surrounding words, and 3) there are many emerging entities often unseen during training. To this end, we build a new dataset called SnapCaptionsKB, a collection of Snapchat image captions submitted to public and crowd-sourced stories, with named entity mentions fully annotated and linked to entities in an external knowledge base. We then build a deep zeroshot multimodal network for MNED that 1) extracts contexts from both text and image, and 2) predicts correct entity in the knowledge graph embeddings space, allowing for zeroshot disambiguation of entities unseen in training set as well. The proposed model significantly outperforms the state-of-the-art text-only NED models, showing efficacy and potentials of the MNED task.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Deep Learning and Knowledge & Reasoning and Machine Learning and Natural Language Processing

🧭 Keyword Pioneer — zero-shot disambiguation

🐣 Hot Topic Early Bird — knowledge graph embedding

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Seungwhan Moon , Leonardo Neves , Vitor Carvalho

Topics

Machine Learning > Learning Types > Zero-Shot Learning Natural Language Processing > Understanding > Named Entity Recognition Knowledge & Reasoning > Representation > Knowledge Graphs Natural Language Processing > Applications > Named Entity Recognition Deep Learning > Learning Types > Multi-Modal Learning Deep Learning > Learning Types > Zero-Shot Learning Artificial Intelligence > Core AI > Natural Language Processing

Keywords

zero-shot learning entity linking named entity recognition multimodal learning entity disambiguation named entity disambiguation knowledge graph embedding zero-shot disambiguation

Download PDF

Related papers

Economic Event Detection in Company-Specific News Text 2018

Investigating Effective Parameters for Fine-tuning of Word Embeddings Using Only a Small Corpus 2018

SemAxis: A Lightweight Framework to Characterize Domain-Specific Word Semantics Beyond Sentiment 2018

Fighting Offensive Language on Social Media with Unsupervised Text Style Transfer 2018

Affordances in Grounded Language Learning 2018