Benchmarking Diverse-Modal Entity Linking with Generative Models

Sijia Wang; Alexander Hanbo Li; Henghui Zhu; Sheng Zhang; Pramuditha Perera; Chung-Wei Hang; Jie Ma; William Yang Wang; Zhiguo Wang; Vittorio Castelli; Bing Xiang; Patrick Ng

2023 ACL ACL 2023

Benchmarking Diverse-Modal Entity Linking with Generative Models

Abstract

AbstractEntities can be expressed in diverse formats, such as texts, images, or column names and cell values in tables. While existing entity linking (EL) models work well on per modality configuration, such as text-only EL, visual grounding or schema linking, it is more challenging to design a unified model for diverse modality configurations. To bring various modality configurations together, we constructed a benchmark for diverse-modal EL (DMEL) from existing EL datasets, covering all three modalities including text, image and table. To approach the DMEL task, we proposed a generative diverse-modal model (GDMM) following a multimodal-encoder-decoder paradigm. Pre-training GDMM with rich corpora builds a solid foundation for DMEL without storing the entire KB for inference. Fine-tuning GDMM builds a stronger DMEL baseline, outperforming state-of-the-art task-specific EL models by 8.51 F1 score on average. Additionally, extensive error analyses are conducted to highlight the challenge of DMEL, facilitating future researches on this task.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Deep Learning and Machine Learning

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Sijia Wang , Alexander Hanbo Li , Henghui Zhu , Sheng Zhang , Pramuditha Perera , Chung-Wei Hang , Jie Ma , William Yang Wang , Zhiguo Wang , Vittorio Castelli , Bing Xiang , Patrick Ng

Topics

Artificial Intelligence > Core AI > Multimodal Learning Machine Learning > Application Areas > Domain Adaptation Deep Learning > Models > Generative Models

Keywords

benchmark evaluation entity linking multimodal learning generative model table understanding

Download PDF

History Semantic Graph Enhanced Conversational KBQA with Temporal Information Modeling 2023

Efficient Transformers with Dynamic Token Pooling 2023

HHU at SemEval-2023 Task 3: An Adapter-based Approach for News Genre Classification 2023

NAP at SemEval-2023 Task 3: Is Less Really More? (Back-)Translation as Data Augmentation Strategies for Detecting Persuasion Techniques 2023

Benchmarking Diverse-Modal Entity Linking with Generative Models

Abstract

Authors

Topics

Keywords

Related papers