More than Text: Multi-modal Chinese Word Segmentation

Dong Zhang; Zheng Hu; Shoushan Li; Hanqian Wu; Qiaoming Zhu; Guodong Zhou

2021 ACL ACL 2021

More than Text: Multi-modal Chinese Word Segmentation

Abstract

AbstractChinese word segmentation (CWS) is undoubtedly an important basic task in natural language processing. Previous works only focus on the textual modality, but there are often audio and video utterances (such as news broadcast and face-to-face dialogues), where textual, acoustic and visual modalities normally exist. To this end, we attempt to combine the multi-modality (mainly the converted text and actual voice information) to perform CWS. In this paper, we annotate a new dataset for CWS containing text and audio. Moreover, we propose a time-dependent multi-modal interactive model based on Transformer framework to integrate multi-modal information for word sequence labeling. The experimental results on three different training sets show the effectiveness of our approach with fusing text and audio.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Deep Learning and Natural Language Processing

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Dong Zhang , Zheng Hu , Shoushan Li , Hanqian Wu , Qiaoming Zhu , Guodong Zhou

Topics

Artificial Intelligence > Core AI > Multimodal Learning Natural Language Processing > Understanding > Semantic Analysis Deep Learning > Learning Types > Multi-Modal Learning Natural Language Processing > Applications > Word Sense Disambiguation

Keywords

sequence labeling speech processing chinese word segmentation multi-modal learning

Download PDF

Related papers

Out-of-Scope Intent Detection with Self-Supervision and Discriminative Training 2021

A Non-Autoregressive Edit-Based Approach to Controllable Text Simplification 2021

How Did This Get Funded?! Automatically Identifying Quirky Scientific Achievements 2021

Exploring Discourse Structures for Argument Impact Classification 2021

Language Embeddings for Typology and Cross-lingual Transfer Learning 2021