2021
ACL
ACL 2021
Textual Representations for Crosslingual Information Retrieval
Abstract
AbstractIn this paper, we explored different levels of textual representations for cross-lingual information retrieval. Beyond the traditional token level representation, we adopted the subword and character level representations for information retrieval that had shown to improve neural machine translation by reducing the out-of-vocabulary issues in machine translation. We found that crosslingual information retrieval performance can be improved by combining search results from subwords and token level representation. Additionally, we improved the search performance by combining and re-ranking the result sets from the different text representations for German, French and Japanese.
🌉
Interdisciplinary Bridge
— Computer Science and Deep Learning and Machine Learning and Natural Language Processing
🧭
Keyword Pioneer
— out-of-vocabulary issue
🐝
Cross-Pollinator
— Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Security & Privacy, Speech & Audio
Authors
Topics
Natural Language Processing > Applications > Information Retrieval
Natural Language Processing > Resources & Methods > Multilingual NLP
Computer Science > Applications > Information Retrieval
Deep Learning > Learning Types > Multi-Modal Learning
Machine Learning > Application Areas > Information Retrieval