An Effective Post-training Embedding Binarization Approach for Fast Online Top-K Passage Matching

yankai Chen; Yifei Zhang; Huifeng Guo; Ruiming Tang; Irwin King

2022 AACL AACL 2022

An Effective Post-training Embedding Binarization Approach for Fast Online Top-K Passage Matching

Abstract

AbstractWith the rapid development of Natural Language Understanding for information retrieval, fine-tuned deep language models, e.g., BERT-based, perform remarkably effective in passage searching tasks. To lower the architecture complexity, the recent state-of-the-art model ColBERT employs Contextualized Late Interaction paradigm to independently learn fine-grained query-passage representations. Apart from the architecture simplification, embedding binarization, as another promising branch in model compression, further specializes in the reduction of memory and computation overheads. In this concise paper, we propose an effective post-training embedding binarization approach over ColBERT, achieving both architecture-level and embedding-level optimization for online inference. The empirical results demonstrate the efficaciousness of our proposed approach, empowering it to perform online query-passage matching acceleration.

🌉 Interdisciplinary Bridge — Deep Learning and Machine Learning and Natural Language Processing

🧭 Keyword Pioneer — colbert model

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

yankai Chen , Yifei Zhang , Huifeng Guo , Ruiming Tang , Irwin King

Topics

Machine Learning > Application Areas > Efficient Computing Natural Language Processing > Applications > Information Retrieval Deep Learning > Optimization & Theory > Model Compression

Keywords

model compression knowledge distillation passage retrieval embedding binarization late interaction colbert model

Download PDF

Related papers

A Japanese Corpus of Many Specialized Domains for Word Segmentation and Part-of-Speech Tagging 2022

Enhancing Tabular Reasoning with Pattern Exploiting Training 2022

Re-contextualizing Fairness in NLP: The Case of India 2022

Adversarially Improving NMT Robustness to ASR Errors with Confusion Sets 2022

Promoting Pre-trained LM with Linguistic Features on Automatic Readability Assessment 2022