Rank-Approximate Nearest Neighbor Search: Retaining Meaning and Speed in High Dimensions

Parikshit Ram; Dongryeol Lee; Hua Ouyang; Alexander G. Gray

2009 NIPS NeurIPS 2009

Rank-Approximate Nearest Neighbor Search: Retaining Meaning and Speed in High Dimensions

Abstract

The long-standing problem of efficient nearest-neighbor (NN) search has ubiquitous applications ranging from astrophysics to MP3 fingerprinting to bioinformatics to movie recommendations. As the dimensionality of the dataset increases, exact NN search becomes computationally prohibitive; (1+eps)-distance-approximate NN search can provide large speedups but risks losing the meaning of NN search present in the ranks (ordering) of the distances. This paper presents a simple, practical algorithm allowing the user to, for the first time, directly control the true accuracy of NN search (in terms of ranks) while still achieving the large speedups over exact NN. Experiments with high-dimensional datasets show that it often achieves faster and more accurate results than the best-known distance-approximate method, with much more stable behavior.

🌉 Interdisciplinary Bridge — Computer Science and Data Science & Analytics and Machine Learning

📈 Trend Setter — Data Mining

🧭 Keyword Pioneer — high-dimensional indexing

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Speech & Audio

🐣 Hot Topic Early Bird — nearest neighbor search

Authors

Parikshit Ram , Dongryeol Lee , Hua Ouyang , Alexander G. Gray

Topics

Machine Learning > Core Methods > Metric Learning Machine Learning > Optimization & Theory > Optimization Data Science & Analytics > Methods > Data Mining Computer Science > Foundations > Algorithms Computer Science > Applications > Information Retrieval Data Science & Analytics > Applications > Information Retrieval Machine Learning > Core Methods > Retrieval

Keywords

Download PDF

Related papers

Solving Stochastic Games 2009

Bilinear classifiers for visual recognition 2009

Zero-shot Learning with Semantic Output Codes 2009

Matrix Completion from Power-Law Distributed Samples 2009

Heavy-Tailed Symmetric Stochastic Neighbor Embedding 2009