2007 NIPS NeurIPS 2007

McRank: Learning to Rank Using Multiple Classification and Gradient Boosting

Abstract

We cast the ranking problem as (1) multiple classification (“Mc”) (2) multiple or- dinal classification, which lead to computationally tractable learning algorithms for relevance ranking in Web search. We consider the DCG criterion (discounted cumulative gain), a standard quality measure in information retrieval. Our ap- proach is motivated by the fact that perfect classifications result in perfect DCG scores and the DCG errors are bounded by classification errors. We propose us- ing the Expected Relevance to convert class probabilities into ranking scores. The class probabilities are learned using a gradient boosting tree algorithm. Evalua- tions on large-scale datasets show that our approach can improve LambdaRank [5] and the regressions-based ranker [6], in terms of the (normalized) DCG scores. An efficient implementation of the boosting tree algorithm is also presented.

🌱 Topic Pioneer — Information Retrieval
🌉 Interdisciplinary Bridge — Computer Science and Data Science & Analytics and Machine Learning and Natural Language Processing
📈 Trend Setter — Information Retrieval
🧭 Keyword Pioneer — web search ranking
🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Machine Learning, Mathematics & Optimization, Natural Language Processing, Speech & Audio
🐣 Hot Topic Early Bird — information retrieval