McRank: Learning to Rank Using Multiple Classification and Gradient Boosting

Ping Li; Qiang Wu; Christopher J. Burges

2007 NIPS NeurIPS 2007

McRank: Learning to Rank Using Multiple Classification and Gradient Boosting

Abstract

We cast the ranking problem as (1) multiple classiﬁcation (“Mc”) (2) multiple or- dinal classiﬁcation, which lead to computationally tractable learning algorithms for relevance ranking in Web search. We consider the DCG criterion (discounted cumulative gain), a standard quality measure in information retrieval. Our ap- proach is motivated by the fact that perfect classiﬁcations result in perfect DCG scores and the DCG errors are bounded by classiﬁcation errors. We propose us- ing the Expected Relevance to convert class probabilities into ranking scores. The class probabilities are learned using a gradient boosting tree algorithm. Evalua- tions on large-scale datasets show that our approach can improve LambdaRank [5] and the regressions-based ranker [6], in terms of the (normalized) DCG scores. An efﬁcient implementation of the boosting tree algorithm is also presented.

🌱 Topic Pioneer — Information Retrieval

🌉 Interdisciplinary Bridge — Computer Science and Data Science & Analytics and Machine Learning and Natural Language Processing

📈 Trend Setter — Information Retrieval

🧭 Keyword Pioneer — web search ranking

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Machine Learning, Mathematics & Optimization, Natural Language Processing, Speech & Audio

🐣 Hot Topic Early Bird — information retrieval

Authors

Ping Li , Qiang Wu , Christopher J. Burges

Topics

Machine Learning > Core Methods > Classification Machine Learning > Optimization & Theory > Optimization Natural Language Processing > Applications > Information Retrieval Computer Science > Applications > Information Retrieval Data Science & Analytics > Applications > Information Retrieval Machine Learning > Core Methods > Ranking Machine Learning > Core Methods > Ensemble Methods Machine Learning > Learning Types > Ranking

Keywords

information retrieval discounted cumulative gain gradient boosting learning to rank web search web search ranking dcg optimization multi-class classification multiple classification dcg ranking relevance ranking dcg scoring

Download PDF

Related papers

Exponential Family Predictive Representations of State 2007

Privacy-Preserving Belief Propagation and Sampling 2007

Efficient Principled Learning of Thin Junction Trees 2007

How SVMs can estimate quantiles and the median 2007

Rapid Inference on a Novel AND/OR graph for Object Detection, Segmentation and Parsing 2007