Collaborative Filtering on a Budget

Alexandros Karatzoglou; Alex Smola; Markus Weimer

AISTATS 2010

Abstract

Matrix factorization is a successful technique for building collaborative filtering systems. While it works well on a large range of problems, it is also known for requiring significant amounts of storage for each user or item added to the database. This is a problem whenever the collaborative filtering task is larger than the medium-sized Netflix Prize data. In this paper, we propose a new model for representing and compressing matrix factors via hashing. This allows an essentially unbounded number of users and items to be represented in a pre-defined memory footprint, at a graceful storage / performance trade-off. It allows us to scale recommender systems to very large numbers of users or, conversely, to obtain very good performance even for tiny models (e.g. 400 kB of data suffice for a representation of the EachMovie problem). We provide both experimental results and approximation bounds for our compressed representation, and we show how this approach can be extended to multipartite problems.
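To make the idea of compressing matrix factors via hashing concrete, the following is a minimal sketch and not the paper's implementation. It assumes one plausible scheme: each entry (row, j) of a virtual factor matrix is read from a shared, fixed-size weight array at a hashed position, multiplied by a hashed sign. The names `HashedFactors`, `predict`, `num_buckets`, and `salt`, as well as the use of CRC32 as the hash function, are illustrative assumptions.

```python
import zlib


def _hash(key: str) -> int:
    """Deterministic 32-bit hash (CRC32 is an assumption chosen for portability)."""
    return zlib.crc32(key.encode("utf-8"))


class HashedFactors:
    """Virtual (rows x dim) factor matrix backed by a fixed-size weight array.

    Hypothetical helper: any row id can be queried, but storage never grows
    beyond num_buckets floats, so distinct entries may collide.
    """

    def __init__(self, num_buckets: int, dim: int, salt: str):
        self.weights = [0.0] * num_buckets  # the only stored parameters
        self.dim = dim
        self.salt = salt  # keeps user and item hash functions distinct

    def _slot(self, row_id, j: int):
        h = _hash(f"{self.salt}:{row_id}:{j}")
        sign = 1.0 if h & 1 else -1.0          # lowest bit picks the sign
        index = (h >> 1) % len(self.weights)   # remaining bits pick the bucket
        return index, sign

    def row(self, row_id):
        """Reconstruct the dim-dimensional factor vector for row_id."""
        out = []
        for j in range(self.dim):
            index, sign = self._slot(row_id, j)
            out.append(sign * self.weights[index])
        return out


def predict(users: HashedFactors, items: HashedFactors, user_id, item_id) -> float:
    """Predicted rating: inner product of the reconstructed factor vectors."""
    u = users.row(user_id)
    m = items.row(item_id)
    return sum(a * b for a, b in zip(u, m))
```

In this sketch, adding a new user or item costs no extra memory: its factor vector is looked up through the hash, and collisions between entries are the price paid for the fixed footprint. Training (for example, by stochastic gradient descent on the shared weights) is omitted.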

Topics

Machine Learning > Core Methods > Representation Learning
Data Science & Analytics > Applications > Recommender Systems
Machine Learning > Application Areas > Model Compression
Machine Learning > Application Areas > Recommender Systems

Keywords

model compression, matrix factorization, collaborative filtering, hash encoding, recommender system
