Feature Set Embedding for Incomplete Data

David Grangier; Iain Melvin

NeurIPS (NIPS) 2010

Abstract

We present a new learning strategy for classification problems in which training and/or test data suffer from missing features. In previous work, instances are represented as vectors from some feature space, and one is forced either to impute missing values or to consider an instance-specific subspace. In contrast, our method treats each instance as a set of (feature, value) pairs, a representation that naturally handles missing values. Building on this framework, we propose a classification strategy for sets. Our proposal maps (feature, value) pairs into an embedding space and then non-linearly combines the set of embedded vectors. The embedding and the combination parameters are learned jointly on the final classification objective. This simple strategy allows great flexibility in encoding prior knowledge about the features in the embedding step and yields advantageous results compared to alternative solutions over several datasets.
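The pipeline described in the abstract (embed each observed (feature, value) pair, combine the embedded set, then classify) can be illustrated with a minimal forward-pass sketch. This is not the authors' implementation. The names `make_params`, `embed_pair`, and `score_set`, the tanh non-linearity, the mean/max pooling choices, and the layer sizes are all assumptions made for illustration. The parameters are random here, whereas the abstract states that they are learned jointly on the classification objective.

```python
import math
import random


def make_params(num_features, embed_dim, hidden_dim, num_classes, seed=0):
    """Randomly initialise hypothetical parameters (training is not shown)."""
    rng = random.Random(seed)

    def matrix(rows, cols):
        return [[rng.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]

    return {
        "feature_embed": matrix(num_features, embed_dim),  # one vector per feature id
        "W": matrix(hidden_dim, embed_dim + 1),            # projects [feature vector; value]
        "V": matrix(num_classes, hidden_dim),              # linear classifier on the pooled vector
    }


def matvec(m, x):
    """Multiply a list-of-rows matrix by a vector."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in m]


def embed_pair(params, feature, value):
    """Embed one (feature, value) pair: look up the feature vector, append the value, project."""
    p = params["feature_embed"][feature] + [value]
    return [math.tanh(h) for h in matvec(params["W"], p)]


def score_set(params, pairs, combine="mean"):
    """Score an instance given only its observed (feature, value) pairs."""
    if not pairs:
        raise ValueError("an instance needs at least one observed feature")
    embedded = [embed_pair(params, f, v) for f, v in pairs]
    columns = list(zip(*embedded))  # one tuple per embedding dimension
    if combine == "mean":
        pooled = [sum(c) / len(c) for c in columns]
    elif combine == "max":
        pooled = [max(c) for c in columns]
    else:
        raise ValueError("combine must be 'mean' or 'max'")
    return matvec(params["V"], pooled)  # one score per class


params = make_params(num_features=4, embed_dim=3, hidden_dim=5, num_classes=2)
complete = [(0, 0.2), (1, -1.0), (3, 0.7)]  # feature 2 is missing
incomplete = [(1, -1.0)]                    # only feature 1 is observed
scores_complete = score_set(params, complete)
scores_incomplete = score_set(params, incomplete)
```

Because pooling runs over a set, the scores do not depend on the order of the pairs, and an instance with fewer observed features needs no imputation: it simply contributes fewer embedded vectors.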

Topics

Machine Learning > Core Methods > Classification
Machine Learning > Core Methods > Embedding Learning
Machine Learning > Learning Types > Unsupervised Learning
Machine Learning > Learning Types > Feature Learning

Keywords

representation learning, classification, embedding space, feature embedding, incomplete data, missing data, missing feature, set classification, set representation
