Supervised Feature Selection in Graphs with Path Coding Penalties and Network Flows

Julien Mairal; Bin Yu

JMLR 2013

Abstract

We consider supervised learning problems where the features are embedded in a graph, such as gene expressions in a gene network. In this context, it is of much interest to automatically select a subgraph with few connected components; by exploiting prior knowledge, one can indeed improve the prediction performance or obtain results that are easier to interpret. Regularization or penalty functions for selecting features in graphs have recently been proposed, but they raise new algorithmic challenges. For example, they typically require solving a combinatorially hard selection problem among all connected subgraphs. In this paper, we propose computationally feasible strategies to select a sparse and well-connected subset of features sitting on a directed acyclic graph (DAG). We introduce structured sparsity penalties over paths on a DAG called “path coding” penalties. Unlike existing regularization functions that model long-range interactions between features in a graph, path coding penalties are tractable. The penalties and their proximal operators involve path selection problems, which we efficiently solve by leveraging network flow optimization. We experimentally show on synthetic, image, and genomic data that our approach is scalable and leads to more connected subgraphs than other regularization functions for graphs.

© JMLR 2013.

Authors

Julien Mairal, Bin Yu

Topics

Machine Learning > Optimization & Theory > Optimization

Mathematics & Optimization > Mathematics > Graph Theory

Mathematics & Optimization > Optimization > Combinatorial Optimization

Keywords

feature selection, network flow, structured sparsity, directed acyclic graph, gene network, path coding penalty

Related papers

Parallel Vector Field Embedding 2013

Semi-Supervised Learning Using Greedy Max-Cut 2013

Random Spanning Trees and the Prediction of Weighted Graphs 2013

JKernelMachines: A Simple Framework for Kernel Machines 2013

Conjugate Relation between Loss Functions and Uncertainty Sets in Classification Problems 2013