Unsupervised Spectral Learning of Finite State Transducers

Raphael Bailly; Xavier Carreras; Ariadna Quattoni

2013 NIPS NeurIPS 2013

Unsupervised Spectral Learning of Finite State Transducers

Abstract

Finite-State Transducers (FST) are a standard tool for modeling paired input-output sequences and are used in numerous applications, ranging from computational biology to natural language processing. Recently Balle et al. presented a spectral algorithm for learning FST from samples of aligned input-output sequences. In this paper we address the more realistic, yet challenging setting where the alignments are unknown to the learning algorithm. We frame FST learning as finding a low rank Hankel matrix satisfying constraints derived from observable statistics. Under this formulation, we provide identifiability results for FST distributions. Then, following previous work on rank minimization, we propose a regularized convex relaxation of this objective which is based on minimizing a nuclear norm penalty subject to linear constraints and can be solved efficiently.

🌉 Interdisciplinary Bridge — Computer Science and Machine Learning

🧭 Keyword Pioneer — low rank minimization

🐣 Hot Topic Early Bird — unsupervised learning

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Speech & Audio

Authors

Raphael Bailly , Xavier Carreras , Ariadna Quattoni

Topics

Machine Learning > Core Methods > Representation Learning Machine Learning > Learning Types > Unsupervised Learning Machine Learning > Optimization & Theory > Theory Mathematics & Optimization > Mathematics > Linear Algebra Computer Science > Foundations > Algorithms Machine Learning > Core Methods > Dimensionality Reduction

Keywords

unsupervised learning convex relaxation low-rank matrix nuclear norm minimization nuclear norm regularization low rank approximation spectral learning hankel matrix nuclear norm low rank minimization low rank hankel matrix finite state transducer finite-state transducer

Download PDF

Related papers

Latent Structured Active Learning 2013

On Flat versus Hierarchical Classification in Large-Scale Taxonomies 2013

Generalized Method-of-Moments for Rank Aggregation 2013

Third-Order Edge Statistics: Contour Continuation, Curvature, and Cortical Connections 2013

Accelerated Mini-Batch Stochastic Dual Coordinate Ascent 2013