Learning Trees from Strings: A Strong Learning Algorithm for some Context-Free Grammars

Alexander Clark

2013 JMLR JMLR 2013

Learning Trees from Strings: A Strong Learning Algorithm for some Context-Free Grammars

Abstract

Standard models of language learning are concerned with weak learning: the learner, receiving as input only information about the strings in the language, must learn to generalise and to generate the correct, potentially infinite, set of strings generated by some target grammar. Here we define the corresponding notion of strong learning: the learner, again only receiving strings as input, must learn a grammar that generates the correct set of structures or parse trees. We formalise this using a modification of Gold's identification in the limit model, requiring convergence to a grammar that is isomorphic to the target grammar. We take as our starting point a simple learning algorithm for substitutable context-free languages, based on principles of distributional learning, and modify it so that it will converge to a canonical grammar for each language. We prove a corresponding strong learning result for a subclass of context-free grammars. [abs] [ pdf ][ bib ] © JMLR 2013. (edit, beta)

🌉 Interdisciplinary Bridge — Interdisciplinary and Machine Learning

📈 Trend Setter — Computational Linguistics

🧭 Keyword Pioneer — context-free grammar

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Deep Learning, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Speech & Audio

Authors

Alexander Clark

Topics

Machine Learning > Optimization & Theory > Learning Theory Interdisciplinary > Linguistics > Computational Linguistics Machine Learning > Learning Paradigms > Self-Supervised Learning

Keywords

grammar inference distributional learning grammatical inference context-free grammar strong learning identification in the limit

Download PDF

Related papers

Parallel Vector Field Embedding 2013

Semi-Supervised Learning Using Greedy Max-Cut 2013

Random Spanning Trees and the Prediction of Weighted Graphs 2013

JKernelMachines: A Simple Framework for Kernel Machines 2013

Conjugate Relation between Loss Functions and Uncertainty Sets in Classification Problems 2013