Symbolic Regression with a Learned Concept Library

Arya Grayeli; Atharva Sehgal; Omar Costilla-Reyes; Miles Cranmer; Swarat Chaudhuri

2024 NIPS NeurIPS 2024

Symbolic Regression with a Learned Concept Library

Abstract

We present a novel method for symbolic regression (SR), the task of searching for compact programmatic hypotheses that best explain a dataset. The problem is commonly solved using genetic algorithms; we show that we can enhance such methods by inducing a library of abstract textual concepts. Our algorithm, called LaSR, uses zero-shot queries to a large language model (LLM) to discover and evolve concepts occurring in known high-performing hypotheses. We discover new hypotheses using a mix of standard evolutionary steps and LLM-guided steps (obtained through zero-shot LLM queries) conditioned on discovered concepts. Once discovered, hypotheses are used in a new round of concept abstraction and evolution. We validate LaSR on the Feynman equations, a popular SR benchmark, as well as a set of synthetic tasks. On these benchmarks, LaSR substantially outperforms a variety of state-of-the-art SR approaches based on deep learning and evolutionary algorithms. Moreover, we show that LASR can be used to discover a new and powerful scaling law for LLMs.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Machine Learning

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Speech & Audio

📈 Trend Setter — Evolutionary Algorithm

🧭 Keyword Pioneer — concept library

🐣 Hot Topic Early Bird — genetic algorithm

Authors

Arya Grayeli , Atharva Sehgal , Omar Costilla-Reyes , Miles Cranmer , Swarat Chaudhuri

Topics

Artificial Intelligence > Core AI > Foundation Models Machine Learning > Core Methods > Regression Machine Learning > Learning Types > Self-Supervised Learning Machine Learning > Learning Types > Zero-Shot Learning Mathematics & Optimization > Optimization > Optimization Machine Learning > Learning Types > Meta-Learning Artificial Intelligence > Core AI > Large Language Models Deep Learning > Models > Large Language Models Mathematics & Optimization > Optimization > Evolutionary Algorithm Machine Learning > Learning Types > Symbolic Regression

Keywords

zero-shot learning concept learning evolutionary algorithm symbolic regression genetic algorithm concept discovery large language model concept library

Download PDF

Related papers

SPIQA: A Dataset for Multimodal Question Answering on Scientific Papers 2024

Training for Stable Explanation for Free 2024

NeuralSolver: Learning Algorithms For Consistent and Efficient Extrapolation Across General Tasks 2024

Expectation Alignment: Handling Reward Misspecification in the Presence of Expectation Mismatch 2024

MicroAdam: Accurate Adaptive Optimization with Low Space Overhead and Provable Convergence 2024