2025
ACL
What is an “Abstract Reasoner”? Revisiting Experiments and Arguments about Large Language Models
Abstract
Recent work has argued that large language models (LLMs) are not “abstract reasoners”, citing their poor zero-shot performance on a variety of challenging tasks as evidence. We revisit these experiments in order to add nuance to the claim. First, we show that while LLMs indeed perform poorly in a zero-shot setting, even tuning a small subset of parameters for input encoding can enable near-perfect performance. However, we also show that this finetuning does not necessarily transfer across datasets. We take this collection of empirical results as an invitation to (re-)open the discussion of what it means to be an “abstract reasoner”, and why it matters whether LLMs fit the bill.
Authors
Topics
Machine Learning > Learning Types > Zero-Shot Learning
Natural Language Processing > Resources & Methods > Large Language Models
Artificial Intelligence > Learning Paradigms > Zero-Shot Learning
Artificial Intelligence > Core AI > Large Language Models
Artificial Intelligence > Core AI > Reasoning
Machine Learning > Learning Types > Fine-Tuning