Language models can learn implicit multi-hop reasoning, but only if they have lots of training data

Yuekun Yao; Yupei Du; Dawei Zhu; Michael Hahn; Alexander Koller

EMNLP 2025

Abstract

Implicit reasoning is the ability of a language model to solve multi-hop reasoning tasks in a single forward pass, without chain of thought. We investigate this capability using GPT2-style language models trained from scratch on controlled k-hop reasoning datasets (k = 2, 3, 4). We show that while such models can indeed learn implicit k-hop reasoning, the required training data grows exponentially in k, and the required number of transformer layers grows linearly in k. We offer a theoretical explanation for why this depth growth is necessary. We further find that the data requirement can be mitigated, but not eliminated, through curriculum learning.
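
To make the notion of a k-hop query concrete, here is a minimal sketch of one possible toy setup. It is an assumption for illustration only, not the dataset construction used in the paper: entities are integers, each relation is a random function from entities to entities, and a k-hop query asks for the result of applying k relations in sequence to a start entity. The names `make_relations`, `answer`, and `all_queries` are hypothetical.

```python
import random
from itertools import product


def make_relations(num_entities, num_relations, seed=0):
    """Return a list of random functions, each stored as a lookup table.

    relations[r][e] is the entity reached from entity e via relation r.
    (Hypothetical toy construction, not the paper's dataset.)
    """
    rng = random.Random(seed)
    return [
        [rng.randrange(num_entities) for _ in range(num_entities)]
        for _ in range(num_relations)
    ]


def answer(relations, entity, path):
    """Follow the relations in `path` one hop at a time, starting at `entity`."""
    for r in path:
        entity = relations[r][entity]
    return entity


def all_queries(num_entities, num_relations, k):
    """Enumerate every (start entity, relation path) pair of length k."""
    for e in range(num_entities):
        for path in product(range(num_relations), repeat=k):
            yield e, path


relations = make_relations(num_entities=5, num_relations=3)

for k in (2, 3, 4):
    queries = list(all_queries(5, 3, k))
    e, path = queries[0]
    print(f"k={k}: {len(queries)} distinct queries, "
          f"e.g. start={e}, path={path} -> {answer(relations, e, path)}")
```

In this toy setup the number of distinct k-hop queries is (number of entities) × (number of relations)^k, so the query space itself grows exponentially in k. A model answering implicitly would have to produce the final entity directly, without writing out the intermediate entities that `answer` computes hop by hop.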

Topics

Machine Learning > Optimization & Theory > Learning Theory
Deep Learning > Architectures > Transformers
Natural Language Processing > Generation > Language Modeling
Natural Language Processing > Resources & Methods > Large Language Models
Artificial Intelligence > Core AI > Reasoning
Machine Learning > Learning Paradigms > Curriculum Learning

Keywords

curriculum learning, language model, multi-hop reasoning, implicit reasoning, reasoning chain, data scaling
