From End-to-end to Step-by-step: Learning to Abstract via Abductive Reinforcement Learning

Zilong Wang; Jiongda Wang; Xiaoyong Chen; Meng Wang; Ming Ma; Zhipeng Wang; Zhenyu Zhou; Tianming Yang; Wang-Zhou Dai

2025 IJCAI IJCAI 2025

From End-to-end to Step-by-step: Learning to Abstract via Abductive Reinforcement Learning

Abstract

Abstraction is a critical technique in general problem-solving, allowing complex tasks to be decomposed into smaller, manageable sub-tasks. While traditional symbolic planning relies on predefined primitive symbols to construct structured abstractions, its reliance on formal representations limits applicability to real-world tasks. On the other hand, reinforcement learning excels at learning end-to-end policies directly from sensory inputs in unstructured environments but struggles with compositional generalization in complex tasks with delayed rewards. In this paper, we propose Abductive Abstract Reinforcement Learning (A2RL), a novel neuro-symbolic RL framework bridging the two paradigms based on Abductive Learning (ABL), enabling RL agents to learn abstractions directly from raw sensory inputs without predefined symbols. A2RL induces a finite state machine to represent high-level, step-by-step procedures, where each abstract state corresponds to a sub-algebra of the original Markov Decision Process (MDP). This approach not only bridges the gap between symbolic abstraction and sub-symbolic learning but also provides a natural mechanism for the emergence of new symbols. Experiments show that A2RL can mitigate the delayed reward problem and improve the generalization capability compared to traditional end-to-end RL methods.

🌉 Interdisciplinary Bridge — Machine Learning and Reinforcement Learning

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Deep Learning, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics

Authors

Zilong Wang , Jiongda Wang , Xiaoyong Chen , Meng Wang , Ming Ma , Zhipeng Wang , Zhenyu Zhou , Tianming Yang , Wang-Zhou Dai

Topics

Machine Learning > Learning Types > Continual Learning Reinforcement Learning > Methods > Deep RL

Keywords

abductive learning symbolic planning state machine neuro-symbolic learning

Download PDF

Related papers

Learning Advanced Self-Attention for Linear Transformers in the Singular Value Domain 2025

Responsibility Anticipation and Attribution in LTLf 2025

Argument-based Multi-Issue Negotiation 2025

Online Resource Sharing: Better Robust Guarantees via Randomized Strategies 2025

Equitable Mechanism Design for Facility Location 2025