2019
IJCAI
IJCAI 2019
Measuring Structural Similarities in Finite MDPs
Abstract
In this paper, we investigate the structural similarities within a finite Markov decision process (MDP). We view a finite MDP as a heterogeneous directed bipartite graph and propose novel measures for state similarity and action similarity in a mutual reinforcement manner. We prove that the state similarity is a metric and the action similarity is a pseudometric. We also establish the connection between the proposed similarity measures and the optimal values of the MDP. Extensive experiments show that the proposed measures are effective.
🌉
Interdisciplinary Bridge
— Machine Learning and Reinforcement Learning
🧭
Keyword Pioneer
— finite mdp
🐝
Cross-Pollinator
— Artificial Intelligence, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Reinforcement Learning