Measuring Structural Similarities in Finite MDPs

Hao Wang; Shaokang Dong; Ling Shao

IJCAI 2019

Abstract

In this paper, we investigate the structural similarities within a finite Markov decision process (MDP). We view a finite MDP as a heterogeneous directed bipartite graph and propose novel measures for state similarity and action similarity in a mutual reinforcement manner. We prove that the state similarity is a metric and the action similarity is a pseudometric. We also establish the connection between the proposed similarity measures and the optimal values of the MDP. Extensive experiments show that the proposed measures are effective.
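The abstract does not spell out how the graph is built, so the following is only a minimal sketch of one way to read "a finite MDP as a heterogeneous directed bipartite graph": state nodes point to the action nodes available in them, and each action node points to its possible successor states, weighted by transition probability. The input format, the function name `mdp_to_bipartite`, and the choice of one action node per state-action pair are assumptions made for illustration, not the authors' construction. Rewards and the similarity computation itself are omitted.

```python
from collections import defaultdict


def mdp_to_bipartite(transitions):
    """Build two edge sets from a finite MDP (hypothetical input format).

    transitions: {(state, action): {next_state: probability}}
    Returns (state_to_actions, action_to_states).
    """
    state_to_actions = defaultdict(list)  # edges: state node -> action node
    action_to_states = {}                 # edges: action node -> state node, with weights
    for (state, action), dist in transitions.items():
        # Assumption: each state-action pair gets its own action node.
        action_node = (state, action)
        state_to_actions[state].append(action_node)
        action_to_states[action_node] = dict(dist)
    return dict(state_to_actions), action_to_states


# Toy MDP with two states and three state-action pairs (made-up numbers).
toy = {
    ("s0", "left"): {"s0": 1.0},
    ("s0", "right"): {"s1": 0.8, "s0": 0.2},
    ("s1", "stay"): {"s1": 1.0},
}
state_edges, action_edges = mdp_to_bipartite(toy)
```

Every edge in this sketch goes either from a state node to an action node or from an action node to a state node, which is what makes the graph bipartite; state and action similarities would then be defined on the two node types in terms of each other.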

Topics

Machine Learning > Core Methods > Metric Learning
Reinforcement Learning > Applications > Value Iteration

Keywords

action similarity, optimal value, finite MDP, state similarity, mutual reinforcement

Related papers

Causal Embeddings for Recommendation: An Extended Abstract 2019

Pivotal Relationship Identification: The K-Truss Minimization Problem 2019

Portioning Using Ordinal Preferences: Fairness and Efficiency 2019

Probabilistic Strategy Logic 2019

Multi-Agent Pathfinding with Continuous Time 2019