Popularity Agnostic Evaluation of Knowledge Graph Embeddings

Aisha Mohamed; Shameem Parambath; Zoi Kaoudi; Ashraf Aboulnaga

UAI 2020

Abstract

In this paper, we show that the distribution of entities and relations in common knowledge graphs is highly skewed, with some entities and relations being much more popular than the rest. We show that while knowledge graph embedding models give state-of-the-art performance in many relational learning tasks such as link prediction, current evaluation metrics like hits@k and mrr are biased towards popular entities and relations. We propose two new evaluation metrics, strat-hits@k and strat-mrr, which are unbiased estimators of the true hits@k and mrr when the items follow a power-law distribution. Our new metrics are generalizations of hits@k and mrr that take into account the popularity of the entities and relations in the data, with a tuning parameter determining how much emphasis the metric places on popular vs. unpopular items. Using our metrics, we run experiments on benchmark datasets to show that the performance of embedding models degrades as the popularity of the entities and relations decreases, and that current reported results overestimate the performance of these models by magnifying their accuracy on popular items.
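
To make the idea of a popularity-aware metric concrete, the following is a minimal sketch of plain hits@k and mrr next to a popularity-weighted variant. It is an illustration only: the abstract does not give the exact definition of strat-hits@k and strat-mrr, so the weight `1 / popularity**beta`, the parameter name `beta`, and all function names here are assumptions rather than the authors' formulation.

```python
from typing import Sequence


def hits_at_k(ranks: Sequence[int], k: int) -> float:
    """Fraction of test triples whose correct entity is ranked within the top k."""
    return sum(1 for r in ranks if r <= k) / len(ranks)


def mrr(ranks: Sequence[int]) -> float:
    """Mean of the reciprocal ranks of the correct entities."""
    return sum(1.0 / r for r in ranks) / len(ranks)


def popularity_weights(popularity: Sequence[float], beta: float) -> list:
    # ASSUMPTION: weight each test triple by 1 / popularity**beta.
    # beta = 0 gives uniform weights; larger beta down-weights popular items.
    return [1.0 / (p ** beta) for p in popularity]


def weighted_hits_at_k(ranks, popularity, k: int, beta: float) -> float:
    """Hypothetical popularity-weighted hits@k (not the paper's exact metric)."""
    w = popularity_weights(popularity, beta)
    return sum(wi for wi, r in zip(w, ranks) if r <= k) / sum(w)


def weighted_mrr(ranks, popularity, beta: float) -> float:
    """Hypothetical popularity-weighted mrr (not the paper's exact metric)."""
    w = popularity_weights(popularity, beta)
    return sum(wi / r for wi, r in zip(w, ranks)) / sum(w)


# Toy data: two popular items ranked well, two rare items ranked poorly.
ranks = [1, 2, 50, 100]
popularity = [100, 100, 1, 1]

plain_hits = hits_at_k(ranks, k=10)
plain_mrr = mrr(ranks)
uniform_hits = weighted_hits_at_k(ranks, popularity, k=10, beta=0.0)
uniform_mrr = weighted_mrr(ranks, popularity, beta=0.0)
rare_focused_hits = weighted_hits_at_k(ranks, popularity, k=10, beta=1.0)
rare_focused_mrr = weighted_mrr(ranks, popularity, beta=1.0)
```

With `beta = 0` the weighted versions reduce to the plain metrics, which mirrors the abstract's description of the new metrics as generalizations of hits@k and mrr. On this toy data, raising `beta` lowers both scores, because the well-ranked items are the popular ones and their contribution shrinks.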

Topics

Machine Learning > Core Methods > Metric Learning
Machine Learning > Core Methods > Embedding Learning
Knowledge & Reasoning > Representation > Knowledge Graphs

Keywords

link prediction, knowledge graph embedding, evaluation metric, mean reciprocal rank, hits at k, entity popularity
