Importance sampling for unbiased on-demand evaluation of knowledge base population

Arun Chaganty; Ashwin Paranjape; Percy Liang; Christopher D. Manning

2017 EMNLP EMNLP 2017

Importance sampling for unbiased on-demand evaluation of knowledge base population

Abstract

AbstractKnowledge base population (KBP) systems take in a large document corpus and extract entities and their relations. Thus far, KBP evaluation has relied on judgements on the pooled predictions of existing systems. We show that this evaluation is problematic: when a new system predicts a previously unseen relation, it is penalized even if it is correct. This leads to significant bias against new systems, which counterproductively discourages innovation in the field. Our first contribution is a new importance-sampling based evaluation which corrects for this bias by annotating a new system’s predictions on-demand via crowdsourcing. We show this eliminates bias and reduces variance using data from the 2015 TAC KBP task. Our second contribution is an implementation of our method made publicly available as an online KBP evaluation service. We pilot the service by testing diverse state-of-the-art systems on the TAC KBP 2016 corpus and obtain accurate scores in a cost effective manner.

🌉 Interdisciplinary Bridge — Knowledge & Reasoning and Machine Learning

📈 Trend Setter — Sampling

🧭 Keyword Pioneer — system evaluation

🐣 Hot Topic Early Bird — statistical learning

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Arun Chaganty , Ashwin Paranjape , Percy Liang , Christopher D. Manning

Topics

Machine Learning > Optimization & Theory > Statistical Learning Knowledge & Reasoning > Representation > Knowledge Graphs Machine Learning > Learning Types > Sampling Machine Learning > Optimization & Theory > Sampling

Keywords

statistical learning information extraction importance sampling knowledge base population system evaluation

Download PDF

Related papers

Reinforced Video Captioning with Entailment Rewards 2017

Cross-lingual Character-Level Neural Morphological Tagging 2017

Inter-Weighted Alignment Network for Sentence Pair Modeling 2017

Investigating Different Syntactic Context Types and Context Representations for Learning Word Embeddings 2017

An Empirical Analysis of Edit Importance between Document Versions 2017