Iterative development of family history annotation guidelines using a synthetic corpus of clinical text

Taraka Rama; Pål Brekke; Øystein Nytrø; Lilja Øvrelid

2018 EMNLP EMNLP 2018

Iterative development of family history annotation guidelines using a synthetic corpus of clinical text

Abstract

AbstractIn this article, we describe the development of annotation guidelines for family history information in Norwegian clinical text. We make use of incrementally developed synthetic clinical text describing patients’ family history relating to cases of cardiac disease and present a general methodology which integrates the synthetically produced clinical statements and guideline development. We analyze inter-annotator agreement based on the developed guidelines and present results from experiments aimed at evaluating the validity and applicability of the annotated corpus using machine learning techniques. The resulting annotated corpus contains 477 sentences and 6030 tokens. Both the annotation guidelines and the annotated corpus are made freely available and as such constitutes the first publicly available resource of Norwegian clinical text.

🧭 Keyword Pioneer — family history

🐣 Hot Topic Early Bird — clinical text

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Security & Privacy, Speech & Audio

Authors

Taraka Rama , Pål Brekke , Øystein Nytrø , Lilja Øvrelid

Topics

Machine Learning > Learning Types > Weakly Supervised Learning

Keywords

clinical text inter-annotator agreement annotation guideline corpus development family history

Download PDF

Related papers

Speeding Up Neural Machine Translation Decoding by Cube Pruning 2018

Limitations in learning an interpreted language with recurrent models 2018

Results of the sixth edition of the BioASQ Challenge 2018

Neural Segmental Hypergraphs for Overlapping Mention Recognition 2018

Hybrid Neural Attention for Agreement/Disagreement Inference in Online Debates 2018