Do sentence embeddings capture discourse properties of sentences from Scientific Abstracts ?

Laurine Huber; Chaker Memmadi; Mathilde Dargnat; Yannick Toussaint

EMNLP 2020

Abstract

We introduce four tasks designed to determine which sentence encoders best capture discourse properties of sentences from scientific abstracts, namely coherence and cohesion between clauses of a sentence, and discourse relations within sentences. We show that even if contextual encoders such as BERT or SciBERT encode the coherence in discourse units, they do not help to predict three discourse relations commonly used in scientific abstracts. We discuss what these results underline, namely that these discourse relations are based on particular phrasing that allows non-contextual encoders to perform well.

Topics

Natural Language Processing > Understanding > Semantic Analysis
Natural Language Processing > Understanding > Syntax
Natural Language Processing > Resources & Methods > Text Representation
Deep Learning > Learning Types > Representation Learning
Artificial Intelligence > Core AI > Natural Language Processing

Keywords

text representation, sentence embedding, sentence encoder, discourse relation, scientific abstract, contextual encoder, coherence, cohesion, discourse properties
