Mxgra at SemEval-2020 Task 4: Common Sense Making with Next Token Prediction

Kris Collins; Max Grathwohl; Heba Ahmed

2020 COLING COLING 2020

Mxgra at SemEval-2020 Task 4: Common Sense Making with Next Token Prediction

Abstract

AbstractIn this paper, we explore solutions to a common sense making task in which a model must discern which of two sentences is against common sense. We used a pre-trained language model which we used to calculate complexity scores for input to discern which sentence contained an unlikely sequence of tokens. Other approaches we tested were word vector distances, which were used to find semantic outliers within a sentence, and siamese network. By using the pre-trained language model to calculate perplexity scores based on the sequence of tokens in input sentences, we achieved an accuracy of 75 percent.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Natural Language Processing

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Kris Collins , Max Grathwohl , Heba Ahmed

Topics

Artificial Intelligence > Core AI > Foundation Models Natural Language Processing > Understanding > Semantic Analysis Natural Language Processing > Generation > Language Modeling

Keywords

semantic analysis language model commonsense reasoning siamese network perplexity score

Download PDF

Related papers

Persuasiveness of News Editorials depending on Ideology and Personality 2020

A Graph Representation of Semi-structured Data for Web Question Answering 2020

Span-based Joint Entity and Relation Extraction with Attention-based Span-specific and Contextual Semantic Representations 2020

Hierarchical Chinese Legal event extraction via Pedal Attention Mechanism 2020

End-to-End Emotion-Cause Pair Extraction with Graph Convolutional Network 2020