Research Explorer

Implications of Annotation Artifacts in Edge Probing Test Datasets

Sagnik Ray Choudhury, Jushaan Kalra

2023 CONLL

Investigating the Nature of Disagreements on Mid-Scale Ratings: A Case Study on the Abstractness-Concreteness Continuum

Urban Knupleš, Diego Frassinelli, Sabine Schulte im Walde

2023 CONLL

JaSPICE: Automatic Evaluation Metric Using Predicate-Argument Structures for Image Captioning Models

Yuiga Wada, Kanta Kaneda, Komei Sugiura

2023 CONLL

Med-HALT: Medical Domain Hallucination Test for Large Language Models

Ankit Pal, Logesh Kumar Umapathi, Malaikannan Sankarasubbu

2023 CONLL

Mind the instructions: a holistic evaluation of consistency and interactions in prompt-based learning

Lucas Weber, Elia Bruni, Dieuwke Hupkes

2023 CONLL

MuLER: Detailed and Scalable Reference-based Evaluation

Taelin Karidi, Leshem Choshen, Gal Patel et al.

2023 CONLL

On the Effects of Structural Modeling for Neural Semantic Parsing

Xiang Zhang, Shizhu He, Kang Liu et al.

2023 CONLL

On the utility of enhancing BERT syntactic bias with Token Reordering Pretraining

Yassir El Mesbahi, Atif Mahmud, Abbas Ghaddar et al.

2023 CONLL

PROPRES: Investigating the Projectivity of Presupposition with Various Triggers and Environments

Daiki Asami, Saku Sugawara

2023 CONLL

PSST! Prosodic Speech Segmentation with Transformers

Nathan Roll, Calbert Graham, Simon Todd

2023 CONLL

Quantifying Information of Tokens for Simple and Flexible Simultaneous Machine Translation

DongHyun Lee, Minkyung Park, Byung-Jun Lee

2023 CONLL

Quirk or Palmer: A Comparative Study of Modal Verb Frameworks with Annotated Datasets

Risako Owan, Maria Gini, Dongyeop Kang

2023 CONLL

REFER: An End-to-end Rationale Extraction Framework for Explanation Regularization

Mohammad Reza Ghasemi Madani, Pasquale Minervini

2023 CONLL

Revising with a Backward Glance: Regressions and Skips during Reading as Cognitive Signals for Revision Policies in Incremental Processing

Brielen Madureira, Pelin Çelikkol, David Schlangen

2023 CONLL

Strategies to Improve Low-Resource Agglutinative Languages Morphological Inflection

Gulinigeer Abudouwaili, Wayit Ablez, Kahaerjiang Abiderexiti et al.

2023 CONLL

Structural Ambiguity and its Disambiguation in Language Model Based Parsers: the Case of Dutch Clause Relativization

Gijs Wijnholds, Michael Moortgat

2023 CONLL

Syntactic Inductive Bias in Transformer Language Models: Especially Helpful for Low-Resource Languages?

Luke Gessler, Nathan Schneider

2023 CONLL

The Impact of Familiarity on Naming Variation: A Study on Object Naming in Mandarin Chinese

Yunke He, Xixian Liao, Jialing Liang et al.

2023 CONLL

Theory of Mind in Large Language Models: Examining Performance of 11 State-of-the-Art models vs. Children Aged 7-10 on Advanced Tests

Max van Duijn, Bram van Dijk, Tom Kouwenhoven et al.

2023 CONLL

The Validity of Evaluation Results: Assessing Concurrence Across Compositionality Benchmarks

Kaiser Sun, Adina Williams, Dieuwke Hupkes

2023 CONLL

The Zipfian Challenge: Learning the statistical fingerprint of natural languages

Christian Bentz

2023 CONLL

ToMChallenges: A Principle-Guided Dataset and Diverse Evaluation Tasks for Exploring Theory of Mind

Xiaomeng Ma, Lingyu Gao, Qihui Xu

2023 CONLL

Towards Better Evaluation of Instruction-Following: A Case-Study in Summarization

Ondrej Skopek, Rahul Aralikatte, Sian Gooding et al.

2023 CONLL

Tree-shape Uncertainty for Analyzing the Inherent Branching Bias of Unsupervised Parsing Models

Taiga Ishii, Yusuke Miyao

2023 CONLL

A Fine-grained Interpretability Evaluation Benchmark for Neural NLP

Lijie Wang, Yaozong Shen, Shuyuan Peng et al.

2022 CONLL

Papers