Papers
290 papers found
Implications of Annotation Artifacts in Edge Probing Test Datasets
Sagnik Ray Choudhury, Jushaan Kalra
Investigating the Nature of Disagreements on Mid-Scale Ratings: A Case Study on the Abstractness-Concreteness Continuum
Urban Knupleš, Diego Frassinelli, Sabine Schulte im Walde
JaSPICE: Automatic Evaluation Metric Using Predicate-Argument Structures for Image Captioning Models
Yuiga Wada, Kanta Kaneda, Komei Sugiura
Med-HALT: Medical Domain Hallucination Test for Large Language Models
Ankit Pal, Logesh Kumar Umapathi, Malaikannan Sankarasubbu
Mind the instructions: a holistic evaluation of consistency and interactions in prompt-based learning
Lucas Weber, Elia Bruni, Dieuwke Hupkes
MuLER: Detailed and Scalable Reference-based Evaluation
Taelin Karidi, Leshem Choshen, Gal Patel et al.
On the Effects of Structural Modeling for Neural Semantic Parsing
Xiang Zhang, Shizhu He, Kang Liu et al.
On the utility of enhancing BERT syntactic bias with Token Reordering Pretraining
Yassir El Mesbahi, Atif Mahmud, Abbas Ghaddar et al.
PROPRES: Investigating the Projectivity of Presupposition with Various Triggers and Environments
Daiki Asami, Saku Sugawara
PSST! Prosodic Speech Segmentation with Transformers
Nathan Roll, Calbert Graham, Simon Todd
Quantifying Information of Tokens for Simple and Flexible Simultaneous Machine Translation
DongHyun Lee, Minkyung Park, Byung-Jun Lee
Quirk or Palmer: A Comparative Study of Modal Verb Frameworks with Annotated Datasets
Risako Owan, Maria Gini, Dongyeop Kang
REFER: An End-to-end Rationale Extraction Framework for Explanation Regularization
Mohammad Reza Ghasemi Madani, Pasquale Minervini
Revising with a Backward Glance: Regressions and Skips during Reading as Cognitive Signals for Revision Policies in Incremental Processing
Brielen Madureira, Pelin Çelikkol, David Schlangen
Strategies to Improve Low-Resource Agglutinative Languages Morphological Inflection
Gulinigeer Abudouwaili, Wayit Ablez, Kahaerjiang Abiderexiti et al.
Structural Ambiguity and its Disambiguation in Language Model Based Parsers: the Case of Dutch Clause Relativization
Gijs Wijnholds, Michael Moortgat
Syntactic Inductive Bias in Transformer Language Models: Especially Helpful for Low-Resource Languages?
Luke Gessler, Nathan Schneider
The Impact of Familiarity on Naming Variation: A Study on Object Naming in Mandarin Chinese
Yunke He, Xixian Liao, Jialing Liang et al.
Theory of Mind in Large Language Models: Examining Performance of 11 State-of-the-Art models vs. Children Aged 7-10 on Advanced Tests
Max van Duijn, Bram van Dijk, Tom Kouwenhoven et al.
The Validity of Evaluation Results: Assessing Concurrence Across Compositionality Benchmarks
Kaiser Sun, Adina Williams, Dieuwke Hupkes
ToMChallenges: A Principle-Guided Dataset and Diverse Evaluation Tasks for Exploring Theory of Mind
Xiaomeng Ma, Lingyu Gao, Qihui Xu
Towards Better Evaluation of Instruction-Following: A Case-Study in Summarization
Ondrej Skopek, Rahul Aralikatte, Sian Gooding et al.
Tree-shape Uncertainty for Analyzing the Inherent Branching Bias of Unsupervised Parsing Models
Taiga Ishii, Yusuke Miyao
A Fine-grained Interpretability Evaluation Benchmark for Neural NLP
Lijie Wang, Yaozong Shen, Shuyuan Peng et al.