GLaRef@CRAC2025: Should we transform coreference resolution into a text generation task?

Olga Seminck; Antoine Bourgois; Yoann Dupont; Mathieu Dehouck; Marine Delaborde

EMNLP 2025

Abstract

We present our team's submissions to the Unconstrained and LLM tracks of the Computational Models of Reference, Anaphora and Coreference (CRAC2025) shared task, where we finished fifth and first, respectively. The two systems nevertheless obtained similar scores, with average CoNLL-F1 scores of 61.57 and 62.96 on the test set, at very different computational costs: the classical pair-wise resolution system submitted to the Unconstrained track achieved its score at less than 10% of the computational cost of the LLM system. Reflecting on this, we point out problems we ran into when using generative AI for coreference resolution, and we explain how the text generation framework stands in the way of a reliable text-global coreference representation. Nonetheless, we see many potential improvements to our LLM system, which we discuss at the end of this article.

Topics

Natural Language Processing > Understanding > Coreference Resolution
Natural Language Processing > Generation > Text Generation

Keywords

text generation, coreference resolution, computational cost, pairwise resolution
