2024 EMNLP EMNLP 2024

Leading Whitespaces of Language Models’ Subword Vocabulary Pose a Confound for Calculating Word Probabilities