BLASER 2.0: a metric for evaluation and quality estimation of massively multilingual speech and text translation

David Dale; Marta R. Costa-jussà

2024 EMNLP EMNLP 2024

BLASER 2.0: a metric for evaluation and quality estimation of massively multilingual speech and text translation

Abstract

AbstractWe present BLASER 2.0, an automatic metric of machine translation quality which supports both speech and text modalities. Compared to its predecessor BLASER (Chen et al., 2023), BLASER 2.0 is based on better underlying text and speech representations that cover 202 text languages and 57 speech ones and extends the training data. BLASER 2.0 comes in two varieties: a reference-based and a reference-free (quality estimation) model. We demonstrate that the reference-free version is applicable not only at the dataset level, for evaluating the overall model performance, but also at the sentence level, for scoring individual translations. In particular, we show its applicability for detecting translation hallucinations and filtering training datasets to obtain more reliable translation models. The BLASER 2.0 models are publicly available at https://github.com/facebookresearch/sonar.

🌉 Interdisciplinary Bridge — Deep Learning and Natural Language Processing and Speech & Audio

🧭 Keyword Pioneer — translation hallucination detection

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Security & Privacy, Speech & Audio

Authors

David Dale , Marta R. Costa-jussà

Topics

Natural Language Processing > Applications > Machine Translation Natural Language Processing > Applications > Text Classification Natural Language Processing > Resources & Methods > Multilingual NLP Natural Language Processing > Generation > Machine Translation Speech & Audio > Analysis > Speech Analysis Deep Learning > Learning Types > Multi-Modal Learning

Keywords

machine translation multilingual translation quality estimation reference-free evaluation machine translation evaluation multilingual speech speech translation multilingual speech translation reference-free metric translation hallucination detection

Download PDF

Related papers

EmbodiedBERT: Cognitively Informed Metaphor Detection Incorporating Sensorimotor Information 2024

Mitigating Matthew Effect: Multi-Hypergraph Boosted Multi-Interest Self-Supervised Learning for Conversational Recommendation 2024

Learning to Extract Structured Entities Using Language Models 2024

Towards Understanding Jailbreak Attacks in LLMs: A Representation Space Analysis 2024

CSSL: Contrastive Self-Supervised Learning for Dependency Parsing on Relatively Free Word Ordered and Morphologically Rich Low Resource Languages 2024