Papers
Serialized Output Training by Learned Dominance
INTERSPEECH 2024
Phonological-Level Mispronunciation Detection and Diagnosis
INTERSPEECH 2024
Positional Description for Numerical Normalization
INTERSPEECH 2024
Modality Translation Learning for Joint Speech-Text Model
INTERSPEECH 2024
Do VSR Models Generalize Beyond LRS3?
WACV 2024