Papers
FVTTS : Face Based Voice Synthesis for Text-to-Speech
INTERSPEECH 2024
XTTS: a Massively Multilingual Zero-Shot Text-to-Speech Model
INTERSPEECH 2024
Word-level Text Markup for Prosody Control in Speech Synthesis
INTERSPEECH 2024
RAD-MMM: Multilingual Multiaccented Multispeaker Text To Speech
INTERSPEECH 2023
Towards Robust FastSpeech 2 by Modelling Residual Multimodality
INTERSPEECH 2023
VC-T: Streaming Voice Conversion Based on Neural Transducer
INTERSPEECH 2023
Cross-lingual Prosody Transfer for Expressive Machine Dubbing
INTERSPEECH 2023