Papers
Lightweight Zero-shot Text-to-Speech with Mixture of Adapters
INTERSPEECH 2024
Text-aware and Context-aware Expressive Audiobook Speech Synthesis
INTERSPEECH 2024
Positional Description for Numerical Normalization
INTERSPEECH 2024
Multi-modal Adversarial Training for Zero-Shot Voice Cloning
INTERSPEECH 2024
FVTTS : Face Based Voice Synthesis for Text-to-Speech
INTERSPEECH 2024
Assessing the impact of contextual framing on subjective TTS quality
INTERSPEECH 2024