Speech & Audio › Synthesis ›

Speech Synthesis

164 directly classified papers

Papers per year

Papers

UniSyn: An End-to-End Unified Model for Text-to-Speech and Singing Voice Synthesis AAAI 2023

DopplerBAS: Binaural Audio Synthesis Addressing Doppler Effect ACL 2023

The MineTrans Systems for IWSLT 2023 Offline Speech Translation and Speech-to-Speech Translation Tasks ACL 2023

Length-Aware NMT and Adaptive Duration for Automatic Dubbing ACL 2023

NAIST Simultaneous Speech-to-speech Translation System for IWSLT 2023 ACL 2023

The Kyoto Speech-to-Speech Translation System for IWSLT 2023 ACL 2023

Towards Voice Reconstruction from EEG during Imagined Speech AAAI 2023

Non-parallel Accent Transfer based on Fine-grained Controllable Accent Modelling EMNLP 2023

AlignSTS: Speech-to-Singing Conversion via Cross-Modal Alignment ACL 2023

Speaking Style Conversion in the Waveform Domain Using Discrete Self-Supervised Units EMNLP 2023

RWEN-TTS: Relation-Aware Word Encoding Network for Natural Text-to-Speech Synthesis AAAI 2023

DPP-TTS: Diversifying prosodic features of speech via determinantal point processes EMNLP 2023

Improving Chinese Pop Song and Hokkien Gezi Opera Singing Voice Synthesis by Enhancing Local Modeling EMNLP 2023

What Does Your Face Sound Like? 3D Face Shape towards Voice AAAI 2023

Physics-Driven Diffusion Models for Impact Sound Synthesis From Videos CVPR 2023

CodeTalker: Speech-Driven 3D Facial Animation With Discrete Motion Prior CVPR 2023

Learning To Dub Movies via Hierarchical Prosody Models CVPR 2023

Avocodo: Generative Adversarial Network for Artifact-Free Vocoder AAAI 2023

A System for Generating Voice Source Signals that Implements the Transformed LF-model Parameter Control INTERSPEECH 2023

Emotional Talking Head Generation based on Memory-Sharing and Attention-Augmented Networks INTERSPEECH 2023

UniSplice: Universal Cross-Lingual Data Splicing for Low-Resource ASR INTERSPEECH 2023

A Generative Framework for Conversational Laughter: Its 'Language Model' and Laughter Sound Synthesis INTERSPEECH 2023

Requirements and Motivations of Low-Resource Speech Synthesis for Language Revitalization ACL 2022

RetrieverTTS: Modeling Decomposed Factors for Text-Based Speech Insertion INTERSPEECH 2022

Improve emotional speech synthesis quality by learning explicit and implicit representations with semi-supervised training INTERSPEECH 2022