Research Explorer
Papers
Conferences
Authors
Topics
Keywords
Trends
Achievements
Explore
← Synthesis
Speech & Audio
›
Synthesis
›
Speech Synthesis
164 directly classified papers
Papers per year
2007: 1
2012: 2
2013: 1
2016: 1
2017: 5
2018: 3
2019: 10
2020: 14
2021: 7
2022: 23
2023: 24
2024: 28
2025: 45
Papers
UniSyn: An End-to-End Unified Model for Text-to-Speech and Singing Voice Synthesis
AAAI 2023
DopplerBAS: Binaural Audio Synthesis Addressing Doppler Effect
ACL 2023
The MineTrans Systems for IWSLT 2023 Offline Speech Translation and Speech-to-Speech Translation Tasks
ACL 2023
Length-Aware NMT and Adaptive Duration for Automatic Dubbing
ACL 2023
NAIST Simultaneous Speech-to-speech Translation System for IWSLT 2023
ACL 2023
The Kyoto Speech-to-Speech Translation System for IWSLT 2023
ACL 2023
Towards Voice Reconstruction from EEG during Imagined Speech
AAAI 2023
Non-parallel Accent Transfer based on Fine-grained Controllable Accent Modelling
EMNLP 2023
AlignSTS: Speech-to-Singing Conversion via Cross-Modal Alignment
ACL 2023
Speaking Style Conversion in the Waveform Domain Using Discrete Self-Supervised Units
EMNLP 2023
RWEN-TTS: Relation-Aware Word Encoding Network for Natural Text-to-Speech Synthesis
AAAI 2023
DPP-TTS: Diversifying prosodic features of speech via determinantal point processes
EMNLP 2023
Improving Chinese Pop Song and Hokkien Gezi Opera Singing Voice Synthesis by Enhancing Local Modeling
EMNLP 2023
What Does Your Face Sound Like? 3D Face Shape towards Voice
AAAI 2023
Physics-Driven Diffusion Models for Impact Sound Synthesis From Videos
CVPR 2023
CodeTalker: Speech-Driven 3D Facial Animation With Discrete Motion Prior
CVPR 2023
Learning To Dub Movies via Hierarchical Prosody Models
CVPR 2023
Avocodo: Generative Adversarial Network for Artifact-Free Vocoder
AAAI 2023
A System for Generating Voice Source Signals that Implements the Transformed LF-model Parameter Control
INTERSPEECH 2023
Emotional Talking Head Generation based on Memory-Sharing and Attention-Augmented Networks
INTERSPEECH 2023
UniSplice: Universal Cross-Lingual Data Splicing for Low-Resource ASR
INTERSPEECH 2023
A Generative Framework for Conversational Laughter: Its 'Language Model' and Laughter Sound Synthesis
INTERSPEECH 2023
Requirements and Motivations of Low-Resource Speech Synthesis for Language Revitalization
ACL 2022
RetrieverTTS: Modeling Decomposed Factors for Text-Based Speech Insertion
INTERSPEECH 2022
Improve emotional speech synthesis quality by learning explicit and implicit representations with semi-supervised training
INTERSPEECH 2022
<
1
2
3
4
5
6
7
>