Research Explorer
Speech Synthesis
164 directly classified papers
Papers per year
2007: 1
2012: 2
2013: 1
2016: 1
2017: 5
2018: 3
2019: 10
2020: 14
2021: 7
2022: 23
2023: 24
2024: 28
2025: 45
Papers
Flow-Based Unconstrained Lip to Speech Generation
AAAI 2022
DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism
AAAI 2022
SpeechT5: Unified-Modal Encoder-Decoder Pre-Training for Spoken Language Processing
ACL 2022
Learning the Beauty in Songs: Neural Singing Voice Beautifier
ACL 2022
Revisiting Over-Smoothness in Text to Speech
ACL 2022
Text-Free Prosody-Aware Generative Spoken Language Modeling
ACL 2022
Development of the Siberian Ingrian Finnish Speech Corpus
ACL 2022
MLLP-VRAIN UPV systems for the IWSLT 2022 Simultaneous Speech Translation and Speech-to-Speech Translation tasks
ACL 2022
V2C: Visual Voice Cloning
CVPR 2022
Talking Face Generation With Multilingual TTS
CVPR 2022
More Than Words: In-the-Wild Visually-Driven Prosody for Text-to-Speech
CVPR 2022
Textless Speech Emotion Conversion using Discrete & Decomposed Representations
EMNLP 2022
Adversarial Text-to-Speech for low-resource languages
EMNLP 2022
NatiQ: An End-to-end Text-to-Speech System for Arabic
EMNLP 2022
M4Singer: A Multi-Style, Multi-Singer and Musical Score Provided Mandarin Singing Corpus
NeurIPS 2022
INRAS: Implicit Neural Representation for Audio Scenes
NeurIPS 2022
GenerSpeech: Towards Style Transfer for Generalizable Out-Of-Domain Text-to-Speech
NeurIPS 2022
Dict-TTS: Learning to Pronounce with Prior Dictionary Knowledge for Text-to-Speech
NeurIPS 2022
HierSpeech: Bridging the Gap between Text and Speech by Hierarchical Variational Inference using Self-supervised Representations for Speech Synthesis
NeurIPS 2022
Audio-Driven Co-Speech Gesture Video Generation
NeurIPS 2022
Neural Analysis and Synthesis: Reconstructing Speech from Self-Supervised Representations
NeurIPS 2021
fairseq S^2: A Scalable and Integrable Speech Synthesis Toolkit
EMNLP 2021
TaLNet: Voice Reconstruction from Tongue and Lip Articulation with Transfer Learning from Text-to-Speech Synthesis
AAAI 2021
Multi-SpectroGAN: High-Diversity and High-Fidelity Spectrogram Generation with Adversarial Style Combination for Speech Synthesis
AAAI 2021
Prosody: Models, Methods, and Applications
ACL 2021