Speech & Audio › Synthesis ›

Speech Synthesis

164 directly classified papers

Papers per year

Papers

Flow-Based Unconstrained Lip to Speech Generation AAAI 2022

DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism AAAI 2022

SpeechT5: Unified-Modal Encoder-Decoder Pre-Training for Spoken Language Processing ACL 2022

Learning the Beauty in Songs: Neural Singing Voice Beautifier ACL 2022

Revisiting Over-Smoothness in Text to Speech ACL 2022

Text-Free Prosody-Aware Generative Spoken Language Modeling ACL 2022

Development of the Siberian Ingrian Finnish Speech Corpus ACL 2022

MLLP-VRAIN UPV systems for the IWSLT 2022 Simultaneous Speech Translation and Speech-to-Speech Translation tasks ACL 2022

V2C: Visual Voice Cloning CVPR 2022

Talking Face Generation With Multilingual TTS CVPR 2022

More Than Words: In-the-Wild Visually-Driven Prosody for Text-to-Speech CVPR 2022

Textless Speech Emotion Conversion using Discrete & Decomposed Representations EMNLP 2022

Adversarial Text-to-Speech for low-resource languages EMNLP 2022

NatiQ: An End-to-end Text-to-Speech System for Arabic EMNLP 2022

M4Singer: A Multi-Style, Multi-Singer and Musical Score Provided Mandarin Singing Corpus NIPS 2022

INRAS: Implicit Neural Representation for Audio Scenes NIPS 2022

GenerSpeech: Towards Style Transfer for Generalizable Out-Of-Domain Text-to-Speech NIPS 2022

Dict-TTS: Learning to Pronounce with Prior Dictionary Knowledge for Text-to-Speech NIPS 2022

HierSpeech: Bridging the Gap between Text and Speech by Hierarchical Variational Inference using Self-supervised Representations for Speech Synthesis NIPS 2022

Audio-Driven Co-Speech Gesture Video Generation NIPS 2022

Neural Analysis and Synthesis: Reconstructing Speech from Self-Supervised Representations NIPS 2021

fairseq Sˆ2: A Scalable and Integrable Speech Synthesis Toolkit EMNLP 2021

TaLNet: Voice Reconstruction from Tongue and Lip Articulation with Transfer Learning from Text-to-Speech Synthesis AAAI 2021

Multi-SpectroGAN: High-Diversity and High-Fidelity Spectrogram Generation with Adversarial Style Combination for Speech Synthesis AAAI 2021

Prosody: Models, Methods, and Applications ACL 2021