2022
NAACL
NAACL 2022
PaddleSpeech: An Easy-to-Use All-in-One Speech Toolkit
Abstract
AbstractPaddleSpeech is an open-source all-in-one speech toolkit. It aims at facilitating the development and research of speech processing technologies by providing an easy-to-use command-line interface and a simple code structure. This paper describes the design philosophy and core architecture of PaddleSpeech to support several essential speech-to-text and text-to-speech tasks. PaddleSpeech achieves competitive or state-of-the-art performance on various speech datasets and implements the most popular methods. It also provides recipes and pretrained models to quickly reproduce the experimental results in this paper. PaddleSpeech is publicly avaiable at https://github.com/PaddlePaddle/PaddleSpeech.
🌉
Interdisciplinary Bridge
— Natural Language Processing and Speech & Audio
🧭
Keyword Pioneer
— speech toolkit
🐝
Cross-Pollinator
— Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio
Authors
Hui Zhang
,
Tian Yuan
,
Junkun Chen
,
Xintong Li
,
Renjie Zheng
,
Yuxin Huang
,
Xiaojie Chen
,
Enlei Gong
,
Zeyu Chen
,
Xiaoguang Hu
,
Dianhai Yu
,
Yanjun Ma
,
Liang Huang