Kai Yu
120 papers · 2006–2026 · 13 conferences · across top CS/AI conferences
Achievements
Academic Marathon (19)
Keyword Pioneer
Conference Polyglot (13)
Taxonomy Completionist (38)
Hot Topic Early Bird
Renaissance Researcher (9)
Interdisciplinary Bridge
Cross-Pollinator (7)
Conference Loyalist (36)
Keyword Trendsetter Combo (5)
Deep Specialist (20)
Topic Pioneer
Keyword Champion
Topic Evolution
Mega-Team (23)
Dynamic Duo (48)
Trend Setter
Conference Pioneer
Unstoppable (14)
The Questioner (4)
Prolific Year (17)
Century Club (116)
Keyword Collector (116)
Conferences
INTERSPEECH (36)
ACL (18)
EMNLP (17)
NIPS (12)
AAAI (9)
COLING (7)
NAACL (5)
ICCV (4)
ICML (4)
IJCNLP (3)
CVPR (2)
EACL (2)
MICCAI (1)
Keywords
large language model (15)
semantic parsing (9)
speech synthesis (7)
domain adaptation (6)
data augmentation (6)
speaker verification (5)
automatic speech recognition (5)
speaker embedding (5)
knowledge distillation (5)
graph neural network (5)
long short-term memory (4)
model compression (4)
vector quantization (4)
connectionist temporal classification (4)
text-to-speech synthesis (4)
speech recognition (4)
dialogue state tracking (4)
transfer learning (4)
unsupervised learning (4)
semi-supervised learning (4)
Papers
On the Effectiveness of Acoustic BPE in Decoder-Only TTS
INTERSPEECH 2024
Text-aware Speech Separation for Multi-talker Keyword Spotting
INTERSPEECH 2024
FakeSound: Deepfake General Audio Detection
INTERSPEECH 2024
UnSE: Unsupervised Speech Enhancement Using Optimal Transport
INTERSPEECH 2023
DSE-TTS: Dual Speaker Embedding for Cross-Lingual Text-to-Speech
INTERSPEECH 2023
How ChatGPT is Robust for Spoken Language Understanding?
INTERSPEECH 2023
MSDWild: Multi-modal Speaker Diarization Dataset in the Wild
INTERSPEECH 2022
Efficient Speech Enhancement with Neural Homomorphic Synthesis
INTERSPEECH 2022
Neural Homomorphic Vocoder
INTERSPEECH 2020
Semantic Parsing with Dual Learning
ACL 2019
The SJTU Robust Anti-Spoofing System for the ASVspoof 2019 Challenge
INTERSPEECH 2019
Joint Decoding of CTC Based Systems for Speech Recognition
INTERSPEECH 2019
Binarized LSTM Language Model
NAACL 2018
Knowledge Distillation for Sequence Model
INTERSPEECH 2018
Towards Universal Dialogue State Tracking
EMNLP 2018
What Does the Speaker Embedding Encode?
INTERSPEECH 2017
Comparison of Modeling Target in LSTM-RNN Duration Model
INTERSPEECH 2017
Discrete Duration Model for Speech Synthesis
INTERSPEECH 2017
Binary Deep Neural Networks for Speech Recognition
INTERSPEECH 2017
Affordable On-line Dialogue Policy Learning
EMNLP 2017
Unrestricted Vocabulary Keyword Spotting Using LSTM-CTC
INTERSPEECH 2016
Phone Synchronous Decoding with CTC Lattice
INTERSPEECH 2016
Deep Coding Network
NIPS 2010
Predictive Matrix-Variate t Models
NIPS 2007