Papers
1,038 papers found
Robust Keyword Spotting via Recycle-Pooling for Mobile Game
Shounan An, Youngsoo Kim, Hu Xu et al.
End-to-End Multi-Look Keyword Spotting
Meng Yu, Xuan Ji, Bo Wu et al.
CAM: Uninteresting Speech Detector
Weiyi Lu, Yi Xu, Peng Yang et al.
On Front-End Gain Invariant Modeling for Wake Word Spotting
Yixin Gao, Noah D. Stein, Chieh-Chi Kao et al.
Self-Training for End-to-End Speech Translation
Juan Pino, Qiantong Xu, Xutai Ma et al.
Neural Architecture Search for Keyword Spotting
Tong Mo, Yakun Yu, Mohammad Salameh et al.
Streaming Keyword Spotting on Mobile Devices
Oleg Rybakov, Natasha Kononenko, Niranjan Subrahmanya et al.
Metadata-Aware End-to-End Keyword Spotting
Hongyi Liu, Apurva Abhyankar, Yuriy Mishchenko et al.
Deep Convolutional Spiking Neural Networks for Keyword Spotting
Emre Yılmaz, Özgür Bora Gevrek, Jibin Wu et al.
Deep Template Matching for Small-Footprint and Configurable Keyword Spotting
Peng Zhang, Xueliang Zhang
Multi-Scale Convolution for Robust Keyword Spotting
Chen Yang, Xue Wen, Liming Song
Attention Forcing for Speech Synthesis
Qingyun Dou, Joshua Efiong, Mark J.F. Gales
Noisy Student-Teacher Training for Robust Keyword Spotting
Hyun-Jin Park, Pai Zhu, Ignacio Lopez Moreno et al.
Graph Attention Networks for Anti-Spoofing
Hemlata Tak, Jee-weon Jung, Jose Patino et al.
Lost in Interpreting: Speech Translation from Source or Interpreter?
Dominik Macháček, Matúš Žilinec, Ondřej Bojar
Adapting Speaker Embeddings for Speaker Diarisation
Youngki Kwon, Jee-weon Jung, Hee-Soo Heo et al.
Few-Shot Keyword Spotting in Any Language
Mark Mazumder, Colby Banbury, Josh Meyer et al.
Keyword Transformer: A Self-Attention Model for Keyword Spotting
Axel Berg, Mark O’Connor, Miguel Tairum Cruz
Device Playback Augmentation with Echo Cancellation for Keyword Spotting
Kuba Łopatka, Katarzyna Kaszuba-Miotke, Piotr Klinke et al.
Presentation Matters: Evaluating Speaker Identification Tasks
Benjamin O’Brien, Christine Meunier, Alain Ghio
Generalized Keyword Spotting using ASR embeddings
Kirandevraj R, Vinod Kumar Kurmi, Vinay Namboodiri et al.
SpeechPainter: Text-conditioned Speech Inpainting
Zalan Borsos, Matthew Sharifi, Marco Tagliasacchi
Personalized Keyword Spotting through Multi-task Learning
Seunghan Yang, Byeonggeun Kim, Inseop Chung et al.
Latency Control for Keyword Spotting
Christin Jose, Joe Wang, Grant Strimel et al.
Comparison of Models for Detecting Off-Putting Speaking Styles
Diego Aguirre, Nigel Ward, Jonathan E. Avila et al.