Papers
Spoken-to-written text conversion with Large Language Model
INTERSPEECH 2024
NAST: Noise Aware Speech Tokenization for Speech Language Models
INTERSPEECH 2024
Binaural Selective Attention Model for Target Speaker Extraction
INTERSPEECH 2024
PAM: Prompting Audio-Language Models for Audio Quality Assessment
INTERSPEECH 2024
VoxFlow AI: wearable voice converter for atypical speech
INTERSPEECH 2024