Papers
Can Large Language Models Understand Spatial Audio?
INTERSPEECH 2024
VSASV: a Vietnamese Dataset for Spoofing-Aware Speaker Verification
INTERSPEECH 2024
EEND-M2F: Masked-attention mask transformers for speaker diarization
INTERSPEECH 2024
Improved DeepFake Detection Using Whisper Features
INTERSPEECH 2023