Speaker Verification

410 directly classified papers

Papers

MR-RawNet: Speaker verification system with multiple temporal resolutions for variable duration utterances using raw waveforms INTERSPEECH 2024

Real-time scheme for rapid extraction of speaker embeddings in challenging recording conditions INTERSPEECH 2024

Exploring Spoken Language Identification Strategies for Automatic Transcription of Multilingual Broadcast and Institutional Speech INTERSPEECH 2024

TalTech-IRIT-LIS Speaker and Language Diarization Systems for DISPLACE 2024 INTERSPEECH 2024

Multi-Channel Extension of Pre-trained Models for Speaker Verification INTERSPEECH 2024

Evaluating the Santa Barbara Corpus: Challenges of the Breadth of Conversational Spoken Language INTERSPEECH 2024

ALLIES: A Speech Corpus for Segmentation, Speaker Diarization, Speech Recognition and Speaker Change Detection COLING 2024

Harder or Different? Understanding Generalization of Audio Deepfake Detection INTERSPEECH 2024

On the Success and Limitations of Auxiliary Network Based Word-Level End-to-End Neural Speaker Diarization INTERSPEECH 2024

Can Large Language Models Understand Spatial Audio? INTERSPEECH 2024

Refining Self-supervised Learnt Speech Representation using Brain Activations INTERSPEECH 2024

DiarizationLM: Speaker Diarization Post-Processing with Large Language Models INTERSPEECH 2024

The reasonable effectiveness of speaker embeddings for violence detection INTERSPEECH 2024

VSASV: a Vietnamese Dataset for Spoofing-Aware Speaker Verification INTERSPEECH 2024

EEND-M2F: Masked-attention mask transformers for speaker diarization INTERSPEECH 2024

LibriheavyMix: A 20,000-Hour Dataset for Single-Channel Reverberant Multi-Talker Speech Separation, ASR and Speaker Diarization INTERSPEECH 2024

SLIM: Style-Linguistics Mismatch Model for Generalized Audio Deepfake Detection NeurIPS 2024

Experimenting with Additive Margins for Contrastive Self-Supervised Speaker Verification INTERSPEECH 2023

Build a SRE Challenge System: Lessons from VoxSRC 2022 and CNSRC 2022 INTERSPEECH 2023

A Teacher-Student Approach for Extracting Informative Speaker Embeddings From Speech Mixtures INTERSPEECH 2023

On-Device Speaker Anonymization of Acoustic Embeddings for ASR based on Flexible Location Gradient Reversal Layer INTERSPEECH 2023

Language Identification Networks for Multilingual Everyday Recordings INTERSPEECH 2023

DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion Probabilistic Model INTERSPEECH 2023

Improved DeepFake Detection Using Whisper Features INTERSPEECH 2023

From adaptive score normalization to adaptive data normalization for speaker verification systems INTERSPEECH 2023