An analytic study on clustering driven self-supervised speaker verification

PATTERN RECOGNITION LETTERS(2024)

引用 0|浏览1
暂无评分
摘要
One of the most widely used self -supervised speaker verification system training methods is to optimize the speaker embedding network in a discriminative fashion using clustering algorithm -driven Pseudo -Labels. Although the pseudo -labels -based self -supervised training scheme showed impressive performance, recent studies have shown that label noise can significantly impact performance. In this paper, we have explored various pseudo -labels driven by different clustering algorithms and conducted a fine-grained analysis of the relationship between the quality of the pseudo -labels and the speaker verification performance. Experimentally, we shed light on several previously overlooked aspects of the pseudo -labels that can impact speaker verification performance. Moreover, we could observe that the self -supervised speaker verification performance is heavily dependent on multiple qualitative aspects of the clustering algorithms used to generate the pseudo -labels. Furthermore, we show that speaker verification performance can be severely degraded from overfitting the noisy pseudo -labels and that the mixup strategy can mitigate the memorization effects of label noise.
更多
查看译文
关键词
Speaker verification,Clustering,Self-supervised speaker verification,Pseudo-labels,Label noise
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要