IMPROVING SPEAKER VERIFICATION IN REVERBERANT ENVIRONMENTS

2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021)（2021）

引用 1|浏览7

暂无评分

摘要

Speaker verification technology has been successfully adopted and integrated into many applications. However, most of these applications require a microphone located near the talker. For the case of distant microphones, speech signals are corrupted by reverberations caused by the large speaker to microphone distance. In this paper, we first introduce a new feature set that gives more details in the frequency dimension in the 2-D time-frequency space used to represent speech. These features are computed using two sets of basis vectors, both of which are applied directly to the amplitude compressed FFT spectrum. One set of basis vectors accounts for the spectral envelope while the second set accounts for pitch. Those features are used to train a Convolutional Neural Network (CNN), with the goal of reducing the negative effects of reverberation. The proposed frontend is shown to be robust for speaker verification in reverberant environments.

查看译文

关键词

CNN, dereverberation, speaker verification, pitch, frontend

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要