Acoustic vector sensor based reverberant speech separation with probabilistic time-frequency masking

Xionghu Zhong,Xiaoyi Chen,Wenwu Wang,Atiyeh Alinaghi,A. Benjamin Premkumar

EUSIPCO（2013）

引用 28|浏览24

暂无评分

摘要

Most existing speech source separation algorithms have been developed for separating sound mixtures acquired by using a conventional microphone array. In contrast, little attention has been paid to the problem of source separation using an acoustic vector sensor (AVS). We propose a new method for the separation of convolutive mixtures by incorporating the intensity vector of the acoustic field, obtained using spatially co-located microphones which carry the direction of arrival (DOA) information. The DOA cues from the intensity vector, together with the frequency bin-wise mixing vector cues, are then used to determine the probability of each time-frequency (T-F) point of the mixture being dominated by a specific source, based on the Gaussian mixture models (GMM), whose parameters are evaluated and refined iteratively using an expectation-maximization (EM) algorithm. Finally, the probability is used to derive the T-F masks for recovering the sources. The proposed method is evaluated in simulated reverberant environments in terms of signal-to-distortion ratio (SDR), giving an average improvement of approximately 1:5 dB as compared with a related T-F mask approach based on a conventional microphone setting.

查看译文

关键词

acoustic intensity,acoustic vector sensor,expectation-maximisation algorithm,expectation-maximization algorithm,speech recognition,speech intelligibility,direction of arrival,doa information,signal-to-distortion ratio,spatially colocated microphones,direction of arrival information,sound mixtures,convolutive mixtures,intensity vector,mixture models,source separation,acoustic signal processing,blind source separation,speech source separation algorithms,gaussian processes,direction-of-arrival estimation,frequency bin-wise mixing vector cues,time-frequency point,gaussian mixture models,microphone array,microphone arrays,em algorithm,probabilistic time-frequency masking,reverberant speech separation

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要