Binaural Multichannel Blind Speaker Separation With a Causal Low-Latency and Low-Complexity Approach

Nils L. Westhausen,Bernd T. Meyer

IEEE OPEN JOURNAL OF SIGNAL PROCESSING(2024)

引用 0|浏览2
暂无评分
摘要
In this article, we introduce a causal low-latency low-complexity approach for binaural multichannel blind speaker separation in noisy reverberant conditions. The model, referred to as Group Communication Binaural Filter and Sum Network (GCBFSnet) predicts complex filters for filter-and-sum beamforming in the time-frequency domain. We apply Group Communication (GC), i.e., latent model variables are split into groups and processed with a shared sequence model with the aim of reducing the complexity of a simple model only containing one convolutional and one recurrent module. With GC we are able to reduce the size of the model by up to 83% and the complexity up to 73% compared to the model without GC, while mostly retaining performance. Even for the smallest model configuration, GCBFSnet matches the performance of a low-complexity TasNet baseline in most metrics despite the larger size and higher number of required operations of the baseline.
更多
查看译文
关键词
Complexity theory,Low latency communication,Speech enhancement,Convolution,Signal processing algorithms,Microphones,MIMO communication,Binaural,low-latency,multi-channel,real-time,speaker-separation
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要