A Multitask Learning Approach For Sound Source Tracking With Icosahedral Convolutional Neural Networks

2023 IEEE 33rd International Workshop on Machine Learning for Signal Processing (MLSP)(2023)

引用 0|浏览2
暂无评分
摘要
Many different modern applications require systems that are able to track sound sources. To this end, recent methodologies combine microphone array signal processing with deep learning. In this paper, a new technique for neural network training using multitask learning is proposed. To illustrate the new approach, a state-of-the-art architecture for source direction of arrival (DOA) tracking based on icosahedral convolutional neural networks (icoCNN) was taken as a starting point for the novel training method. In the proposed strategy, a deconvolutional layer for hexagonal grid icosahedral maps was employed in an autoencoder pre-training stage, in which SNR and T60 classification networks are trained in parallel. The AutoEncoder+icoCNN (AEicoCNN) combination was able to achieve a 6% reduction in root mean square angular error when compared to the regular icoCNN training.
更多
查看译文
关键词
microphone array signal processing,sound source tracking,deep learning,multitask learning,icosahedral convolutional neural networks
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要