Cvx-Optimized Beamforming And Vector Taylor Series Compensation With German Asr Employing Star-Shaped Microphone Array

IberSPEECH 2014: Proceedings of the Second International Conference on Advances in Speech and Language Technologies for Iberian Languages - Volume 8854(2014)

引用 0|浏览27
暂无评分
摘要
This paper addresses the problem of distant speech recognition in reverberant noisy conditions employing a star-shaped microphone array and vector Taylor series (VTS) compensation. First, a beamformer yields an enhanced single-channel signal by applying convex (CVX) optimization over three spatial dimensions given the spatio-temporal position of the target speaker as prior knowledge. Then, VTS compensation is applied over the speech features extracted from the temporal signal obtained by the beamformer. Finally, the compensated features are used for speech recognition. Due to a lack of existing resources in German to evaluate the proposed enhancement framework, this paper also introduces a new speech database. In particular, we present a medium-vocabulary German database for microphone array made of embedded clean signals contaminated with real room impulsive responses and mixed in a 'natural' way with real noises. We show that the proposed enhancement framework performs better than other related systems on the presented database.
更多
查看译文
关键词
distant speech recognition,cvx-optimized beamforming,vector Taylor series compensation,star-shaped microphone array,reverberant and noisy environment,natural mixing,German database
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要