Exemplar-Based Speech Enhancement For Deep Neural Network Based Automatic Speech Recognition

Deepak Baby, Jort F. Gemmeke, Tuomas Virtanen, Hugo Van Hamme

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (2015)


Abstract

Deep neural network (DNN) based acoustic modelling has been successfully used for a variety of automatic speech recognition (ASR) tasks, thanks to its ability to learn higher-level information using multiple hidden layers. This paper investigates the recently proposed exemplar-based speech enhancement technique using coupled dictionaries as a pre-processing stage for DNN-based systems. In this setting, the noisy speech is decomposed as a weighted sum of atoms in an input dictionary containing exemplars sampled from a domain of choice, and the resulting weights are applied to a coupled output dictionary containing exemplars sampled in the short-time Fourier transform (STFT) domain to directly obtain the speech and noise estimates for speech enhancement. In this work, settings using input dictionaries of exemplars sampled from the STFT, Mel-integrated magnitude STFT and modulation envelope spectra are evaluated. Experiments performed on the AURORA-4 database revealed that these pre-processing stages can improve the performance of DNN-HMM-based ASR systems with both clean and multi-condition training.
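The coupled-dictionary idea described in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation, and several parts are assumptions made for illustration: the weights are estimated with standard multiplicative updates for the KL divergence, exemplars span a single frame, the final Wiener-like gain is one common way to turn the speech and noise estimates into an enhanced spectrum, and all names (`estimate_weights`, `enhance`, `pool`) and the random toy dictionaries are hypothetical.

```python
import numpy as np

def estimate_weights(Y, A, n_iter=200, eps=1e-12):
    """Non-negative weights X such that A @ X approximates Y.

    Assumption: multiplicative updates for the KL divergence;
    the abstract does not specify the solver.
    """
    rng = np.random.default_rng(0)
    X = rng.random((A.shape[1], Y.shape[1])) + eps
    norm = A.sum(axis=0)[:, None] + eps  # column sums of A
    for _ in range(n_iter):
        X *= (A.T @ (Y / (A @ X + eps))) / norm
    return X

def enhance(Y_in, Y_stft, A_speech, A_noise, B_speech, B_noise, eps=1e-12):
    """Decompose in the input domain, reconstruct in the STFT domain."""
    # Weights are found using the input dictionary (speech + noise atoms).
    X = estimate_weights(Y_in, np.hstack([A_speech, A_noise]))
    n_s = A_speech.shape[1]
    # The same weights are applied to the coupled STFT-domain dictionary.
    S_hat = B_speech @ X[:n_s]   # speech estimate
    N_hat = B_noise @ X[n_s:]    # noise estimate
    # Assumption: Wiener-like gain applied to the noisy magnitude STFT.
    gain = S_hat / (S_hat + N_hat + eps)
    return gain * Y_stft, S_hat, N_hat

# Toy data: 16 STFT bins, 8 crude "Mel-like" bands, 5 atoms per source.
rng = np.random.default_rng(1)
n_bins, n_bands, n_atoms, n_frames = 16, 8, 5, 10
# Hypothetical band-integration matrix: sums pairs of adjacent bins.
pool = np.kron(np.eye(n_bands), np.ones((1, n_bins // n_bands)))
B_speech = rng.random((n_bins, n_atoms))   # output (STFT) dictionaries
B_noise = rng.random((n_bins, n_atoms))
A_speech, A_noise = pool @ B_speech, pool @ B_noise  # coupled input dictionaries
Y_stft = (B_speech @ rng.random((n_atoms, n_frames))
          + B_noise @ rng.random((n_atoms, n_frames)))
enhanced, S_hat, N_hat = enhance(pool @ Y_stft, Y_stft,
                                 A_speech, A_noise, B_speech, B_noise)
```

Here the input dictionary lives in a lower-dimensional band-integrated domain while the output dictionary stays at full STFT resolution, which is the property that lets the weights computed in one domain yield estimates directly in the other.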


Keywords

deep neural networks, non-negative matrix factorisation, coupled dictionaries, speech enhancement, modulation envelope
