Autoencoder-based Unsupervised Domain Adaptation for Speech Emotion Recognition

IEEE Signal Process. Lett.(2014)

引用 286|浏览92
暂无评分
摘要
With the availability of speech data obtained from different devices and varied acquisition conditions, we are often faced with scenarios, where the intrinsic discrepancy between the training and the test data has an adverse impact on affective speech analysis. To address this issue, this letter introduces an Adaptive Denoising Autoencoder based on an unsupervised domain adaptation method, where prior knowledge learned from a target set is used to regularize the training on a source set. Our goal is to achieve a matched feature space representation for the target and source sets while ensuring target domain knowledge transfer. The method has been successfully evaluated on the 2009 INTERSPEECH Emotion Challenge's FAU Aibo Emotion Corpus as target corpus and two other publicly available speech emotion corpora as sources. The experimental results show that our method significantly improves over the baseline performance and outperforms related feature domain adaptation methods.
更多
查看译文
关键词
signal denoising,speech recognition,speech data availability,acquisition conditions,target domain knowledge transfer,interspeech emotion challenge,domain adaptation,fau aibo emotion corpus,emotion recognition,affective speech analysis,speech emotion recognition,feature space representation,adaptive denoising autoencoder,adaptive denoising autoencoders,autoencoder-based unsupervised domain adaptation,speech emotion corpora,unsupervised learning,speech,noise reduction,vectors
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要