A Deep Neural Network For Audio-Visual Person Recognition

2015 IEEE 7th International Conference on Biometrics Theory, Applications and Systems (BTAS)(2015)

引用 5|浏览26
暂无评分
摘要
This paper presents applications of special types of deep neural networks (DNNs) for audio-visual biometrics. A common example is the DBN-DNN that uses the generative weights of deep belief networks (DBNs) to initialize the feature detecting layers of deterministic feed forward DNNs. In this paper, we propose the DBM-DNN that uses the generative weights of deep Boltzmann machines (DBMs) for initialization of DNNs. Then, a softmax layer is added on top and the DNNs are trained discriminatively. Our experimental results show that lower error rates can be achieved using the DBM-DNN compared to the support vector machine (SVM), linear regression-based classifier (LRC) and the DBN-DNN. Experiments were carried out on two publicly available audio-visual datasets: the VidTIMIT and MOBIO.
更多
查看译文
关键词
audio-visual person recognition,deep neural networks,audio-visual biometrics,DBN-DNN,deep belief networks,deterministic feed forward DNN,generative weights,deep Boltzmann machines,DBM,softmax layer,error rates,support vector machine,SVM,linear regression-based classifier,LRC,audio-visual datasets,VidTIMIT,MOBIO
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要