A Deep Neural Network For Audio-Visual Person Recognition

Mohammad Rafiqul Alam,Mohammed Bennamoun,Roberto Togneri,Ferdous Sohel

2015 IEEE 7th International Conference on Biometrics Theory, Applications and Systems (BTAS)（2015）

引用 5|浏览26

暂无评分

摘要

This paper presents applications of special types of deep neural networks (DNNs) for audio-visual biometrics. A common example is the DBN-DNN that uses the generative weights of deep belief networks (DBNs) to initialize the feature detecting layers of deterministic feed forward DNNs. In this paper, we propose the DBM-DNN that uses the generative weights of deep Boltzmann machines (DBMs) for initialization of DNNs. Then, a softmax layer is added on top and the DNNs are trained discriminatively. Our experimental results show that lower error rates can be achieved using the DBM-DNN compared to the support vector machine (SVM), linear regression-based classifier (LRC) and the DBN-DNN. Experiments were carried out on two publicly available audio-visual datasets: the VidTIMIT and MOBIO.

查看译文

关键词

audio-visual person recognition,deep neural networks,audio-visual biometrics,DBN-DNN,deep belief networks,deterministic feed forward DNN,generative weights,deep Boltzmann machines,DBM,softmax layer,error rates,support vector machine,SVM,linear regression-based classifier,LRC,audio-visual datasets,VidTIMIT,MOBIO

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要