Denserecognition Of Spoken Languages

Jaybrata Chakraborty,Bappaditya Chakraborty,Ujjwal Bhattacharya

2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR)（2020）

引用 2|浏览2

暂无评分

摘要

In the present study, we have considered a large number (27) of Indian languages for recognition from their speech signals of different sources. A dense convolutional network architecture (DenseNet) has been used for this classification task. Dynamic elimination of low energy frames from the input speech signal has been considered as a preprocessing operation. Melspectrogram of pre-processed speech signal is foci as input to the DenseNet architecture. Language recognition performance of this architecture has been compared with that of several state-of-the-art deep architectures which include a convolutional neural network (CNN), RosNet, CNN-BLSTM and DenseNet-BLSTM hybrid architectures. Additionally, we obtained recognition performances of a stacked BLSTM architecture fed with different sets of handcrafted features for comparison purpose. Simulations for both speaker independent and speaker dependent scenarios have been performed on two different standard datasets which include (i) IITKGP-MLILSC dataset of news clips in 27 different Indian languages and (ii) Linguistic Data Consortium (LDC) dataset of telephonic conversations in 5 different Indian languages. In each case, recognition performance of the DenseNet architecture along with Mel-spectrogram features has been found to be significantly better than all other frameworks implemented in this study.

查看译文

关键词

speech signals,dense convolutional network architecture,classification task,dynamic elimination,low energy frames,input speech signal,preprocessing operation,pre-processed speech signal,DenseNet architecture,language recognition performance,state-of-the-art deep architectures,convolutional neural network,CNN-BLSTM,DenseNet-BLSTM hybrid architectures,stacked BLSTM architecture,speaker dependent scenarios,different standard datasets,27 different Indian languages,5 different Indian languages,Mel-spectrogram features

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要