Towards A Deep Speech Model For Romanian Language

2019 22ND INTERNATIONAL CONFERENCE ON CONTROL SYSTEMS AND COMPUTER SCIENCE (CSCS)(2019)

引用 2|浏览15
暂无评分
摘要
Automatic speech recognition systems have gained popularity due to their gain in terms of usability and integration in cross domain applications. While traditional approaches are developed over elaborated pipelines that need specific pre-trained models for a language (acoustic model, a phonetic dictionary, etc.), deep learning architectures like Recurrent Neural Networks have been trained for automatic speech recognition using only large datasets of speech corpora (audio and aligned transcript files). Starting from the DeepSpeech architecture, we present the performance of the model trained for Romanian language over the SWARA speech corpus which contains almost 21 hours of speech data using 17 different speakers. The experiments were focused on obtaining the best performance of the network in terms of Word Error Rate by tweaking the parameters of the model on the SWARA dataset. We present preliminary results obtained for this Romanian dataset, alongside with the encountered limitations while training the model on other languages besides English.
更多
查看译文
关键词
speech recognition, deep learning, Natural Language Processing, recurrent neural network
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要