Advances in All-Neural Speech Recognition

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)(2017)

引用 126|浏览325
暂无评分
摘要
This paper advances the design of CTC-based all-neural (or end-to-end) speech recognizers. We propose a novel symbol inventory, and a novel iterated-CTC method in which a second system is used to transform a noisy initial output into a cleaner version. We present a number of stabilization and initialization methods we have found useful in training these networks. We evaluate our system on the commonly used NIST 2000 conversational telephony test set, and significantly exceed the previously published performance of similar systems, both with and without the use of an external language model and decoding technology.
更多
查看译文
关键词
recurrent neural network,CTC,speech recognition,end-to-end training
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要