A 6 mW, 5,000-Word Real-Time Speech Recognizer Using WFST Models

Solid-State Circuits, IEEE Journal of  (2015)

引用 50|浏览16
暂无评分
摘要
We describe an IC that provides a local speech recognition capability for a variety of electronic devices. We start with a generic speech decoder architecture that is programmable with industry-standard WFST and GMM speech models. Algorithm and architectural enhancements are incorporated in order to achieve real-time performance amid system-level constraints on internal memory size and external memory bandwidth. A 2.5 × 2.5 mm test chip implementing this architecture was fabricated using a 65 nm process. The chip performs a 5,000 word recognition task in real-time with 13.0% word error rate, 6.0 mW core power consumption, and a search efficiency of approximately 16 nJ per hypothesis.
更多
查看译文
关键词
Gaussian processes,speech recognition,GMM speech models,WFST models,external memory bandwidth,generic speech decoder architecture,internal memory size,local speech recognition capability,real-time speech recognizer,CMOS digital integrated circuits,Gaussian mixture models (GMM),low-power electronics,speech recognition,weighted finite-state transducers (WFST)
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要