A 6 mW, 5,000-Word Real-Time Speech Recognizer Using WFST Models

Michael Price, James Glass, Anantha P. Chandrakasan

IEEE Journal of Solid-State Circuits (2015)

Abstract

We describe an IC that provides a local speech recognition capability for a variety of electronic devices. We start with a generic speech decoder architecture that is programmable with industry-standard WFST and GMM speech models. Algorithmic and architectural enhancements are incorporated in order to achieve real-time performance amid system-level constraints on internal memory size and external memory bandwidth. A 2.5 × 2.5 mm test chip implementing this architecture was fabricated using a 65 nm process. The chip performs a 5,000-word recognition task in real time with a 13.0% word error rate, 6.0 mW core power consumption, and a search efficiency of approximately 16 nJ per hypothesis.
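
As a rough illustration of how the power and search-efficiency figures relate, the sketch below divides the quoted core power by the quoted energy per hypothesis. It assumes, purely for illustration, that both figures refer to the same operating point and that all core power is spent on search, so the result is at best an upper bound on hypothesis throughput. The 10 ms frame period is also an assumption (a common choice in speech front ends), not a figure from the abstract.

```python
# Figures quoted in the abstract.
CORE_POWER_W = 6.0e-3        # 6.0 mW core power
ENERGY_PER_HYP_J = 16e-9     # approximately 16 nJ per hypothesis

# Assumption, not from the abstract: 10 ms frame period.
FRAME_PERIOD_S = 10e-3


def max_hypotheses_per_second(power_w: float, energy_per_hyp_j: float) -> float:
    """Upper bound on hypothesis rate if all power went to search."""
    return power_w / energy_per_hyp_j


rate = max_hypotheses_per_second(CORE_POWER_W, ENERGY_PER_HYP_J)
per_frame = rate * FRAME_PERIOD_S

print(f"upper bound: {rate:,.0f} hypotheses/s")
print(f"upper bound: {per_frame:,.0f} hypotheses per assumed 10 ms frame")
```

Under these assumptions the bound works out to roughly 375,000 hypotheses per second. The actual rate on the chip depends on how much of the core power goes to search versus acoustic modeling and other blocks, which the abstract does not break down.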

Keywords

Gaussian processes, speech recognition, GMM speech models, WFST models, external memory bandwidth, generic speech decoder architecture, internal memory size, local speech recognition capability, real-time speech recognizer, CMOS digital integrated circuits, Gaussian mixture models (GMM), low-power electronics, weighted finite-state transducers (WFST)
