A Novel Incremental Dictionary Learning Method for Low Bit Rate Speech Streaming.
WISE(2018)
摘要
Speech streaming, which is widely used nowadays, cost a huge amount of transfer bandwidth and storage space. It is significant to compress them with as few bits as possible, meanwhile keep the voice clear and meaning unchanged. According to speech contexts, the proposed method can dynamically adapt to speech stream of any speaker by appending atoms to dictionary. Furthermore, in order to smoothly represent the amplitude envelopes shifting over the frequency, the dictionary is extended by Hilbert transform. The upper bounds of weights of atoms are constrained, so they can be quantized in practical applications. Experimental results show the advantages of our method. When the minimum reconstruction accuracy is 99.8%, which is applicable to general voice communications, the space saving is over 99%. Our method can adapt to the application with extreme bandwidth/storage limitation and large scale dataset, meanwhile keep reasonable perception quality.
更多查看译文
关键词
Streaming speech, Low bit rate compression, Incremental dictionary learning, Sparse coding
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络