Speech Emotion Recognition Based on Acoustic Segment Model

Siyuan Zheng,Jun Du,Hengshun Zhou,Xue Bai,Chin-Hui Lee,Shipeng Li

2021 12th International Symposium on Chinese Spoken Language Processing (ISCSLP)（2021）

引用 2|浏览52

暂无评分

摘要

Accurate detection of emotion from speech is a challenging task due to the variability in speech and emotion. In this paper, we propose a speech emotion recognition (SER) method based on acoustic segment model (ASM) to deal with this issue. Specifically, speech with different emotions is segmented more finely by ASM. Each of these acoustic segments is modeled by Hidden Markov Models (HMMs) and decoded into a series of ASM sequences in an unsupervised way. Then feature vectors are obtained from these sequences above by latent semantic analysis (LSA). Finally, these feature vectors are fed to a classifier. Validated on the IEMOCAP corpus, results demonstrate the proposed method outperforms the state-of-the-art methods with a weighted accuracy of 73.9% and an unweighted accuracy of 70.8% respectively.

查看译文

关键词

speech emotion recognition,acoustic segment model,latent semantic analysis

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要