Speaker Identification Enhancement Using Emotional Features

COMPUTATIONAL COLLECTIVE INTELLIGENCE, ICCCI 2023(2023)

引用 0|浏览7
暂无评分
摘要
Speaker recognition is a broad field that encompasses many different tasks related to identifying speakers in audio recordings. Two specific sub-tasks that are often studied are speaker segmentation and speaker identification. These tasks typically involve analyzing acoustic features of the audio to determine who is speaking. However, one limitation of traditional speaker identification methods is that they can struggle when dealing with emotional conversations, as the acoustic features can change due to the emotions being expressed. To address this limitation, focuses on studying the effect of emotion on speaker identification by combining features of both the emotions and speakers. This approach has shown to improve identification accuracy, increasing it from 72% using speaker features alone to 75% when both emotion and speaker features are used.
更多
查看译文
关键词
SR,Speaker Segmentation,Triplet Loss,Emotion Recognition,CNN,Bi-LSTM
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要