CountNet: Estimating the Number of Concurrent Speakers Using Supervised Learning.

IEEE/ACM Transactions on Audio, Speech, and Language Processing(2019)

引用 50|浏览120
暂无评分
摘要
Estimating the maximum number of concurrent speakers from single-channel mixtures is a challenging problem and an essential first step to address various audio-based tasks such as blind source separation, speaker diarization, and audio surveillance. We propose a unifying probabilistic paradigm, where deep neural network architectures are used to infer output posterior distributions. These probabil...
更多
查看译文
关键词
Estimation,Task analysis,Speech processing,Neural networks,Microphones,Surveillance
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要