CountNet: Estimating the Number of Concurrent Speakers Using Supervised Learning.

Fabian-Robert Stöter, Soumitro Chakrabarty, Bernd Edler, Emanuel A. P. Habets

IEEE/ACM Transactions on Audio, Speech, and Language Processing (2019)

Abstract

Estimating the maximum number of concurrent speakers from single-channel mixtures is a challenging problem and an essential first step to address various audio-based tasks such as blind source separation, speaker diarization, and audio surveillance. We propose a unifying probabilistic paradigm, where deep neural network architectures are used to infer output posterior distributions. These probabil...

Keywords

Estimation, Task analysis, Speech processing, Neural networks, Microphones, Surveillance
