Spatial location from running binaural signals using cepstral analysis and logarithmic filtering

Journal of the Acoustical Society of America(2023)

引用 0|浏览0
暂无评分
摘要
This paper proposes an acoustic model for predicting the acoustical room characteristics from a running binaural signal. This is accomplished via training a convolutional neural network on a precedence effect model to extract the spatial locations of the direct sound source and its early reflections. The precedence effect model extends and modifies the BICAM algorithm with cepstral analysis [Tyler, J., Si, M., \& Braasch, J. Acoust. Soc. Am. 151] and a logarithmic filter. The logarithmic filter takes human perception into account and provides better separation at higher frequencies. A synthetic dataset of binaural signals was generated using anechoic orchestral recordings with added reflections and reverberations. The binaural model generates binaural activity maps from binaural input signals, which are then used to train a convolutional neural network. The ability to predict the traits of a direct sound source and its reflections has applications in academic areas like perceptual modeling and room acoustical analysis. It can also be applied to industrial areas such as television and movies, video games, and augmented and virtual reality, to name a few. [Work supported by the National Science Foundation: HCC-1909229.]
更多
查看译文
关键词
binaural signals,cepstral analysis,spatial location,logarithmic filtering
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要