
Self-supervised Learning Framework for Speaker Localisation with a Humanoid Robot

International Conference on Development and Learning (2021)

Abstract
Locating a speaker in space is a skill that plays an essential role in smooth and natural social interaction. Equipping robots with this ability could lead to more fluid human-robot interaction, in part by facilitating voice recognition in noisy environments. Most recently proposed sound localisation systems rely on model-based approaches. However, their performance depends on carefully chosen parameters, especially in the binaural and noisy settings typical of humanoid setups. The need for fine-tuning and for adaptation to new environments is a considerable obstacle to the use and portability of such systems in real human-robot interaction scenarios. To overcome these limitations, we propose to rely on data-driven approaches (i.e., deep learning) and to exploit multi-sensory mechanisms that leverage the robot's direct experience during an interaction. Taking inspiration from how humans use vision to calibrate their auditory space representation through experience, we enabled the robot to learn to localise a speaker in a self-supervised way. Our results show that this approach is suitable for learning to localise speakers in the challenging environments typical of human-robot collaboration.
Keywords
humanoid robot, natural social interactions, human-robot interaction, voice recognition, noisy environments, sound localisation systems, model-based approaches, binaural settings, humanoids setups, human-robot interaction scenarios, data-driven approaches, deep learning, multisensory mechanisms, auditory space representation, human-robot collaboration, self-supervised learning framework, speaker localisation