ALNet: An adaptive channel attention network with local discrepancy perception for accurate indoor visual localization

Hongbo Gao,Kun Dai,Ke Wang,Ruifeng Li,Lijun Zhao, Mengyuan Wu

Expert Systems with Applications(2024)

引用 0|浏览0
暂无评分
摘要
Visual localization, a fundamental component of several computer vision tasks, has been predominantly realized by scene coordinate regression (SCoRe) techniques. These methods leverage neural networks for scene coordinates prediction, followed by a PnP algorithm to recover the 6-DOF camera pose. However, similar image patches are prevalent in indoor scenes, which results in the extraction of comparable features for the regression of different scene coordinates. As a result, the localization accuracy is severely declined. In this work, we develop ALNet, a novel SCoRe method that incorporates a local discrepancy perception module (LDPM) and an adaptive channel attention module (ACAM) to address this challenge. For LDPM, our key insight lies in that scene attributes around different similar image patches are inconsistent. Technically, for each image patch, LDPM identifies a certain number of the most dissimilar patches around it and computes difference vectors to enrich its own features, thereby enabling the differentiation of similar image patches. Considering geometric attributes are beneficial for distinguishing similar patches while semantic context is conducive to encoding regression issues, integrating multi-level features is an effective approach to elevate the localization accuracy. Therefore, ACAM concatenates multi-level features together and leverages both average pooling and max pooling to generate reliable channel-wise weighting coefficient, thereby modeling the correlation among channels to integrate multi-level features effectively. Comprehensive experiments are conducted on mainstream indoor localization benchmarks and an actual environment, showing that ALNet achieves impressive performance. Source code and the experimental results video are available at https://github.com/DAMMONGAO/alnet.
更多
查看译文
关键词
Visual localization,Adaptive channel attention module,Local discrepancy perception module
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要