A novel visual classification framework on panoramic attention mechanism network

Wenshu Li, Shenhao Li,Lingzhi Yin,Xiaoying Guo, Xu Yang

IET COMPUTER VISION(2022)

引用 1|浏览0
暂无评分
摘要
Fine-grained classification is a challenging task due to the difficulty of finding discriminative features and the localization of feature regions. To handle these challenges, a novel visual classification framework on panoramic attention mechanism that combines multiple attention networks to locate and identify features with more semantic interest is proposed. Firstly, based on the classical convolutional neural network, the global information of the image feature is expressed by linear fusion. Secondly, the foreground attention branch is used to further extract the distinguishing details of the salient features. Then, more features are mined from the complementary object area through the background attention branch to learn more perfect fine-grained feature expression. Finally, three network branches are trained together to enhance the network's ability to express representative features of fine-grained images. Our model can be viewed as a multi-branch network, which benefits each other and optimizes the network together. Experiments were conducted on CUB-200-2011, Stanford Dogs and FGVC-Aircraft datasets, and the accuracy was used as the quantitative measurement. Experimental results show that the proposed method has the highest accuracy; the average accuracy is 89.8%. It is effective and superior to the current advanced methods.
更多
查看译文
关键词
fine-grained image classification model,multi-branch network for cooperative training,panoramic attention mechanism,significant feature extraction
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要