Increased Query2Label (IQ) for Small Fine-grained Multi-label Classification

Thinh Tran Huu Nguyen,Phuc Nguyen,Van Phuc Nguyen, Linh H. G. Tran, Manh Van Le,Binh T. Nguyen

2023 15th International Conference on Knowledge and Systems Engineering (KSE)(2023)

引用 0|浏览0
暂无评分
摘要
Multi-label image classification aims to identify all object labels within an image. Small objects and similar objects are still the primary difficulties of previous related models due to the representative capacity of convolutional kernels. Recent vision transformer networks employ the attention mechanism to extract the spatial feature, which expresses richer local semantic information but is insufficient for creating high performance in fine-grained or small-object scenarios. To overcome these disadvantages, this paper proposes a solution by using the definition of the label as text embedding and adjusting the modification of the decoder stage so the model can acquire more information. The new framework is simple, efficient, and consistently outperforms all preceding works on the FATHOMNET, FAIR1M, and DOTA datasets.
更多
查看译文
关键词
Small object detection,Fine-grained visual categorization
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要