Sparse coding and normalization for deep Fisher score representation

Computer Vision and Image Understanding(2022)

引用 1|浏览4
暂无评分
摘要
Fisher scores have been shown to be accurate global image features for classification. However, their performance is very dependent on the quality of the input features as well as the normalization steps applied to them. In this paper, we propose to embed the Fisher scores in an end-to-end trainable deep network by concentrating on two elements: adapting the encoding to the deep features and normalizing the extracted second-order statistics. Therefore, we make use of a deep sparse coding module that allows to sample the center of each Gaussian function from a learned subspace and thus to better fit the high dimensional data distribution. Second, we introduce a new normalization module that computes an approximate square root matrix normalization well adapted to the Fisher scores. These processing steps are embedded in a deep network so that all the modules work together for the sole purpose of improving classification performance. Experimental results show that this solution outperforms many alternatives in the context of material, indoor scene and fine-grained image classification.
更多
查看译文
关键词
Fisher score,Sparse coding,Orderless pooling,Square root normalization,Classification
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要