Aggregating Local Deep Features for Image Retrieval.

2015 IEEE International Conference on Computer Vision (ICCV), 2015

Abstract
Several recent works have shown that image descriptors produced by deep convolutional neural networks provide state-of-the-art performance for image classification and retrieval problems. It has also been shown that the activations from the convolutional layers can be interpreted as local features describing particular image regions. These local features can be aggregated using aggregation methods developed for local features (e.g. Fisher vectors), thus providing a new, powerful global descriptor. In this paper we investigate possible ways to aggregate local deep features to produce compact descriptors for image retrieval. First, we show that deep features and traditional hand-engineered features have quite different distributions of pairwise similarities, hence existing aggregation methods have to be carefully re-evaluated. Such re-evaluation reveals that, in contrast to shallow features, the simple aggregation method based on sum pooling provides the best performance for deep convolutional features. This method is efficient, has few parameters, and bears little risk of overfitting when, e.g., learning the PCA matrix. In addition, we suggest a simple yet efficient query expansion scheme suitable for the proposed aggregation method. Overall, the new compact global descriptor considerably improves the state of the art on four common benchmarks.
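The sum-pooling aggregation described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes the convolutional activations are already available as an array of shape (H, W, C), uses random data in place of real network outputs, and the helper names (`sum_pool`, `fit_pca`, `compress`), the L2 normalization steps, and the descriptor size of 8 are illustrative choices rather than details taken from the abstract.

```python
import numpy as np

def sum_pool(feature_map):
    """Sum a (H, W, C) activation map over all spatial positions -> (C,)."""
    return feature_map.reshape(-1, feature_map.shape[-1]).sum(axis=0)

def l2_normalize(x, eps=1e-12):
    """Scale vectors (along the last axis) to unit Euclidean length."""
    return x / (np.linalg.norm(x, axis=-1, keepdims=True) + eps)

def fit_pca(descriptors, dim):
    """Learn a PCA projection (mean and top `dim` components) via SVD."""
    mean = descriptors.mean(axis=0)
    _, _, vt = np.linalg.svd(descriptors - mean, full_matrices=False)
    return mean, vt[:dim]

def compress(descriptors, mean, components):
    """Project onto the PCA components and re-normalize."""
    return l2_normalize((descriptors - mean) @ components.T)

# Stand-in for conv-layer activations of 20 images (assumption: random data).
rng = np.random.default_rng(0)
maps = rng.random((20, 7, 7, 64))

# One global descriptor per image: sum over the local features.
pooled = l2_normalize(np.stack([sum_pool(m) for m in maps]))

# Compact descriptors via a learned PCA matrix.
mean, components = fit_pca(pooled, dim=8)
compact = compress(pooled, mean, components)

# Retrieval: rank database images by cosine similarity to image 0.
scores = compact @ compact[0]
ranking = np.argsort(-scores)
```

Because the only learned component in this sketch is the PCA projection, there is little to tune, which matches the abstract's point that the method has few parameters.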
Keywords
compact global descriptor, query expansion scheme, PCA matrix, deep convolutional feature, sum pooling, shallow feature, aggregation method, pairwise similarity, hand-engineered feature, compact descriptor, Fisher vector, aggregating method, image region, local feature, convolutional layer, image retrieval problem, image classification, deep convolutional neural network, image descriptor, local deep feature