Deep Multi-Patch Aggregation Network For Image Style, Aesthetics, And Quality Estimation

Xin Lu,Zhe Lin,Xiaohui Shen,Radomir Mech,James Z. Wang

2015 IEEE International Conference on Computer Vision (ICCV)（2015）

引用 353|浏览74

暂无评分

摘要

This paper investigates problems of image style, aesthetics, and quality estimation, which require fine-grained details from high-resolution images, utilizing deep neural network training approach. Existing deep convolutional neural networks mostly extracted one patch such as a downsized crop from each image as a training example. However, one patch may not always well represent the entire image, which may cause ambiguity during training. We propose a deep multi-patch aggregation network training approach, which allows us to train models using multiple patches generated from one image. We achieve this by constructing multiple, shared columns in the neural network and feeding multiple patches to each of the columns. More importantly, we propose two novel network layers (statistics and sorting) to support aggregation of those patches. The proposed deep multi-patch aggregation network integrates shared feature learning and aggregation function learning into a unified framework. We demonstrate the effectiveness of the deep multi-patch aggregation network on the three problems, i.e., image style recognition, aesthetic quality categorization, and image quality estimation. Our models trained using the proposed networks significantly outperformed the state of the art in all three applications.

查看译文

关键词

deep multipatch aggregation network,image aesthetics,image quality estimation,high-resolution images,deep neural network training approach,deep convolutional neural networks,down-sized crop,deep multipatch aggregation network training approach,feature learning,aggregation function learning,aesthetic quality categorization,image style recognition

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要