Enhancing coal-gangue object detection using GAN-based data augmentation strategy with dual attention mechanism

Kefei Zhang,Xiaolin Yang, Liang Xu,Jesse The, Zhongchao Tan,Hesheng Yu

ENERGY(2024)

引用 0|浏览5
暂无评分
摘要
Coal separation based on computer vision has attracted substantial attention in recent years. However, developing reliable object detection models relies on large-scale annotated dataset, which in industrial practice is time-consuming and labor-intensive to obtain. In this paper, we propose a novel data augmentation model called dual attention deep convolutional generative adversarial network (DADCGAN) to expand dataset scale and improve object detection. For the first time, the proposed DADCGAN, which adopts DCGAN as its foundation architecture, introduces efficient channel attention and external attention mechanisms to capture essential feature information from the channel and spatial dimensions of images, respectively. Moreover, spectral normalization and two time-scale update rule strategies are incorporated to stabilize the training process. The implementation of our proposed data augmentation strategy includes two steps. First, traditional pixel transformation is used to expand an original small dataset. Then, our GAN-based data augmentation is executed to further expand the dataset by generating synthetic images. Experimental results show that our DADCGAN model achieves the lowest FID value, decreasing the FID by 21.30-71.96 % compared to other baseline GAN models, showcasing its ability to produce more realistic coal-gangue images. Finally, the data augmentation strategies are applied to the YOLOv4 model, enhancing the mAP by 9.26 %, highlighting its significance in enhancing coalgangue object detection. These results have important implications for the development and implementation of computer vision-based technologies, enabling the realization of cleaner and more efficient coal separation methods.
更多
查看译文
关键词
Generative adversarial network,Efficient channel attention,External attention,Coal-gangue detection
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要