Chrome Extension
WeChat Mini Program
Use on ChatGLM

Generating Labeled Samples Based on Improved Cdcgan for Hyperspectral Data Augmentation: A Case Study of Drought Stress Identification of Strawberry Leaves

COMPUTERS AND ELECTRONICS IN AGRICULTURE(2024)

Zhejiang Univ Technol | Zhejiang SLA Engn Design Co | Hangzhou City Univ | Zhejiang Univ

Cited 0|Views5
Abstract
Deep learning has been increasingly adopted in analyzing the hyperspectral imaging (HSI) data, and a large amount of high-quality labeled dataset is indispensable for its superior modeling performance. However, the acquisition of large-scale annotated spectral data is very time-consuming and costly. To address this challenge, this study proposed an improved conditional Deep Convolutional Generative Adversarial Network (cDCGAN) model to generate labeled HSI samples for data augmentation. The identification of drought stressed strawberry leaves was taken as the study object. Seven small-sample training datasets were constructed with sample sizes of 6, 10, 20, 30, 40, 50, 70 and 100, respectively. Different cDCGAN architectures were tested by varying the network depth and label concatenation pattern. An Indicator of Generated Data Quality (IGDQ) was proposed to evaluate the quality of generated spectra for exploring the optimal architecture of cDCGAN. Then, on each of seven training datasets with limited samples, high-quality pseudo spectral data were generated using the proposed cDCGAN and merged to the original training datasets for data augmentation. Residual Network (ResNet) classifier was established respectively before and after data augmentation. Conventional machine learning classifiers, including Support Vector Machine (SVM) and Decision Tree (DT), were also constructed. Results showed that the accuracy of ResNet, SVM, and DT improved by an average of 6.9%, 3.4%, and 3.1%, respectively, after data augmentation. Moreover, the minimal sample size achieving effective data augmentation could be as low as 20, its augmented datasets achieved comparable or even superior accuracy than the original training dataset with 100 samples. The various aspects affecting the quality of generated spectral data were also discussed, including different model frameworks (cDCGAN and cWGAN) and architectures. The overall results demonstrated that the proposed cDCGAN model achieved satisfactory results on the small-sample datasets of drought stressed strawberry leaves. This method has great potential for the common scenario of imbalanced or small-sample datasets in the domain of plant science.
More
Translated text
Key words
cDCGAN,Spectral generation,Data augmentation,Hyperspectral data,Drought stressed leaves
求助PDF
上传PDF
Bibtex
AI Read Science
AI Summary
AI Summary is the key point extracted automatically understanding the full text of the paper, including the background, methods, results, conclusions, icons and other key content, so that you can get the outline of the paper at a glance.
Example
Background
Key content
Introduction
Methods
Results
Related work
Fund
Key content
  • Pretraining has recently greatly promoted the development of natural language processing (NLP)
  • We show that M6 outperforms the baselines in multimodal downstream tasks, and the large M6 with 10 parameters can reach a better performance
  • We propose a method called M6 that is able to process information of multiple modalities and perform both single-modal and cross-modal understanding and generation
  • The model is scaled to large model with 10 billion parameters with sophisticated deployment, and the 10 -parameter M6-large is the largest pretrained model in Chinese
  • Experimental results show that our proposed M6 outperforms the baseline in a number of downstream tasks concerning both single modality and multiple modalities We will continue the pretraining of extremely large models by increasing data to explore the limit of its performance
Upload PDF to Generate Summary
Must-Reading Tree
Example
Generate MRT to find the research sequence of this paper
Data Disclaimer
The page data are from open Internet sources, cooperative publishers and automatic analysis results through AI technology. We do not make any commitments and guarantees for the validity, accuracy, correctness, reliability, completeness and timeliness of the page data. If you have any questions, please contact us by email: report@aminer.cn
Chat Paper
GPU is busy, summary generation fails
Rerequest