Construction of Sensitive Image Datasets Based on Generative Methods

2023 12th International Conference of Information and Communication Technology (ICTech)(2023)

引用 0|浏览0
暂无评分
摘要
Due to the special nature of sensitive information, available sensitive data resources are relatively scarce to meet the needs of research in this field. The purpose of this paper is to construct a large sensitive image dataset by generative methods, with a view to supporting research exploration in related aspects. To achieve this goal, we adopt a new idea. First, we crawl sensitive and non-sensitive class datasets from web pages and filter them manually. Here, the sensitive classes mainly involve violence, terror and pornography; second, the selected data are used as the training set to fine-tune the pre-trained generative model; then, the fine-tuned generative model is used for sensitive/non-sensitive image generation, which expands the training dataset; finally, the generated training set is passed into the classification model to do the classification task for result verification. The experimental results show that the classification results of the expanded dataset using the generative method are higher than those of the original dataset, which proves the effectiveness of the generative method to construct the dataset.
更多
查看译文
关键词
sensitive information,dataset,generate,classfication
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要