Multi-depth Fusion Transformer and Batch Piecewise Loss for Visual Sentiment Analysis

Haochun Ou,Chunmei Qing, Jinglun Cen,Xiangmin Xu

PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT X(2024)

引用 0|浏览1
暂无评分
摘要
Visual sentiment analysis is an important research direction in the field of affective computing, which is of great significance in user behavior prediction, visual scene construction, etc. In order to address the issue of low efficiency in high-level semantic perception of CNNs, in this paper, the Transformer architecture is introduced into the task of visual sentiment analysis, proposing the Multi-depth Fusion Transformer (MFT) model that allows nesting different architectures. Additionally, addressing the problem of imbalanced data samples and varying learning difficulties in visual sentiment analysis datasets, this paper proposes a batch piecewise loss suitable for visual sentiment binary classification tasks. Experimental results on multiple datasets demonstrate that the proposed method outperforms existing approaches. Visualization experiments comparing CNN and Transformer-based approaches demonstrate the effectiveness of the proposed method in semantic perception.
更多
查看译文
关键词
Multi-depth Fusion Transformer,Batch Piecewise Loss,Sentiment analysis
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要