Comprehensive-perception dynamic reasoning for visual question answering

Pattern Recognition(2022)

引用 1|浏览9
暂无评分
摘要
•The proposed comprehensive-perception dynamic reasoning model can perceive all the object features from the previous reasoning process.•The introduction of relation network as a guide for interaction between features enhances the relational reasoning capability of the model.•Employing intra- and inter-layer attention weights optimizes the importance of object features in the reasoning process.•Incorporating our CPDR module into the VLP models brings considerable performance improvements.
更多
查看译文
关键词
Cross-modal information fusion,Visual question answering,Comprehensive perception,Relational reasoning
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要