Think Beyond Words: Exploring Context-Relevant Visual Commonsense for Diverse Dialogue Generation.

Yiting Liu,Liang Li,Beichen Zhang,Qingming Huang

EMNLP (Findings)（2022）

引用 0|浏览28

暂无评分

摘要

Commonsense knowledge has been widely considered for building intelligent open-domain dialogue agents, aiming to generate meaningful and diverse responses.Previous works in this field usually lack the ability to effectively obtain and utilize auxiliary commonsense from the external visual world.In this paper, we argue that exploiting logical information in images related to context can be effective to enrich and steer the generation process.In view of this, we propose VICTOR, a context-relevant VIsual Commonsense enhanced dialogue gen-eraTOR for generating coherent and informative responses.To obtain the associated visual commonsense, we devise a novel approach that expands topic words on the knowledge graph and maps them into daily scenarios.During the generation, the model adopts multimodal fusion mechanism to integrate visual and textual information, and adaptively combine their decoding distributions for better response generation.The experimental results on two public datasets show that our proposed method outperforms the latest competitive methods in terms of coherence and diversity.

查看译文

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要