Exploring Deep Multimodal Fusion Of Text And Photo For Hate Speech Classification

Fan Yang,Xiaochang Peng,Gargi Ghosh, Reshef Shilon,Hao Ma, Eider Moore, Goran Predovic

THIRD WORKSHOP ON ABUSIVE LANGUAGE ONLINE(2019)

引用 37|浏览6
暂无评分
摘要
Interactions among users on social network platforms are usually positive, constructive and insightful. However, sometimes people also get exposed to objectionable content such as hate speech, bullying, and verbal abuse etc. Most social platforms have explicit policy against hate speech because it creates an environment of intimidation and exclusion, and in some cases may promote real-world violence. As users' interactions on today's social networks involve multiple modalities, such as texts, images and videos, in this paper we explore the challenge of automatically identifying hate speech with deep multimodal technologies, extending previous research which mostly focuses on the text signal alone. We present a number of fusion approaches to integrate text and photo signals. We show that augmenting text with image embedding information immediately leads to a boost in performance, while applying additional attention fusion methods brings further improvement.
更多
查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要