Enhanced Task-based Knowledge for Lexicon-based Approach in Vietnamese Hate Speech Detection

Suong N. Hoang,Binh Nguyen, Nam P. Nguyen,Son T. Luu,Hieu T. Phan,Hien D. Nguyen

2022 14th International Conference on Knowledge and Systems Engineering (KSE)（2022）

引用 1|浏览7

暂无评分

摘要

The explosion of free-text content on social media has brought the exponential propagation of hate speech. The definition of hate speech is well-defined in the community guidelines of many popular platforms such as Facebook, Tiktok, and Twitter, where any communication judges towards the minor, protected groups are considered hateful content. This paper first points out the sophisticated word-play of malicious users in a Vietnamese Hate Speech (VHS) Dataset. The Center Loss in the training process to disambiguate the task-based sentence embedding is proposed for improving generalizations of the model. Moreover, a task-based lexical attention pooling is also proposed to highlight lexicon-level information and then combined into sentence embedding. The experimental results show that the proposed method improves the F1 score in the ViHSD dataset, while the training time and inference speed are insignificantly changed.

查看译文

关键词

hate speech detection,text classification,slot attention,BERT,transformer

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要