谷歌浏览器插件
订阅小程序
在清言上使用

Efficient Creation of Japanese Tweet Emotion Dataset Using Sentence-Final Expressions

2021 IEEE 3rd Global Conference on Life Sciences and Technologies (LifeTech)(2021)

引用 3|浏览0
暂无评分
摘要
Emotion recognition in natural language text is one of the critical technologies in the human-computer interface in a wide range of fields, including health and well-being, and labeled data plays a significant role in developing such technology. This paper presents a method for efficiently collecting Japanese emotion tweets carrying the first-person's emotion using emotional expressions and sentence-final expressions. By exploiting sentence-final expressions, we can identify the targeted tweets even though the subjects of sentences are often omitted, and first-person pronouns are often not explicitly in Japanese. By applying the method to Japanese tweet data, we constructed a Japanese tweet dataset comprising 2,234 tweets with labels of emotion types and intensities for two types of emotions: joy and sadness. The evaluation results show that the proposed method can improve the collection efficiency of targeted tweets and the reliability of data labels. We developed classifiers from the dataset that recognize emotion intensities. We show that a classifier using a deep learning-based language model outperforms conventional baseline methods using a Bag of Words model and that the Japanese tweet emotion dataset constructed by our method is useful for the emotion intensity recognition.
更多
查看译文
关键词
emotion dataset,emotion intensity,emotion recognition,affective technology
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要