Improving the Classification of Drunk Texting in Tweets Using Semantic Enrichment

Marcos A. Grzeça,Karin Becker,Renata Galante

2018 IEEE/WIC/ACM International Conference on Web Intelligence (WI)（2018）

引用 2|浏览14

暂无评分

摘要

Excessive alcohol consumption is a worldwide problem, and social networks such as Twitter can provide valuable data that help understanding factors related to alcoholism, particularly among youngsters. The identification of drunk tweets (i.e. posted under the influence of alcohol) is complex because tweets are short, sparse and written with diverse and internet specific vocabulary, possibly with errors due to alcohol influence. In this paper, we propose an enriching framework that integrates conceptual and semantic features that expand and generalize the vocabulary, providing context to tweet terms. It also handles misspellings and the selection of discriminative features resulting from contextual enrichment. We outperformed the baseline, achieving improvements of 13.79 percentage points in recall, with no significant harm to precision. We illustrate the value of drunk tweets classification by developing an exploratory analysis that reveals drunk tweeters demographics and tweet properties.

查看译文

关键词

drunk texting, social networks, semantic enrichment, LOD, conceptual enrichment

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要