An Approach Of Vector Space Model To Link Concrete Concepts With Wiki Entities

2015 IEEE International Conference on Computer and Information Technology; Ubiquitous Computing and Communications; Dependable, Autonomic and Secure Computing; Pervasive Intelligence and Computing(2015)

引用 3|浏览6
暂无评分
摘要
Entity Linking (EL) search and labeling are important research topics with various web applications. The challenge is to find and link the important concepts from web text to online encyclopedia databases instead of simple personal and place names. This paper presents a new approach to link concrete concepts from English texts with Wiki entities. Using part-of-speech tagging to detect concrete concepts, Vector Space Model has been applied to perform the disambiguation and selection of Wiki entities. Comparing to existing method, the proposed framework, named UnBWiki VSM, achieved satisfactory result and was adjusted with the Wikilinks database of 2.8 million entities and 18 million words. As for a case study, five web texts from the British Royal Family History were analyzed manually. The results obtained using UnBWiki VSM were satisfactory, with recall of 73.5% for the five selected texts. This process shows the perspective for the automatic Entity Linking (EL) from web pages.
更多
查看译文
关键词
entity linking,concrete concepts,text mining,Vector Space Model,Web mining
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要