谷歌浏览器插件
订阅小程序
在清言上使用

A text clustering algorithm based on category resolve power

Information Science and Engineering(2010)

引用 0|浏览1
暂无评分
摘要
As an unsupervised machine learning technology, document clustering has been widely used in many fields, such as Information Retrieval (IR) and Text Categorization (TC). But, because of the bag of words used in document clustering as document index, the feature space of corpus must be high dimension space. This problem makes a negative effect to the efficiency and precision of text clustering. Based on category resolve power, a new feature selection function is constructed. Through integration between this function and document clustering algorithm, a high-powered text clustering algorithm is presented. Experiments on a universal corpus show that it has a good performance.
更多
查看译文
关键词
dimension reduction,feature selection,result evaluaton,text clustering,
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要