Web Search Results Clustering Based on a Novel Suffix Tree Structure

Junze Wang,Yijun Mo,Benxiong Huang,Jie Wen,Li He

AUTONOMIC AND TRUSTED COMPUTING, PROCEEDINGS（2008）

引用 32|浏览2

暂无评分

摘要

Web search results clustering are navigator for users to search needed results. With suffix tree clustering (STC), search results can be clustered fast, automatically, and each cluster is labeled with a common phrase. Due to the large memory requirement of suffix tree, some other approaches have been proposed, with lower memory requirement. But unlike other algorithms, STC is an incremental algorithm and a promising approach to work on a long list of snippets returned by search engines. In this paper we proposed an approach for web search results clustering and labeling, based on a new suffix tree data structure. The approach is an incremental and linear time algorithm, with significantly lower memory requirements. This approach also labels every final cluster a common phrase, thus it is suitable for quickly browsing by users. Experimental results show that the new approach has better performance than that of conventional web search result clustering.

查看译文

关键词

web search result,conventional web search result,novel suffix tree structure,suffix tree clustering,common phrase,lower memory requirement,promising approach,search engine,large memory requirement,new approach,web search results,search result,data structure,document clustering

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要