Multi-Cluster Text Mining on the Grid using the D-Grid UNICORE environment

msra(2007)

Abstract
Text mining is inherently more computation-intensive than information retrieval on pre-structured data, and it requires the transfer and filtering of huge amounts of data. Grid environments provide a suitable infrastructure for accomplishing these tasks. We present the mapping and implementation of a standard text mining workflow for the analysis of biomedical text data from PubMed onto a D-Grid UNICORE environment with multiple PC clusters. Furthermore, we discuss the gain in applicability, the open issues of our solution, and possible future enhancements.
Keywords
data grid, text mining, structured data, grid computing, information retrieval