RABID -- A General Distributed R Processing Framework Targeting Large Data-Set Problems

Big Data(2013)

引用 7|浏览0
暂无评分
摘要
Large-scale data mining and deep data analysis are in high demand in modern enterprises. This work describes the RABID (R Analytics for BIg Data) framework to provide a highly parallel R. We achieve the goal of providing data analysts with an easy-to-use R interface to effectively perform deep data analysis on clusters by integrating R and a MapReduce-like platform. By leveraging a distributed runtime system, our framework enables R, the single-threaded language, to efficiently perfrom parallel analysis of data that cannot fit into a single shared memory machine in parallel. Experiments of data mining benchmarks on our framework show promising results.
更多
查看译文
关键词
r processing framework,easy-to-use r interface,big data,r analytics,data mining benchmarks,mapreduce-like platform,deep data analysis,data analyst,perfrom parallel analysis,large data-set problems,large-scale data mining,parallel r,data analysis,data mining
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要