EaCRS: an extendible array based compression scheme for high dimensional data

SoICT '11: Proceedings of the 2nd Symposium on Information and Communication Technology(2011)

引用 3|浏览0
暂无评分
摘要
Multidimensional arrays are becoming important data structure for handling large scale multidimensional data; e.g., in scientific databases or MOLAP databases. Due to the increasing size of the data warehouses and high degree of sparsity, it becomes necessity to develop a suitable scheme to compress the multidimensional array in an efficient way so that it takes comparatively low memory storage. In this paper, we propose a new compression scheme namely extendible array based Compressed Row Storage (EaCRS), for large multidimensional sparse array. The main idea of this scheme is to compress the subarrays found from the existing extendible array using CRS method. To evaluate the proposed scheme, we compare it to the CRS on Traditional multidimensional array (TMA). Both analytical analysis and experimental test were conducted. In the analytical analysis, we analyze the CRS and EaCRS schemes in terms of the space requirement and the maximum range of usable data density for practical applications. The analytical analysis and experimental results show that the EaCRS scheme is superior to the CRS scheme for all the evaluated criteria.
更多
查看译文
关键词
crs scheme,multidimensional array,analytical analysis,new compression scheme,suitable scheme,existing extendible array,proposed scheme,traditional multidimensional array,high dimensional data,eacrs scheme,extendible array,compression ratio,data warehouse,data structure
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要