SInC

Ruoyu Wang,Daniel Sun, Raymond Wong, Raj Ranjan,Albert Y. Zomaya

Knowledge-Based Systems(2022)

引用 0|浏览1
暂无评分
摘要
Data compression has been widely adopted in the industry to reduce storage or bandwidth consumption by removing redundant data or encoding information. Redundancy in semantics implies that some facts in a knowledge base can be inferred from the others. For relational databases, it is possible to remove records due to semantic equivalence. In this paper, we present a purely semantic approach, which losslessly compresses relational data in the first place and also enhances data file compression to further reduce the storage. Our Semantic Inductive Compressor ( SInC ) works not only for intra-relation patterns but also inter-relation cases. SInC achieves around 1/3 to 2/3 of semantic compression ratios, and the original data can be entirely retrieved with the informative patterns induced by SInC . We apply industrial data compression tools on semantically compressed databases, and the experiment results indicate an enhanced compression ratio up to 35%. Almost all efforts in our technique turn to the enhancement.
更多
查看译文
关键词
Semantic,Data compression,Relational data,Data mining
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要