Load-Balanced Cluster for Scale-Out Storage of Knowledge

Zheng Xiong, Guocheng Zhu,Wei Yu, Sen Wang,Zhihong Chong

2018 Sixth International Conference on Advanced Cloud and Big Data (CBD)(2018)

引用 2|浏览6
暂无评分
摘要
In the face of massive data, Knowledge Graph (KG) needs the scale-out storage schema and distributed parallel query engine to guarantee its storage and query performance. In this paper, we propose a Knowledge Graph Storage Access System (KGSAS) based on HBase to deal with these problems. Our approach presents a scalable storage schema which uses random prefix and the pre-partition operation to ensure load-balanced entity storage. Besides presenting the storage schema, in order to improve query efficiency, we propose two distributed parallel query engines: HBase with Spark and HBase with Coprocessor. The HBase with Spark engine accelerate queries in parallel by using the memory calculation on Spark. The HBase with Coprocessor engine utilizes inverted index and Coprocessor technology to speed up queries by scanning cluster in parallel. The evaluation results show that HBase with Coprocessor engine has the better performance for querying KG.
更多
查看译文
关键词
Knowledge Graph,HBase,Storage,Query,Load Balanced
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要