Scalable Semantic Web Data Management Using Vertical Partitioning.

Daniel J. Abadi,Adam Marcus,Samuel Madden, Katherine J. Hollenbach

VLDB '07: Proceedings of the 33rd international conference on Very large data bases(2007)

引用 1027|浏览425
暂无评分
摘要
Efficient management of RDF data is an important factor in realizing the Semantic Web vision. Performance and scalability issues are becoming increasingly pressing as Semantic Web technology is applied to real-world applications. In this paper, we examine the reasons why current data management solutions for RDF data scale poorly, and explore the fundamental scalability limitations of these approaches. We review the state of the art for improving performance for RDF databases and consider a recent suggestion, "property tables." We then discuss practically and empirically why this solution has undesirable features. As an improvement, we propose an alternative solution: vertically partitioning the RDF data. We compare the performance of vertical partitioning with prior art on queries generated by a Web-based RDF browser over a large-scale (more than 50 million triples) catalog of library data. Our results show that a vertical partitioned schema achieves similar performance to the property table technique while being much simpler to design. Further, if a column-oriented DBMS (a database architected specially for the vertically partitioned case) is used instead of a row-oriented DBMS, another order of magnitude performance improvement is observed, with query times dropping from minutes to several seconds.
更多
查看译文
关键词
RDF data,RDF data scale,RDF databases,Web-based RDF browser,current data management solution,library data,magnitude performance improvement,similar performance,Semantic Web technology,Semantic Web vision,scalable semantic web data,vertical partitioning
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要