Dbms Data Loading: An Analysis On Modern Hardware

DATA MANAGEMENT ON NEW HARDWARE(2016)

引用 15|浏览163
暂无评分
摘要
Data loading has traditionally been considered a "one-time deal" - an offline process out of the critical path of query execution. The architecture of DBMS is aligned with this assumption. Nevertheless, the rate in which data is produced and gathered nowadays has nullified the "one-off" assumption, and has turned data loading into a major bottleneck of the data analysis pipeline.This paper analyzes the behavior of modern DBMS in order to quantify their ability to fully exploit multicore processors and modern storage hardware during data loading. We examine multiple state-of-the-art DBMS, a variety of hardware configurations, and a combination of synthetic and real-world datasets to identify bottlenecks in the data loading process and to provide guidelines on how to accelerate data loading. Our findings show that modern DBMS are unable to saturate the available hardware resources. We therefore identify opportunities to accelerate data loading.
更多
查看译文
关键词
Hard Disk Drive, Query Execution, Solid State Drive, Post Approach, Read Pattern
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要