Data Handling Optimization in Russian Data Lake Prototype

A Alekseev,A Kiryanov,A Klimentov,T Korchuganova,V Mitsyn,D Oleynik,A Petrosyan,S Smirnov,A Zarochentsev

Journal of Physics: Conference Series（2023）

引用 0|浏览7

暂无评分

摘要

Abstract CERN experiments are preparing for the HL-LHC era, which will bring an unprecedented volume of scientific data. These data will need to be stored and processed by thousands of physicists, but expected resource growth is nowhere near the extrapolated requirements of existing models, in terms of both storage volume and compute power. Opportunistic CPU resources such as HPCs and university clusters can provide extra CPU cycles, but there is no opportunistic storage. In this article, we will present the main architectural ideas, deployment details, and test results, with emphasis on our research to build a prototype of a distributed data processing and storage system with a focus on optimizing the efficiency of resources by reducing overhead costs for accessing the data. The described prototype was built using the geographically distributed WLCG sites and university clusters in Russia.

查看译文

关键词

data handling optimization,lake

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要