A data science pipeline synchronisation method for edge-fog-cloud continuum.

Dante D. Sánchez-Gallegos, José Luis González Compeán,Jesús Carretero , Heidy Marisol Marín Castro

SC-W '23: Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, Network, Storage, and Analysis(2023)

引用 0|浏览3
暂无评分
摘要
This paper presents an adaptive data delivery method for data science pipelines. While this method is feasible for processes communicating over any network, in this work we focus on edge-fog-cloud infrastructures. In a diagnostic phase, a model based on the Bernoulli principle is used to create a representation of bottlenecks in a pipeline. In a supervision phase, a watchman/sentinel cooperative system monitors the throughput of the pipeline stages to create a bottleneck-stage scheme. In a rectification phase, this system produces replicas of bottlenecks stages, mitigating the workload congestion using implicit parallelism and load balancing algorithms. This method is automatically and transparently invoked to produce a steady continuum dataflow. To test our proposal, we conducted a case study about the processing of medical and satellite data. The evaluation revealed that this method creates continuum dataflows, without neither characterising workloads nor knowing infrastructure details, which yields a competitive performance with state-of-the-art solutions.
更多
查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要