Resource allocation and aging priority-based scheduling of linear workflow applications with transient failures and selective imprecise computations

Cluster Computing (2024)

Abstract
A wide range of applications in distributed environments have a linear structure, varying priorities, and may experience transient software failures. As the computational demands of such linear workflow (LW) jobs continue to grow, their efficient, fair, and fault-tolerant resource allocation and scheduling is becoming more challenging. To address this problem, we propose a fair and efficient scheduling approach, which considers that the priorities of the jobs age with time. We jointly use this scheduling strategy with three practical routing techniques, as well as two variants of an application-directed checkpointing scheme. The first variant of this scheme incorporates imprecise computations in a selective manner, whereas the second one does not use imprecise computations at all. Our aim is to dynamically allocate and schedule LW jobs with different priorities and transient software failures in a distributed system. Through extensive experimentation, we evaluate the system performance under the considered routing methods and checkpointing schemes, utilizing various load cases and failure probabilities. The simulation results showcase the impact of selective imprecise computations on the system performance, while providing insights into how the examined routing strategies perform in each of the investigated scenarios.
Keywords
Resource allocation, Scheduling, Linear workflows, Transient failures, Aging priorities, Imprecise computations