Taming Latency In Data Center Networking With Erasure Coded Files

2015 15TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING (2015)

Abstract
This paper proposes an approach to minimize service latency in a data center network where erasure-coded files are stored on distributed disks/racks and access requests are scattered across the network. Because bandwidth is limited at both top-of-the-rack and aggregation switches, network bandwidth must be apportioned among the different intra- and inter-rack data flows in line with their traffic statistics. We formulate this problem as weighted queuing and employ a class of probabilistic request scheduling policies to derive a closed-form outer bound on service latency for erasure-coded storage with arbitrary file access patterns and service time distributions. The result enables us to propose a joint latency optimization over three entangled "control knobs": the bandwidth allocation at top-of-the-rack and aggregation switches, the probabilities for scheduling file requests, and the placement of encoded file chunks, which affects data locality. The joint optimization is shown to be a mixed-integer problem. We develop an iterative algorithm that decouples the joint optimization into three sub-problems, each of which is either convex or solvable via bipartite matching in polynomial time. The proposed algorithm is prototyped in Tahoe, an open-source distributed file system, and evaluated on a cloud testbed with 16 separate physical hosts in an OpenStack cluster. Experiments validate our theoretical latency analysis and show significant latency reduction for diverse file access patterns. The results provide valuable insight into designing low-latency data center networks with erasure-coded storage.
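The idea behind probabilistic request scheduling can be illustrated with a small sketch. It assumes, as is standard for erasure codes but not stated in the abstract, that any k of the n hosts holding a file's chunks suffice to rebuild it, and that a request finishes when its slowest selected chunk arrives. The host names, fetch times, and function names below are hypothetical, and the sketch deliberately ignores queuing and bandwidth sharing, which the paper does model; it shows only how the choice of scheduling probabilities changes expected latency.

```python
from itertools import combinations


def expected_latency(fetch_time, subset_prob):
    """Expected file latency under a probabilistic scheduling policy.

    subset_prob maps each tuple of hosts to the probability that a request
    reads from exactly that set. A request completes when its slowest
    selected chunk arrives, so the latency of a set is the max fetch time.
    """
    total_prob = sum(subset_prob.values())
    if abs(total_prob - 1.0) > 1e-9:
        raise ValueError("subset probabilities must sum to 1")
    return sum(
        p * max(fetch_time[h] for h in hosts)
        for hosts, p in subset_prob.items()
    )


def uniform_policy(hosts, k):
    """Choose every k-subset of the chunk-holding hosts with equal probability."""
    subsets = list(combinations(sorted(hosts), k))
    return {s: 1.0 / len(subsets) for s in subsets}


# Hypothetical per-host chunk fetch times in milliseconds (not from the paper).
fetch_time = {"A": 10.0, "B": 20.0, "C": 40.0}

# Two candidate policies for a file readable from any 2 of its 3 hosts.
uniform = uniform_policy(fetch_time, k=2)
skewed = {("A", "B"): 0.8, ("A", "C"): 0.1, ("B", "C"): 0.1}

uniform_ms = expected_latency(fetch_time, uniform)
skewed_ms = expected_latency(fetch_time, skewed)
print(f"uniform: {uniform_ms:.2f} ms, skewed: {skewed_ms:.2f} ms")
```

With fixed fetch times, always picking the fastest hosts would be optimal. Once requests queue at shared switches, concentrating load on a few hosts raises their delay, which is why the scheduling probabilities are treated as an optimization variable rather than set greedily.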
Keywords
computer centres, distributed databases, iterative methods, open systems, pattern matching, queueing theory, OpenStack cluster, Tahoe, aggregation switches, bandwidth allocation, bipartite matching, control knobs, data center networking, distributed disks, distributed file system, encoded file chunks, erasure coded files, erasure-coded storage, file access patterns, interrack data flows, intrarack data flows, iterative algorithm, joint latency optimization, low-latency data center networks, mixed-integer problem, network bandwidth, open-source system, probabilistic request scheduling policies, service latency, service latency minimization, service time distributions, traffic statistics, weighted queuing, Erasure-coded, data center, storage