Enabling Switch Memory Management for Distributed Training with In-Network Aggregation.
INFOCOM (2023)
Keywords
distributed training, DT job schedulers, in-network aggregation, INA-empowered DT jobs, INAlloc, JCT, job completion time, nondisruptive runtime switch memory reallocation, physical switch memory, resource allocation, shared clusters, switch memory allocation, switch memory management layer