DLS: A Fast and Flexible Neural Network Training System with Fine-grained Heterogeneous Device Orchestration
IEEE Transactions on Parallel and Distributed Systems(2022)
关键词
Artificial neural networks,Training,Performance evaluation,Field programmable gate arrays,Device-to-device communication,Task analysis,Predictive models,Neural network,accelerators,GPU,FPGA,flexibility,heterogeneous devices,inter-device communication
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要