Chrome Extension
WeChat Mini Program
Use on ChatGLM

ElasticBatch: A Learning-Augmented Elastic Scheduling System for Batch Inference on MIG

IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS(2024)

Cited 0|Views22
No score
Key words
Graphics processing units,Dynamic scheduling,Throughput,Processor scheduling,Pipelines,Costs,Quality of service,MIG,batch inference,scheduling system,machine learning
AI Read Science
Must-Reading Tree
Example
Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined