ElasticBatch: A Learning-Augmented Elastic Scheduling System for Batch Inference on MIG
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS(2024)
Key words
Graphics processing units,Dynamic scheduling,Throughput,Processor scheduling,Pipelines,Costs,Quality of service,MIG,batch inference,scheduling system,machine learning
AI Read Science
Must-Reading Tree
Example
Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined