Large-Scale Deep Learning Optimizations: A Comprehensive Survey

arXiv (2021)

Abstract
Deep learning has achieved promising results across a wide spectrum of AI applications. Larger datasets and models consistently yield better performance, but they also demand more computation and communication, and hence longer training time. In this survey, we aim to provide a clear sketch of optimizations for large-scale deep learning with respect to model accuracy and model efficiency. We investigate the algorithms most commonly used for optimization, elaborate on the debated generalization gap that arises in large-batch training, and review state-of-the-art strategies for addressing communication overhead and reducing memory footprints.
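The abstract only names these research directions; as a concrete illustration of one of them, below is a minimal PyTorch sketch of the linear learning-rate scaling rule with gradual warmup, a strategy commonly cited in the large-batch training literature (e.g., Goyal et al., 2017). The model, batch sizes, and hyperparameters here are illustrative assumptions, not values taken from the survey.

```python
import torch

# Hypothetical setup: base LR tuned for a reference batch size of 256.
base_batch_size = 256
base_lr = 0.1
large_batch_size = 8192  # scaled-up global batch (assumed for illustration)

# Linear scaling rule: grow the LR proportionally with the batch size.
scaled_lr = base_lr * large_batch_size / base_batch_size

model = torch.nn.Linear(10, 2)  # stand-in model for the sketch
optimizer = torch.optim.SGD(model.parameters(), lr=scaled_lr, momentum=0.9)

warmup_steps = 500  # gradual warmup mitigates early divergence at large LR

def lr_lambda(step: int) -> float:
    # Ramp the LR linearly from near zero up to scaled_lr, then hold it.
    return min(1.0, (step + 1) / warmup_steps)

scheduler = torch.optim.lr_scheduler.LambdaLR(optimizer, lr_lambda)

# In a training loop, call scheduler.step() once per optimizer step so the
# warmup schedule advances together with the parameter updates.
```

Warmup plus linear scaling is only one of the large-batch remedies the survey covers; it narrows, but does not by itself close, the generalization gap discussed in the abstract.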
Keywords
deep learning, large-scale