An empirical analysis of compute-optimal large language model training
Jordan Hoffmann,Sebastian Borgeaud,Arthur Mensch,Elena Buchatskaya,Trevor Cai,Eliza Rutherford,Diego de las Casas,Lisa Anne Hendricks,Johannes Welbl,Aidan Clark,Tom Hennigan,Eric Noland,Katherine Millican,George van den Driessche,Bogdan Damoc,Aurelia Guy,Simon Osindero,Karen Simonyan,Erich Elsen,Oriol Vinyals,Jack William Rae,Laurent Sifre NeurIPS 2022(2022)
关键词
NLP,Deep Learning,Large Language Models
AI 理解论文
溯源树
样例