Language models scale reliably with over-training and on downstream
tasks
Samir Yitzhak Gadre,Georgios Smyrnis,Vaishaal Shankar,Suchin Gururangan,Mitchell Wortsman,Rulin Shao,Jean Mercat,Alex Fang,Jeffrey Li,Sedrick Keh,Rui Xin,Marianna Nezhurina,Igor Vasiljevic,Jenia Jitsev,Luca Soldaini,Alexandros G. Dimakis,Gabriel Ilharco,Pang Wei Koh,Shuran Song,Thomas Kollar,Yair Carmon,Achal Dave,Reinhard Heckel,Niklas Muennighoff,Ludwig Schmidt arxiv(2024)
AI 理解论文
溯源树
样例