PapersNetwork
PN
An empirical analysis of compute-optimal large language model training
Jordan Hoffmann
Sebastian Borgeaud
Arthur Mensch
Elena Buchatskaya
Trevor Cai
Eliza Rutherford
Diego de las Casas
Lisa Anne Hendricks
Johannes Welbl
Aidan Clark
Tom Hennigan
Eric Noland
Katherine Millican
George van den Driessche
Bogdan Damoc
Aurelia Guy
Simon Osindero
Karen Simonyan
Erich Elsen
Oriol Vinyals
Jack William Rae
Laurent Sifre
NeurIPS
• 2022
Share
Twitter
LinkedIn
Copy Link