Jethro's Braindump
Fast Neural Network Training
Links to this note
Gpipe
LARS Optimizer