Scaling

Training Models at Scale

I gave a tutorial on distributed training strategies for large-scale models.