Multi-GPU Training with Accelerate
ML & AI Pathway - Multi-GPU Training with Accelerate
Date: April 22, 2025
Time: 1-3 PM ET / 12-2 PM CT / 10 AM - 12 PM PT
Description: Accelerate, HuggingFace's library for efficient multi-GPU training enables practitioners to scale their AI/ML applications effectively. This course will explore advanced training techniques for deep learning models, specifically targeting the ResNet50 architecture. Participants will gain a comprehensive understanding of the architecture and the significance of residual connections in deep neural networks. Additionally, the course will cover scheduling strategies to optimize learning rates during training, Weights & Biases (W&B) use for tracking metrics and experiments, and methods for saving intermittent checkpoints and resuming training.