Description: Learn how to distribute PyTorch machine learning tasks across multiple GPUs on multiple nodes to significantly increase data throughput. This seminar will cover everything needed to convert a standard PyTorch model into a fully GPU-parallel model and to run the resulting scripts on HPCMP machines. Additional optimizations for reducing redundant setup, increasing device utilization, and managing memory usage will also be discussed.
Presenter(s): Dr. Calvin Anderson, GDIT/PET
Location: Webcast
Date & Time: November 3, 2022, 2:00p - 3:00p ET
Controlled by: DoD HPCMP
Controlled by: PET Program
CUI Category: OPSEC
Limited Dissemination Control: FEDCON
POC: Mr. Ronald Hedgepeth, pet@hpc.mil
CUI





