... | ... | @@ -8,6 +8,8 @@ Fahad Khalid (@khalid1): Created the [Getting started with ML/DL on Supercompute |
|
|
|
|
|
The workshop ["Intro to Scalable Deep Learning"](https://gitlab.version.fz-juelich.de/MLDL_FZJ/juhaicu/jsc_public/sharedspace/teaching/intro_scalable_dl_2021/course-material), created by Mehdi Cherti, Jan Ebert, Alex Strube, Roshni Kamath, Stefan Kesselheim and Jenia Jitsev, features lectures and tutorials on distributed deep learning on supercomputers, including Horovod usage.
|
|
|
|
|
|
A concise tutorial by Mehdi Cherti and Jenia Jitsev on converting single-GPU training code for distributed execution on multi-node supercomputers: [Horovod data parallel training tutorial](https://gitlab.version.fz-juelich.de/MLDL_FZJ/juhaicu/jsc_public/sharedspace/code/distributed_dl/-/tree/master/horovod_tutorial)
|
|
|
|
|
|
# Workflows
|
|
|
|
|
|
## Jupyter Notebooks for HPC
|
... | ... | |