... | ... | @@ -4,11 +4,11 @@ |
|
|
|
|
|
# Tutorials
|
|
|
|
|
|
Fahad Khalid (@khalid1): Created the [Getting started with ML/DL on Supercomputers](https://gitlab.version.fz-juelich.de/hpc4ns/dl_on_supercomputers#getting-started-with-deep-learning-on-supercomputers) tutorial, which has been tested on JUWELS, JURECA, and JURON.
|
|
|
The [Getting started with ML/DL on Supercomputers](https://gitlab.version.fz-juelich.de/hpc4ns/dl_on_supercomputers#getting-started-with-deep-learning-on-supercomputers) tutorial, created by Fahad Khalid (@khalid1), has been tested on JUWELS, JURECA, and JURON (as of 2019).
|
|
|
|
|
|
A workshop ["Intro to Scalable Deep Learning"](https://gitlab.version.fz-juelich.de/MLDL_FZJ/juhaicu/jsc_public/sharedspace/teaching/intro_scalable_dl_2021/course-material) created by Mehdi Cherti, Jan Ebert, Alex Strube, Roshni Kamath, Stefan Kesselheim and Jenia Jitsev features a number of lectures and tutorials on distributed deep learning, including Horovod usage, on supercomputers.
|
|
|
A workshop ["Intro to Scalable Deep Learning"](https://gitlab.version.fz-juelich.de/MLDL_FZJ/juhaicu/jsc_public/sharedspace/teaching/intro_scalable_dl_2021/course-material), created by Mehdi Cherti, Jan Ebert, Alex Strube, Roshni Kamath, Stefan Kesselheim, and Jenia Jitsev, features a number of lectures and tutorials on distributed deep learning on supercomputers, including Horovod usage (as of 2021).
|
|
|
|
|
|
A short, concise tutorial on converting single GPU training code for distributed execution on multi-node supercomputers by Mehdi Cherti and Jenia Jitsev: [Horovod data parallel training tutorial](https://gitlab.version.fz-juelich.de/MLDL_FZJ/juhaicu/jsc_public/sharedspace/code/distributed_dl/-/tree/master/horovod_tutorial)
|
|
|
A concise tutorial by Mehdi Cherti and Jenia Jitsev on converting single-GPU training code for distributed execution on multi-node supercomputers: [Horovod data parallel training tutorial](https://gitlab.version.fz-juelich.de/MLDL_FZJ/juhaicu/jsc_public/sharedspace/code/distributed_dl/-/tree/master/horovod_tutorial) (as of 2020).
|
|
|
|
|
|
# Workflows
|
|
|
|
... | ... | |