... | ... | @@ -2,6 +2,10 @@ |

---

# Tutorials

Fahad Khalid (@khalid1): Created the [Getting started with ML/DL on Supercomputers](https://gitlab.version.fz-juelich.de/khalid1/ml_dl_on_supercomputers) tutorial, which has been tested on JURON and JURECA.

# Workflows

## Jupyter Notebooks for HPC

... | ... | @@ -18,9 +22,35 @@ tbd: Jens Henrik Goebbert @goebbert1 |

tbd: Jenia Jitsev @jitsev1

### JURECA

Fahad Khalid (@khalid1): The following ML/DL-related modules are now available on JURECA:

```
1. TensorFlow 1.12.0 (Python 2 and Python 3)
2. Keras 2.2.4 (Python 2 and Python 3)
3. PyTorch 1.0.0 (Python 2 and Python 3)
4. Horovod 0.15.2 (Python 2 and Python 3)
5. Caffe 1.0 (Python 2)
```

Many thanks to Damian Alvarez (@alvarezmallon1) and Rajalekshmi Deepu (@deepu1) for installing these modules and their many dependencies.

**Note:** There is currently an issue when running multi-node training jobs with Horovod+PyTorch.

### JURON

Fahad Khalid (@khalid1): The following ML/DL-related modules are now available on JURON:

```
1. TensorFlow 1.12.0 (Python 2 and Python 3)
2. Keras 2.2.4 (Python 2 and Python 3)
3. PyTorch 1.0.1 (Python 3)
4. Horovod 0.15.2 (Python 2 and Python 3)
5. Caffe 1.0 (Python 2 and Python 3)
```

Many thanks to Andreas Herten (@herten1) for installing these modules and their many dependencies.

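Once the modules are loaded on either system, one quick way to confirm that the frameworks listed above are visible to the interpreter is to ask Python whether each package can be located. The following is a minimal sketch, assuming Python 3; the import names (`tensorflow`, `keras`, `torch`, `horovod`, `caffe`) are the projects' conventional package names and are an assumption here, not taken from the module installation. `FRAMEWORKS` and `find_available` are hypothetical helpers introduced only for illustration.

```python
import importlib.util

# Assumed import names (conventional for each project, not confirmed
# against the installed modules on JURECA or JURON).
FRAMEWORKS = {
    "TensorFlow": "tensorflow",
    "Keras": "keras",
    "PyTorch": "torch",
    "Horovod": "horovod",
    "Caffe": "caffe",
}


def find_available(frameworks):
    """Map each display name to True if its top-level package can be located.

    The packages are only looked up, not imported, so the check stays fast
    even for large frameworks.
    """
    return {
        name: importlib.util.find_spec(module) is not None
        for name, module in frameworks.items()
    }


if __name__ == "__main__":
    for name, found in find_available(FRAMEWORKS).items():
        status = "found" if found else "not found"
        print(f"{name}: {status}")
```

A package reported as "not found" usually means the corresponding module has not been loaded in the current shell, or that it was built for a different Python version than the one running the script.
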
### PyTorch & HEAT

... | ... | |