We also had some issues when using Tune with multiple GPUs on notebooks. I think they dont allow for "interactive environments". We fixed some bugs when training on multiple GPUs, and it should work now using scripts
f
Farzad E
02/23/2023, 4:17 PM
@Cristian (Nixtla) script didn't work for me either. I converted the same notebook to .py and while it didn't give me an error, it got stuck indefinitely. I tried with multiple different EC2 instances of 2 or 4 GPUs but the result was the same. I wait to see if the Ray's team has any ideas.