You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
My guess is: every time moco is instantiated, it creates a distributed process group. Before that PR, we were instantiating it twice (one in the main process, and another in the process that was actually executing it).
🐛 Bug
python xla/benchmarks/experiment_runner.py \ --suite-name torchbench --accelerator cuda --repeat 2 \ --test eval --test train --xla None --dynamo inductor \ -k moco
Environment
@miladm @JackCaoG
The text was updated successfully, but these errors were encountered: