You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I submitted a Katib job. Some of the trials ended up launching trials for which the corresponding training job reached the backoff limit and thus will never succeed.
Yet the trial remains in the running state.
Here's the job spec. I elided some details but left the status to show the job is in the failed state.
/kind bug
I submitted a Katib job. Some of the trials ended up launching trials for which the corresponding training job reached the backoff limit and thus will never succeed.
Yet the trial remains in the running state.
Here's the job spec. I elided some details but left the status to show the job is in the failed state.
Here's the corresponding trial
So the status is stuck in running state but it should be marked as failed since the job will never succeed.
The text was updated successfully, but these errors were encountered: