Support queue-related logic with kube-queue #1519

Closed
zw0610 opened this issue Jan 6, 2022 · 11 comments

@zw0610
Member

zw0610 commented Jan 6, 2022

In a deep learning cluster, it is common for all kinds of tasks submitted by users (like TFJob, MPIJob, Deployment, StatefulSet, etc.) to wait for resources to be allocated. Unfortunately, the Pod is the minimal scheduling unit in Kubernetes, which makes it hard to manage tasks the way cluster managers like Slurm do.

To fill this feature gap, @denkensk and I, together with other contributors, have built a new queue system for tasks on Kubernetes clusters called kube-queue. Unlike the queue in Volcano, kube-queue does not hijack the creation/submission of tasks. Instead, kube-queue relies on the operator of each task API (like TFJob, MPIJob) to wait until a clear ready-to-go signal is confirmed by kube-queue and delivered to the task itself via an annotation on the CR.

We'd like to integrate kube-queue with training-operator, which requires minimal changes to the Reconcile method:

import (
    "time"
    ...
    queuev1alpha1 "github.com/kube-queue/pkg/apis/scheduling/v1alpha1"
    ...
)

func (r *XXJobReconciler) Reconcile(ctx context.Context, req ctrl.Request) (ctrl.Result, error) {
    ...
    // If kube-queue has not released the job yet, requeue and check again later.
    if queuev1alpha1.JobSuspended(job) {
        logger.Info("job suspended by kube-queue")
        return ctrl.Result{RequeueAfter: 10 * time.Second}, nil
    }
    ...
}
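For readers less familiar with the annotation-based hand-off described above, here is a rough sketch of what such a check could look like on the operator side. This is only an illustration: the real check is the JobSuspended helper imported above, and the annotation key kube-queue.io/suspend used below is an assumption for demonstration, not the actual kube-queue API.

package controller

import (
    metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
)

// jobSuspendedByAnnotation is an illustrative stand-in for queuev1alpha1.JobSuspended.
// The annotation key is hypothetical; kube-queue defines the real contract.
func jobSuspendedByAnnotation(obj metav1.Object) bool {
    annotations := obj.GetAnnotations()
    if annotations == nil {
        return false
    }
    // kube-queue would remove or flip this annotation once the job is allowed to run.
    return annotations["kube-queue.io/suspend"] == "true"
}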

Certainly, such logic can be turned on and off via a launch argument of training-operator, as sketched below.
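As a rough illustration of that toggle, assuming a hypothetical --enable-kube-queue flag (the actual argument name and wiring would be decided during implementation):

package main

import "flag"

// Hypothetical command-line toggle; the real training-operator flag may be named differently.
var enableKubeQueue = flag.Bool("enable-kube-queue", false,
    "If true, respect kube-queue suspension signals before reconciling jobs.")

func main() {
    flag.Parse()
    // ... set up the controller manager and pass *enableKubeQueue into the job
    // reconcilers, which would then guard the check shown above with:
    //     if *enableKubeQueue && queuev1alpha1.JobSuspended(job) { ... }
}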

The kube-queue proposal has been submitted to Kubernetes wg-batch and is pending further discussion, and the implementation is already managing thousands of tasks within Alibaba and Baidu.

@terrytangyuan
Member

kube-queue automates and optimizes workload and resource quota management to maximize cluster resource utilization.

Are there any experimental results to support this?

@denkensk
Member

Are there any experimental results to support this?

Sorry for the late reply. The improvement in cluster resource utilization depends on cluster size, tenant division, and the types of workloads. According to actual statistics, there is a 5%~30% improvement after adopting Kube-queue together with a reasonable quota management system. @terrytangyuan

@alculquicondor

@denkensk do you mind if we repurpose this issue for kueue? :)

ref kubernetes-sigs/kueue#65

@alculquicondor

cc @tenzen-y, as I see you are involved in both Kubeflow and Kueue :)

@tenzen-y
Member

tenzen-y commented Jan 5, 2023

@alculquicondor Thanks for the cc.
Yes, I am aiming to eventually support Kueue in both training-operator and mpi-operator.
So we need to work on kubeflow/common#196.

@KunWuLuan

Hi! Is there any further progress?
I think suspend semantics are also needed for other workload types, for both kube-queue and Kueue.
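For reference, suspend semantics in batch/v1 Job let the controller skip creating pods while .spec.suspend is true, and an external queue flips the field when the job may start. A hedged sketch of how the same idea could look inside a Kubeflow job reconciler follows; the RunPolicy.Suspend field name is an assumption about the eventual API, not the final design:

    // Sketch only: modeled on batch/v1 Job's .spec.suspend; the RunPolicy.Suspend
    // field name for Kubeflow jobs is assumed here and may differ in the final API.
    if job.Spec.RunPolicy.Suspend != nil && *job.Spec.RunPolicy.Suspend {
        // While suspended, avoid creating (or delete) the job's pods and record a
        // Suspended condition, so a queue controller can decide when to resume it.
        logger.Info("job is suspended; skipping pod creation")
        return ctrl.Result{}, nil
    }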

@tenzen-y
Member

@KunWuLuan We don't have any progress yet. Before we move the suspend feature forward, we need to work on #1714.

@tenzen-y
Member

I'll work on this issue after #1809 is completed.

/assign

@tenzen-y
Member

tenzen-y commented Jul 5, 2023

I have just started working on this implementation.

@tenzen-y
Member

tenzen-y commented Aug 7, 2023

Completed: #1859
/close

@google-oss-prow

@tenzen-y: Closing this issue.

In response to this:

Completed: #1859
/close

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.
