You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Milestone: Integrate JAX in Kubeflow Training Operator
Description
This milestone tracks the progress of integrating JAX into the Kubeflow Training Operator to enable distributed training and fine-tuning jobs on Kubernetes. This involves leveraging the JAX jax.distributed.initialize API and utilizing Kubernetes JobSet API for managing job lifecycle.
Checklist
Review JAX documentation and distributed training requirements.
Review Kubeflow Training Operator and JobSet API documentation.
sandipanpanda
changed the title
Tracking Issue: Integrate JAX in Kubeflow Training Operator
[GSOC] Tracking Issue: Integrate JAX in Kubeflow Training Operator
Jul 10, 2024
Milestone: Integrate JAX in Kubeflow Training Operator
Description
This milestone tracks the progress of integrating JAX into the Kubeflow Training Operator to enable distributed training and fine-tuning jobs on Kubernetes. This involves leveraging the JAX
jax.distributed.initialize
API and utilizing Kubernetes JobSet API for managing job lifecycle.Checklist
JaxJob
).JaxJob
resources.JAXJob
jax.distributed.initialize
.JaxJob
resources.Milestone Due Date
TBD
Assignees
@sandipanpanda
/area gsoc
The text was updated successfully, but these errors were encountered: