[k8s] how to get the job status change event #3775

Open
shengkaixuan opened this issue Jul 23, 2024 · 5 comments
Comments

@shengkaixuan

SkyPilot is a great project. I run it on k8s. How can I be alerted when a job's status changes, for example when it changes to FAILED?
```
Fetching and parsing job queue...

Job queue of cluster llm
ID  NAME  SUBMITTED  STARTED   DURATION    RESOURCES             STATUS     LOG
3   -     1 hr ago   1 hr ago  1h 25m 37s  1x[QUADRO-RTX8000:1]  RUNNING    ~/sky_logs/sky-2024-07-23-16-08-46-454657
2   -     1 hr ago   1 hr ago  10m 21s     1x[QUADRO-RTX8000:1]  SUCCEEDED  ~/sky_logs/sky-2024-07-23-15-44-46-107347
1   -     1 hr ago   1 hr ago  4m 48s      1x[QUADRO-RTX8000:1]  FAILED     ~/sky_logs/sky-2024-07-23-15-34-45-318133
```
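
For reference, the STATUS column in output like the above can be pulled out with a few lines of Python. This is only a rough sketch: `parse_job_statuses` and `KNOWN_STATUSES` are hypothetical names, not part of SkyPilot, and the parser assumes the layout shown here (a numeric ID first, the status as a standalone uppercase token). Only the three statuses visible in this output are listed; any others would need to be added.

```python
# Statuses visible in the job queue output above. Assumption: the status is a
# standalone uppercase token on each job row. Extend this set as needed.
KNOWN_STATUSES = {"RUNNING", "SUCCEEDED", "FAILED"}


def parse_job_statuses(queue_output: str) -> dict[int, str]:
    """Map job ID -> status for each job row in job-queue-style text."""
    statuses: dict[int, str] = {}
    for line in queue_output.splitlines():
        tokens = line.split()
        # Job rows start with a numeric ID; skip headers and blank lines.
        if not tokens or not tokens[0].isdigit():
            continue
        # Take the first token after the ID that is a known status name.
        for token in tokens[1:]:
            if token in KNOWN_STATUSES:
                statuses[int(tokens[0])] = token
                break
    return statuses
```

A job whose name happens to equal a status string would confuse this parser, so it should be treated as a stopgap rather than a stable interface.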

@concretevitamin
Member

@shengkaixuan Just to understand the use case better, are you trying to programmatically alert the user via email, Slack, a custom dashboard, or some other system?

@shengkaixuan
Author

I just want an API that sends the event to a custom messaging system.
By the way, will SkyPilot offer a web UI in the future to show all the tasks?

@concretevitamin
Member

Check out #2106 from @MaoZiming
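
Separately from #2106, a generic way to get status-change events is to poll the job queue and compare consecutive snapshots. The sketch below is not SkyPilot API: `diff_statuses`, `watch`, `fetch`, and `notify` are hypothetical names. It assumes `fetch` returns a mapping of job ID to status string (however you obtain it) and `notify` forwards one change to your own messaging system.

```python
import time
from typing import Callable, Dict, List, Optional, Tuple

Snapshot = Dict[int, str]       # job_id -> status
Change = Tuple[int, str, str]   # (job_id, old_status, new_status)


def diff_statuses(previous: Snapshot, current: Snapshot) -> List[Change]:
    """Return the jobs whose status differs between two snapshots."""
    changes: List[Change] = []
    for job_id, new_status in sorted(current.items()):
        # Jobs not seen before are reported as coming from "UNKNOWN".
        old_status = previous.get(job_id, "UNKNOWN")
        if old_status != new_status:
            changes.append((job_id, old_status, new_status))
    return changes


def watch(
    fetch: Callable[[], Snapshot],
    notify: Callable[[int, str, str], None],
    interval_s: float = 30.0,
    max_polls: Optional[int] = None,
    sleep: Callable[[float], None] = time.sleep,
) -> None:
    """Poll `fetch` and call `notify` once per status change."""
    previous = fetch()
    polls = 0
    while max_polls is None or polls < max_polls:
        sleep(interval_s)
        current = fetch()
        for job_id, old_status, new_status in diff_statuses(previous, current):
            notify(job_id, old_status, new_status)
        previous = current
        polls += 1
```

For example, `notify` could filter on `new_status == "FAILED"` and post to a webhook. Polling only sees the state at each interval, so a job that passes through several statuses between two polls is reported as a single change.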

@shengkaixuan
Author

Thank you. It would be better to put it in the docs.
Where can I find all the predefined environment variables?
I don't think the docs cover all of them: https://skypilot.readthedocs.io/en/latest/running-jobs/environment-variables.html#skypilot-environment-variables

@cblmemo
Collaborator

cblmemo commented Sep 4, 2024

> Thank you. It would be better to put it in the docs. Where can I find all the predefined environment variables? I don't think the docs cover all of them: https://skypilot.readthedocs.io/en/latest/running-jobs/environment-variables.html#skypilot-environment-variables

Hi @shengkaixuan! Could you elaborate on which environment variables you find missing?

@Michaelvll changed the title [k8s] how to get the job sutatus change event [k8s] how to get the job sutatus change event Dec 19, 2024
@Michaelvll added the OSS label Dec 19, 2024 (with Linear)
@Michaelvll removed the OSS label Dec 19, 2024