[UI] error loading jobs view in firefox #7698

Closed
dbachelder opened this issue Apr 13, 2020 · 14 comments

@dbachelder

dbachelder commented Apr 13, 2020

Nomad version

Nomad v0.10.5 (4eb2ca3)

Operating system and Environment details

Firefox 75.0

Issue

Can't click through to an individual job from the jobs UI (opening the job in a new tab works as expected).

This is the console after a cold load of /ui/jobs

[Screenshot: Jobs_-_Nomad]

No additional errors or network activity on click of job.

Reproduction steps

1. Load jobs UI /ui/jobs
2. See errors in console
3. Click any job and see infinite spinner

@dbachelder
Author

Similar in Chrome:

[Screenshot: Jobs_-_Nomad]

@DingoEatingFuzz
Contributor

Hi @dbachelder, I've been working to reproduce this and I found a condition that does this, but it's a bit fringe, so it may not be your condition as well.

Do you happen to run parameterized or periodic jobs? Did you or someone else using Nomad happen to purge one of them but leave the child jobs around?

@dbachelder
Author

@DingoEatingFuzz We don't have any parameterized or periodic jobs (that we are aware of).

@frederikbosch

@DingoEatingFuzz For us it started after we stopped a periodic job, at least that is my observation. The list of child jobs for that stopped periodic job contains many with the status dead. Maybe that helps.

@frederikbosch

And once we relaunched that stopped periodic job, the error was gone again.

@dbachelder
Author

@DingoEatingFuzz I stand corrected, we have a single periodic job.. but it is running normally as far as I can tell. Is there further debugging I can do on my end to help?

@DingoEatingFuzz
Contributor

My reproduction is what @frederikbosch observes here and what @gmichalec-pandora observes in #7710, which is that when a periodic or parameterized job is purged/gc'd, it orphans all child jobs. The UI then tries to fetch the parent job for the orphaned jobs, gets a 404, and goes 💥

I'm working on a fix for this, but in the meantime, manually purging dead child jobs should fix this.

Based on the way that job IDs work, if you have one periodic job that changed names, that too could lead to older children of the periodic job becoming orphaned.
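
The orphan condition described in this comment can be checked against a job list. What follows is a minimal sketch in Python, not the UI's actual logic. It assumes each entry carries `ID` and `ParentID` fields, as in the job stubs returned by Nomad's `/v1/jobs` endpoint; the function name and the sample data are hypothetical.

```python
def find_orphaned_children(jobs):
    """Return jobs whose ParentID names a job that is not in the list.

    `jobs` is a list of dicts shaped like Nomad job stubs (assumed fields:
    ID, ParentID). A missing or empty ParentID means the job is not a child
    of a periodic or parameterized job.
    """
    known_ids = {job["ID"] for job in jobs}
    return [
        job for job in jobs
        if job.get("ParentID") and job["ParentID"] not in known_ids
    ]


# Hypothetical sample: the parent "cleanup" was purged, "report" still exists.
sample = [
    {"ID": "report", "ParentID": "", "Status": "running"},
    {"ID": "report/periodic-1586736000", "ParentID": "report", "Status": "dead"},
    {"ID": "cleanup/periodic-1586739600", "ParentID": "cleanup", "Status": "dead"},
]

orphans = find_orphaned_children(sample)
print([job["ID"] for job in orphans])
```

This only covers the purged-parent case; it compares IDs within a single list, so jobs from different namespaces would need to be checked separately.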

@frederikbosch

> manually purging dead child jobs should fix this.

You mean by running nomad system gc?

@DingoEatingFuzz
Contributor

I'd start with that, and if that doesn't work you can manually delete jobs from existence with nomad stop -purge
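
As a sketch of that manual cleanup, the snippet below builds one `nomad stop -purge` invocation per job ID and only executes them when explicitly asked. The helper name and the dry-run default are assumptions for illustration, and it presumes the `nomad` binary is on the PATH and pointed at the right cluster.

```python
import subprocess


def purge_jobs(job_ids, dry_run=True):
    """Build (and optionally run) `nomad stop -purge <id>` for each job ID."""
    commands = [["nomad", "stop", "-purge", job_id] for job_id in job_ids]
    if not dry_run:
        for command in commands:
            # check=True raises CalledProcessError if nomad exits non-zero.
            subprocess.run(command, check=True)
    return commands


# Dry run: nothing is executed, the commands are only returned for review.
for command in purge_jobs(["cleanup/periodic-1586739600"]):
    print(" ".join(command))
```

Keeping the dry run as the default makes it possible to review the exact job IDs before anything is deleted, since a purge cannot be undone.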

@dbachelder
Author

Is there an easy way to find the orphans?

@dbachelder
Author

found them! it fixed our issue for now... in our prod cluster it seemed like I only had one job to clean up which was a child of a still valid periodic task.. there was one active child running, and one dead one from that last run (6 hours ago)

@frederikbosch

Thanks for the help @DingoEatingFuzz !

@DingoEatingFuzz
Contributor

This should be fixed in 0.12.1 (See the explanation here).

I'm going to close this issue to centralize the conversation about this bug in #5936.

@github-actions

github-actions bot commented Nov 4, 2022

I'm going to lock this issue because it has been closed for 120 days ⏳. This helps our maintainers find and focus on the active issues.
If you have found a problem that seems similar to this, please open a new issue and complete the issue template so we can capture all the details necessary to investigate further.

@github-actions github-actions bot locked as resolved and limited conversation to collaborators Nov 4, 2022