[question] Topology dashboard overcounting allocations and reserved resources. #9800

Closed
caiohcl opened this issue Jan 13, 2021 · 5 comments · Fixed by #9886

Comments

@caiohcl

caiohcl commented Jan 13, 2021

Hi,

I recently updated Nomad to v1.0.1 and something caught my attention. In the topology dashboard, "Cluster Details" shows that 151% of memory is currently in use. I also noticed that the Alloc count differs from what the graph shows.

For example, here it shows that client 046 has 7 Allocs, but the graph shows only 2:

image

As far as I can tell, the Topology view also counts "completed" allocations when computing the Cluster Details. Am I missing some configuration, or is this the intended behavior?

Thank you!

@idrennanvmware
Contributor

Curious - do you happen to have any prestart tasks?

@caiohcl
Author

caiohcl commented Jan 13, 2021

Hi @idrennanvmware. No, I'm not using any prestart task.

@manveru
Contributor

manveru commented Jan 20, 2021

The same has been happening to me across multiple clusters since the 1.0 release candidates. We also have a high number of dead allocations taking up memory and space on the Nomad server nodes until we manually GC them and restart the nodes.

@DingoEatingFuzz
Contributor

Hi @caiohcl, as @manveru is suggesting, this is due to terminal allocations. From the scheduler's point of view, these allocations don't actually reserve resources, which is how you're able to get to "151%" of reserved memory used.

This is definitely a bug and it's naturally where I didn't write tests 😂 😭 ☠️

// TODO: Once we settle on the contents of the info panel, the contents
// should also get acceptance tests.
module('Acceptance | topology', function(hooks) {
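
To illustrate the overcounting described above, here is a minimal sketch, not the actual Nomad UI code. It assumes a simplified allocation shape with hypothetical `clientStatus` and `memoryMiB` fields, and it assumes that `complete`, `failed`, and `lost` are the terminal client statuses. Summing every allocation inflates both the reserved memory and the Alloc count, while skipping terminal allocations matches what the scheduler actually has reserved.

```javascript
// Assumption: these client statuses are treated as terminal.
const TERMINAL_STATUSES = new Set(['complete', 'failed', 'lost']);

// Hypothetical helper: true when the allocation no longer reserves resources.
function isTerminal(alloc) {
  return TERMINAL_STATUSES.has(alloc.clientStatus);
}

// Allocations that still count against the node's capacity.
function scheduledAllocations(allocations) {
  return allocations.filter((alloc) => !isTerminal(alloc));
}

// Number of allocations that should be reported for the node.
function scheduledCount(allocations) {
  return scheduledAllocations(allocations).length;
}

// Reserved memory in MiB, ignoring terminal allocations.
function reservedMemory(allocations) {
  return scheduledAllocations(allocations).reduce(
    (total, alloc) => total + alloc.memoryMiB,
    0
  );
}

// Percentage of the node's memory that is reserved.
function percentReserved(allocations, nodeMemoryMiB) {
  return (reservedMemory(allocations) / nodeMemoryMiB) * 100;
}

// Illustrative data only (hypothetical field names and values).
const allocations = [
  { id: 'a1', clientStatus: 'running', memoryMiB: 512 },
  { id: 'a2', clientStatus: 'running', memoryMiB: 256 },
  { id: 'a3', clientStatus: 'complete', memoryMiB: 512 },
  { id: 'a4', clientStatus: 'failed', memoryMiB: 256 },
];

// Naive sum over all allocations, terminal ones included.
const naiveTotal = allocations.reduce((t, a) => t + a.memoryMiB, 0);

console.log((naiveTotal / 1024) * 100); // 150
console.log(percentReserved(allocations, 1024)); // 75
console.log(allocations.length, scheduledCount(allocations)); // 4 2
```

With the terminal allocations included, a 1024 MiB node appears to be 150% reserved with 4 allocations; with them filtered out, it is 75% reserved with 2.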

@DingoEatingFuzz changed the title from "[question] Topology dashboard" to "[question] Topology dashboard overcounting allocations and reserved resources." on Jan 25, 2021
@github-actions

I'm going to lock this issue because it has been closed for 120 days ⏳. This helps our maintainers find and focus on the active issues.
If you have found a problem that seems similar to this, please open a new issue and complete the issue template so we can capture all the details necessary to investigate further.

@github-actions github-actions bot locked as resolved and limited conversation to collaborators Oct 24, 2022