Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Provide telemetry about allocated / available resources #2191

Closed
stew opened this issue Jan 12, 2017 · 11 comments
Closed

Provide telemetry about allocated / available resources #2191

stew opened this issue Jan 12, 2017 · 11 comments
Labels
theme/api HTTP API and SDK issues type/enhancement

Comments

@stew
Copy link

stew commented Jan 12, 2017

Nomad version

0.5.2

I'm wanting to collect telemetry about how much of my cluster resources have already been allocated, and I don't see any current telemetry which allows me to figure out stuff like "how much memory does my cluster have unallocated" "how much cpu do I have unallocated" "what is the most amount of unallocated memory on any client currently", which is telemetry I feel like I need in order to know when I should be scaling my cluster up.

@OferE
Copy link

OferE commented Jan 13, 2017

If i understand correctly, you can calculate it for CPU, Memory and disk space easily by using the following two:
https://www.nomadproject.io/docs/http/client-stats.html
https://www.nomadproject.io/docs/http/client-allocation-stats.html

@diptanu diptanu added theme/api HTTP API and SDK issues type/enhancement labels Jan 13, 2017
@dadgar
Copy link
Contributor

dadgar commented Jan 13, 2017

@stew Are you asking for each client to report its allocated and available resources? You could then aggregate for a cluster wide view

@stew
Copy link
Author

stew commented Jan 13, 2017

@dadgar that would be ideal, yeah. Does it make sense?

@stew
Copy link
Author

stew commented Jan 13, 2017

@OferE yeah, and you see exactly the stats I'm looking for when you look at the output of nomad node-status $nodeid. I just want them to go out through the statsd telemetry which is already wired into our monitoring systems.

@dadgar
Copy link
Contributor

dadgar commented Jan 14, 2017

@stew Yep I was just clarifying to make sure we add the right thing

@dadgar dadgar added this to the v0.5.4 milestone Jan 14, 2017
@dadgar
Copy link
Contributor

dadgar commented Jan 14, 2017

@stew Do these stats not work for you: https://github.com/hashicorp/nomad/blob/master/client/client.go#L2147. They don't take into consideration reserved however

@multani
Copy link
Contributor

multani commented Jan 16, 2017

Adding reserved stats would be cool yes!

@stew
Copy link
Author

stew commented Jan 16, 2017

@dadgar yeah, I do have multiple ways of getting at that stat that work, I guess the reserved stuff is what I'm really after. I want to build stats to help me figure out when I'm close to expecting allocations to get blocked due to a lack of resources.

@burdandrei
Copy link
Contributor

This is what I'm looking for to prevent cluster reservation saturation, maybe AutoScale cluster when less then threshold resources left

@burdandrei
Copy link
Contributor

here is more telemetry per task #2330

@github-actions
Copy link

I'm going to lock this issue because it has been closed for 120 days ⏳. This helps our maintainers find and focus on the active issues.
If you have found a problem that seems similar to this, please open a new issue and complete the issue template so we can capture all the details necessary to investigate further.

@github-actions github-actions bot locked as resolved and limited conversation to collaborators Dec 15, 2022
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
theme/api HTTP API and SDK issues type/enhancement
Projects
None yet
Development

No branches or pull requests

6 participants