-
Notifications
You must be signed in to change notification settings - Fork 338
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
metrics: seperater metrics as internal and external for slo-controller and koordlet #1807
Conversation
Codecov ReportAttention:
Additional details and impacted files@@ Coverage Diff @@
## main #1807 +/- ##
==========================================
+ Coverage 67.23% 67.26% +0.03%
==========================================
Files 407 410 +3
Lines 45644 45662 +18
==========================================
+ Hits 30687 30715 +28
+ Misses 12741 12731 -10
Partials 2216 2216
Flags with carried forward coverage won't be shown. Click here to find out more. ☔ View full report in Codecov by Sentry. |
a403674
to
9b189d1
Compare
9b189d1
to
1f0e5ff
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Need to update the deploy templates under the /config
since the metrics paths are going to change.
/milestone v1.5 |
1f0e5ff
to
8535a51
Compare
the url path has not defined yet both in config and charts |
Is it really necessary to split it into two metrics urls? Users also need to configure two prometheus configurations for slo-controller, and this indicator must be distinguished when adding indicators in future code. Moreover, only two components, koord-manager and koordlet, are involved here. Do scheduler/descheduler also need to be followed? I don't think such a distinction should be made, and it is not a common or recommended practice. |
@eahydra Yes, it is necessary. any extended metric will be charged(ref https://help.aliyun.com/zh/ack/ack-managed-and-ack-dedicated/user-guide/dynamic-resource-overcommitment?spm=a2c4g.11186623.0.0.74d118ab3vDxaM#ef961016a2uds), and for end users they only need care about scheduler/descheduler should also split if they have the same senario in future. |
This seems to be solved through the reable_config mechanism of prometheus. Only need to configure the metrics regex and add action=keep. |
f75a5c9
to
fca8324
Compare
this is not supported for
prometheus.yaml is not configurable for end user by default in product like ACK |
It seems the external metrics are using the path |
updated |
b90ad33
to
3ad578b
Compare
Signed-off-by: 佑祎 <zzw261520@alibaba-inc.com>
3ad578b
to
f021958
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
/lgtm
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
/lgtm
/approve |
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: FillZpp, saintube, zwzhang0107 The full list of commands accepted by this bot can be found here. The pull request process is described here
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
…r and koordlet (koordinator-sh#1807) Signed-off-by: 佑祎 <zzw261520@alibaba-inc.com>
…r and koordlet (#1807) Signed-off-by: 佑祎 <zzw261520@alibaba-inc.com>
Ⅰ. Describe what this PR does
seperate koordlet/slo-controller metrics as internal and external
ExternalHTTPPath: for end users, such as batch resource util (also compatible with old URL /metircs for koordlet)
InternalHTTPPath: for cluster admin, such as performance of components.
DefaultHTTPPath: merge all internal, external and controller runtime metrics.
Ⅱ. Does this pull request fix one issue?
Ⅲ. Describe how to verify it
Ⅳ. Special notes for reviews
V. Checklist
make test