-
Notifications
You must be signed in to change notification settings - Fork 24
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
NETOBSERV-466: Add metric to count packet number in FLP #179
NETOBSERV-466: Add metric to count packet number in FLP #179
Conversation
@OlivierCazade Also, any reason we're not adding metric for "FlowDirection: 1" ? cc @jotak - I believe this metric would also scale similarly as sent/received_bytes_total |
@memodi yes ... I'd suggest some improvements in our handling of these metrics. I will create a new JIRA for that. Basically what I'd like to do is to have some default config to not expose all these metrics. The mechanism to ignore metrics already exist, it works with the Basically I think we could only expose
Also I'd like to introduce new similar metrics that have only namespace labels, not workloads, also disabled by default. So that can be used as a mitigation solution if users see too high cardinality with metrics, putting too much pressure on prometheus & FLP, without entirely turning off telemetry: they could turn off per-workload metrics and turn on per-namespace metrics instead. As said, I'll create a new JIRA |
+1 I think we should provide that as well (it can catch some traffic not caught on ingress), however, as said above, we may want to disable it through config by default. |
Thanks for feedback!
The reason for not catching
This is true, this mean this metrics are reporting wrong value, should we remove both until we find a way to clean duplicate? May be we have to wait for connection tracking Would you be fine with splitting these metrics into four ? |
I modified current metrics tag and default ignored metrics. This PR also address part of this task : The only remaining part is about having "light" metrics with less cardinality. @memodi , the last part is renaiming the metrics. Is it fine or would it create a lot of work on your side? |
if it's absolutely desired, go ahead renaming them. thanks! |
- egress | ||
- bytes | ||
- workloads | ||
- namespaces |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I wouldn't set "namespaces" here, else if we later add namespace-only metrics later, we wouldn't be able to filter them out without impacting this one as well (or we'd need to find a trick to do it)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
well, depends how we do the namespace-only metrics ... you plan to add label-removal feature like we discussed previously, right? If so, then it's ok for me
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I plan to do the label-removal, but I am not sure that this is the priority.
I removed the namespaces label and I will add it again once the label removal task is done.
- egress | ||
- packets | ||
- workloads | ||
- namespaces |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
same here and same for the other workload metrics
valuekey: Packets | ||
filter: | ||
key: FlowDirection | ||
value: "0" |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
should be "1" for egress
/ok-to-test |
New image: ["quay.io/netobserv/network-observability-operator:f831013"]. It will expire after two weeks. |
/approve |
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: OlivierCazade The full list of commands accepted by this bot can be found here. The pull request process is described here
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
This add a metric to count packet number.
Story was to add only total packet count but I aligned this with the bytes count, and we can still sum different tag to get the total number of packets.