Prometheus interaction #7

vyzigold · 2023-07-26T09:16:00Z

This PR adds initial functionality to interact with prometheus.

There are 2 parts to the client. First adds a library to query prometheus from python, this can be used from a future aodh prometheus evaluator. The second part uses this to interact with prometheus from cli. Examples of both can be seen in the README. Kudos to @paramite for the prometheus_client.py

There are a few questions regarding the future development:

Which cli commands (if any) do we want to support?
Right now I added the list, show and query commands. I think it might be useful to replicate at least some of the basic openstack metric commands. Only the query command uses the PrometheusAPIClient right now. If we want to keep list and show, it'll need to be slightly modified. (list and show use the prometheus-api-client library, which we decided not to use because of packaging difficulties)
Where do we want to enforce the RBAC?
There is a class PrometheusRBAC prepared in prometheus_client.py. But I think it might be better to decouple openstack specific logic from prometheus_client.py. We could put the code for that into QueryManager.query, where it is right now (although it would need to be a bit more robust).
Where do we get the host and port for prometheus?
It seems to me like the other openstack plugins just ask keystone for this information, but keystone doesn't know anything about prometheus. It could be specified as a parameter by whoever is creating the client from the python library. In case of cli, we could add --host and --port args. Right now it's hardcoded to 127.0.0.1:9090

observabilityclient/v1/client.py

setup.cfg

observabilityclient/utils/metric_utils.py

paramite · 2023-07-26T13:11:49Z

This is definitely great start. We can discuss the RBAC, but having API client free of OpenStack specific logic sounds fine to me.

README.md

observabilityclient/plugin.py

observabilityclient/v1/client.py

observabilityclient/v1/cli.py

vyzigold · 2023-07-27T08:54:57Z

As was discussed on our meeting, I will:

Rename the command to be "openstack metric" to provide continuity with gnocchi client.
Add cli commands, for start I'll add create, delete, list, show and I'll leave the query command.
Implement the RBAC injection (I have a new idea how to do it). I'll keep it outside of PrometheusAPIClient to keep PrometheusAPIClient free of openstack specifics.
Get the prometheus url configuration from /etc/openstack or from env variables.

This commit adds: - commands: delete clear-tombstones snapshot - Better rbac injection as well as a possibility to disable rbac. - Configuration of prometheus_client through env variables and /etc/openstack/prometheus.yaml It also does some further cleanup. It makes the list and show commands use our prometheus client in prometheus_client.py

vyzigold · 2023-07-31T14:25:58Z

I incorporated the changes we talked about. The changes include mainly:

- commands:
    delete
    clear-tombstones
    snapshot
- Better rbac injection as well as a possibility
  to disable rbac.
- Configuration of prometheus_client through
  env variables and /etc/openstack/prometheus.yaml
- Renaming of the command to openstack metric *

It also does some further cleanup. It makes the list and
show commands use our prometheus client in prometheus_client.py

For the RBAC I'm including a label {project_id='some_id'} after each metric name in each query. The PromQL is quite specific about where labels can be placed, so before injecting the RBAC into the user's query, I'm querying prometheus to find out all of the metric names, so that I can find them inside the query. If we don't want to do this additional query we would probably need to restrict what kind of user queries we support or implement a proper PromQL parser (one can be seen here).

During the development, I encountered a few things, on which I'd like to know your opinion.

Can I use the pyyaml library used in metric_utils.py?
In metric_utils.py, I hardcoded the path to a file, which should contain information about how to connect to prometheus to "/etc/openstack/prometheus.yaml". Are we ok with it staying hardcoded like that?
I added the option to disable rbac to all of the commands/functions. Should everybody be able to use that option freely or should we restrict it somehow?
What do we do about the "query" and "show" commands? Originally I meant the query command to accept any PromQL query and display the result. This was supposed to be the command, which would be later used for autoscalling. The show command was supposed to do something similar to gnocchi plugin's show command - display a value of some metric. So the user is meant to write a metric name as an argument and it'll display all current values of the metric. The thing is, that right now both of the commands do really similar things. "query" is basically just an extension of "show". If you take any "show" command and replace "show" by "query", you'll get exactly the same result. I'd say we could just delete the current implementation of "show" command and replace it by "query".
Do we want to somehow restrict access to the admin endpoints? I know anybody can just curl prometheus directly, but maybe we might not want to make it so easy for anybody to just delete metrics.
Inspired by the prometheus-api-client library I added an ability to specify additional labels separately to a query. The command looks something like this: openstack metric query somequery --label="job='prometheus'". Does this seem useful or just confusing?

vyzigold · 2023-08-01T05:20:32Z

I also wanted to add a "create" command similarly to the gnocchi client, but there isn't an endpoint in prometheus for creating metrics.

paramite

Few points for discussion, but otherwise this is legit.

observabilityclient/v1/python_api.py

README.md

observabilityclient/prometheus_client.py

paramite · 2023-08-01T11:27:14Z

observabilityclient/v1/rbac.py

+
+    def _enrich_labels(self, labels, disable_rbac):
+        if not self.rbac_init_successful and not disable_rbac:
+            raise ObservabilityRbacError("Unauthorized. Couldn't "


Wouldn't it be better to fail in constructor rather than later in the process when enriching is used?

This is what I initially implemented, but then I figured, that there is a possibility to disable rbac with each function/command. So even if the constructor can't find the project_id, you should still be able to do openstack metric list --disable-rbac. That's why I moved the exception from the constructor to here.

Now, that I know, that even the openstack command is able to log, I'll at least log a warning in the constructor.

paramite · 2023-08-01T11:30:17Z

observabilityclient/v1/rbac.py

+        return labels
+
+    # TODO aren't the additional labels just making
+    #      the code confusing? Are they useful?


My 2cents: Maybe and No. User can construct whatever query with various labels in it, so I personally don't see a point of having additional parameters for additional parameters.

I agree, I removed the code

paramite · 2023-08-01T12:18:57Z

Can I use the pyyaml library used in metric_utils.py?

It seems to me that this library is not available downstream (at least it's not obvious from list of Brew builds). Checking on OSP deployment @lnatapov has on seal31 it confirms that we don't have that package (no python-pyyaml nor python3-pyyaml nor PyYAML). So if you can't achieve same with standard YAML lib, we would need to package and ship it ourselves.

In metric_utils.py, I hardcoded the path to a file, which should contain information about how to connect to prometheus to "/etc/openstack/prometheus.yaml". Are we ok with it staying hardcoded like that?

Hardcoding paths in case of configuration is fine, but I would add more option and ideally consistent options with the rest of the client: https://github.com/openstack/python-openstackclient/blob/7ea78b6ef65481c8e97bac959b4f11e3ecae8a3e/doc/source/cli/man/openstack.rst#config-files

I added the option to disable rbac to all of the commands/functions. Should everybody be able to use that option freely or should we restrict it somehow?

That is a good question. If we are sure enough that the query enriching mechanism allows most of the queries (which to me it seems so), I would say that only admin users should be able to fetch metrics of any project.

What do we do about the "query" and "show" commands? Originally I meant the query command to accept any PromQL query and display the result. This was supposed to be the command, which would be later used for autoscalling. The show command was supposed to do something similar to gnocchi plugin's show command - display a value of some metric. So the user is meant to write a metric name as an argument and it'll display all current values of the metric. The thing is, that right now both of the commands do really similar things. "query" is basically just an extension of "show". If you take any "show" command and replace "show" by "query", you'll get exactly the same result. I'd say we could just delete the current implementation of "show" command and replace it by "query".

If we want to have an outupt of show command consistent with the past, then I would keep it as it is now (eg. accepting just metric name). Having a query command to display something like max_over_time(ceilometer_image_size{resource='abcd'}[15m]) for example shoud be a purpose of query command and indeed will be used for autoscaling.

Do we want to somehow restrict access to the admin endpoints? I know anybody can just curl prometheus directly, but maybe we might not want to make it so easy for anybody to just delete metrics.

By admin users maybe, yes.

Inspired by the prometheus-api-client library I added an ability to specify additional labels separately to a query. The command looks something like this: openstack metric query somequery --label="job='prometheus'". Does this seem useful or just confusing?

As I wrote in comments, this seems unnecessary to me.

vyzigold · 2023-08-01T13:31:33Z

I don't think there is a standard yaml lib in python (if there is, please correct me). But I don't think we need to package it just because of this. If the config file doesn't get much more complex, it should be pretty easy to just parse it by ourselves. Using regexes, it's probably just a few lines of code.

paramite · 2023-08-02T11:36:14Z

I don't think there is a standard yaml lib in python (if there is, please correct me).

YAML library is being distributed in two forms. One is a C-binding to libyaml which usually is part of Python distribution and then you can have pure Python implementation which we would have to package. Not that I would know that fact yesterday, but I never had to install additional package to be able to import yaml, and that is what I meant by 'standard' library. I thought that PyYAML you were mentioning yesterday can do something extra. So yeah, you can use PyYAML ;).

vyzigold · 2023-08-03T07:39:42Z

I've implemented Martin's comments.

I added proper logging to PrometheusAPIClient
I removed all the pieces of code related to the "additional labels"
I modified the show query as suggested by Martin
I added a possibility for multiple locations of the prometheus.yaml

I also discovered, that label values can be any unicode character, which would break the current rbac implementation if "}" is a part of a label value. Fix for that will follow shortly.

vyzigold · 2023-08-03T10:09:08Z

I added a support for unicode label values, which wouldn't work before. The rbac is getting quite complex. What I tried seemed to work. I'll come up with some extensive unit tests to make sure it really does what it should.

I'm leaving for 2 weeks vacation. I feel like I implemented most of what Martin noted, and there aren't any other reviews here right now. My suggestion is to merge this (unless somebody sees something awful here), so that Martin can start using it in his aodh evaluator. Please leave notes/reviews here, or somewhere else. I'm planning to take a look at them when I'm back and open another PR with them.

jlarriba

My review was centered around using a better command than "observabilityclient" from the cli. As we agreed on the meeting, re-use "metrics" is perfectly fit.

vyzigold added 2 commits July 24, 2023 05:03

Remove old observability client

c41a7d5

Add initial functionality for prometheus querying

b9dad40

vyzigold requested a review from paramite July 26, 2023 09:16

paramite reviewed Jul 26, 2023

View reviewed changes

observabilityclient/v1/client.py Outdated Show resolved Hide resolved

setup.cfg Outdated Show resolved Hide resolved

observabilityclient/utils/metric_utils.py Outdated Show resolved Hide resolved

Fix a copy-paste error in get_client()

50cb762

paramite requested review from jlarriba and yadneshk July 26, 2023 16:56

jlarriba suggested changes Jul 27, 2023

View reviewed changes

README.md Outdated Show resolved Hide resolved

README.md Show resolved Hide resolved

README.md Show resolved Hide resolved

observabilityclient/plugin.py Show resolved Hide resolved

observabilityclient/v1/client.py Outdated Show resolved Hide resolved

yadneshk reviewed Jul 27, 2023

View reviewed changes

observabilityclient/v1/cli.py Outdated Show resolved Hide resolved

vyzigold marked this pull request as draft July 27, 2023 08:59

vyzigold added 2 commits July 31, 2023 09:21

Make README up to date

3bf52bf

vyzigold requested review from paramite, yadneshk and jlarriba July 31, 2023 14:26

vyzigold marked this pull request as ready for review July 31, 2023 14:49

paramite reviewed Aug 1, 2023

View reviewed changes

Implement Martin's PR comments

7cd6a0c

vyzigold added 2 commits August 3, 2023 05:43

Implement better support for label values in rbac

6495189

PEP8

8e453c3

jlarriba approved these changes Aug 3, 2023

View reviewed changes

paramite approved these changes Aug 3, 2023

View reviewed changes

vyzigold merged commit a580772 into master Aug 3, 2023

vyzigold deleted the prometheus_interaction branch August 3, 2023 13:30

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Prometheus interaction #7

Prometheus interaction #7

vyzigold commented Jul 26, 2023

paramite commented Jul 26, 2023

vyzigold commented Jul 27, 2023

vyzigold commented Jul 31, 2023

vyzigold commented Aug 1, 2023

paramite left a comment

paramite Aug 1, 2023

vyzigold Aug 1, 2023

vyzigold Aug 3, 2023

paramite Aug 1, 2023

vyzigold Aug 3, 2023

paramite commented Aug 1, 2023 •

edited

Loading

vyzigold commented Aug 1, 2023

paramite commented Aug 2, 2023

vyzigold commented Aug 3, 2023

vyzigold commented Aug 3, 2023

jlarriba left a comment

Prometheus interaction #7

Prometheus interaction #7

Conversation

vyzigold commented Jul 26, 2023

paramite commented Jul 26, 2023

vyzigold commented Jul 27, 2023

vyzigold commented Jul 31, 2023

vyzigold commented Aug 1, 2023

paramite left a comment

Choose a reason for hiding this comment

paramite Aug 1, 2023

Choose a reason for hiding this comment

vyzigold Aug 1, 2023

Choose a reason for hiding this comment

vyzigold Aug 3, 2023

Choose a reason for hiding this comment

paramite Aug 1, 2023

Choose a reason for hiding this comment

vyzigold Aug 3, 2023

Choose a reason for hiding this comment

paramite commented Aug 1, 2023 • edited Loading

vyzigold commented Aug 1, 2023

paramite commented Aug 2, 2023

vyzigold commented Aug 3, 2023

vyzigold commented Aug 3, 2023

jlarriba left a comment

Choose a reason for hiding this comment

paramite commented Aug 1, 2023 •

edited

Loading