py: add basic service URL resolver #421

isinyaaa · 2024-09-23T19:48:32Z

Description

Adds a simple constructor that resolves cluster service URLs.

How Has This Been Tested?

Tested with the latest UI using an MR deployed directly through the settings panel.

The client then connects:

from model_registry import ModelRegistry

mr = ModelRegistry.from_service("modelregistry-sample", "isinyaaa")

and creates a new MV:

mr.register_model("test", "s3://catopia/meow.onnx", model_format_name="oooo", model_format_version="v1", version="123")

❗ IMPORTANT: to make this work with ODH deployments you need to create a clusterwide rolebinding for the current DSP service accounts, with the odh-dashboard role.

Change was reflected on the UI as expected:

Merge criteria:

All the commits have been signed-off (To pass the DCO check)

The commits have meaningful messages; the author will squash them after approval or in case of manual merges will ask to merge with squash.
Testing instructions have been added in the PR body (for PRs involving changes that are not immediately obvious).
The developer has manually tested the changes and verified that the changes work.
Code changes follow the kubeflow contribution guidelines.

If you have UI changes

The developer has added tests or explained why testing cannot be added.
Included any necessary screenshots or gifs if it was a UI change.
Verify that UI/UX changes conform to the UX guidelines for Kubeflow.

google-oss-prow · 2024-09-23T19:48:38Z

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by:
Once this PR has been reviewed and has the lgtm label, please assign tomcli for approval. For more information see the Kubernetes Code Review Process.

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these files:

OWNERS

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

isinyaaa · 2024-09-23T20:26:53Z

cc: @dhirajsb @tarilabs @Al-Pragliola

tarilabs

thank you @isinyaaa, some early 👀 for your consideration

clients/python/src/model_registry/_client.py

tarilabs · 2024-09-24T06:44:30Z

@Al-Pragliola any thoughts on potential amendments which would not require a rolebinding or anyway any changes to the permission settings?

Al-Pragliola · 2024-09-24T14:13:19Z

> @Al-Pragliola any thoughts on potential amendments which would not require a rolebinding or anyway any changes to the permission settings?

Hmm, no, I can't think of another way to let the notebook pod interact with the K8s API server. However, I think the odh-dashboard role is a bit too permissive; we should have an ad-hoc role with only the required permissions.

tarilabs · 2024-09-24T15:23:01Z

Thanks @Al-Pragliola , adding to #421 (comment), what about having some "fallbacks" in case that permission is not there? Do we believe we might have some options?

For example, would attempting to list K8s resources in a set of namespaces known to be used require simpler/lower permissions? For instance, listing MR CR instances in a set of known namespaces, in case this requires lower permissions, or listing Services, again in known namespaces, etc.?

isinyaaa · 2024-09-24T16:51:17Z

@tarilabs if we can't get the DSC, it falls back to looking for the instance in the specified namespace, or in the default one (kubeflow for upstream; this should be changed to odh-model-registries when merging midstream)
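The fallback order described in this comment could be sketched roughly as below. This is only an illustration under stated assumptions, not the code in this PR: `resolve_namespace` and `lookup_dsc_namespace` are hypothetical names, the DSC lookup is passed in as a callable so the example runs without a cluster, and it assumes a DSC-provided namespace takes precedence over a caller-specified one.

```python
from typing import Callable, Optional

DEFAULT_NAMESPACE = "kubeflow"  # upstream default mentioned in the thread


def resolve_namespace(
    lookup_dsc_namespace: Callable[[], Optional[str]],  # hypothetical hook
    namespace: Optional[str] = None,
    default: str = DEFAULT_NAMESPACE,
) -> str:
    """Pick the namespace in which to look for the registry instance.

    Assumed order: namespace discovered via the DSC, then the namespace
    given by the caller, then the default. A failing DSC lookup (for
    example, missing permissions) is treated as "nothing discovered".
    """
    try:
        discovered = lookup_dsc_namespace()
    except Exception:  # real code would catch the specific API error
        discovered = None
    return discovered or namespace or default
```

With a lookup that raises, `resolve_namespace(lookup, namespace="isinyaaa")` returns the caller's namespace, and with no namespace given it returns the default.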

tarilabs · 2024-09-24T17:18:51Z

> should be changed to odh-model-registries when merging midstream

I don't think that is viable, since there will be only one "upstream" PyPI package of the MR Python client?

Hence why I believe it would have been preferable to adopt a fallback(s) approach

clients/python/src/model_registry/_utils.py

rareddy · 2024-09-26T13:34:24Z

IMO the Python library should not use any OpenShift calls directly. That brings an additional dependency on where the MR is deployed into a client library. I suggest we feed this information through some kind of config or env to keep it simple, then think through how this config can be supplied. I am thinking of something like .kubeconfig, wdyt?

dhirajsb · 2024-09-30T19:28:29Z

Yes, the DSC namespace discovery is an odh/rhoai feature, since there is no equivalent place to discover the registries namespace in kubeflow. Although the default MR manifests kind of make the kubeflow namespace the default registry namespace.

We should keep things simple and allow this capability to discover the presence of the DSC CR in the client library. The Python client can simply ignore that service discovery path if the DSC resource type is not present and try to look up registries using the hostname provided by users.

If we want to allow the Python API to be service location agnostic, another way to do that would be by supporting an env variable for a default MR namespace, and also supporting a directly specified service host. That way it can also be injected into the notebook from a user-supplied configmap (which could be cluster/env specific) if needed.
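A minimal sketch of the env-variable idea, for illustration only. The variable names `MODEL_REGISTRY_HOST` and `MODEL_REGISTRY_NAMESPACE` and the function `resolve_host` are hypothetical and not part of the client; the fallback relies on the standard in-cluster Service DNS form `<service>.<namespace>.svc.cluster.local`, which assumes the default cluster domain.

```python
import os
from typing import Mapping, Optional


def resolve_host(
    name: str,
    env: Mapping[str, str] = os.environ,
    namespace: Optional[str] = None,
) -> str:
    """Resolve a registry host without calling the Kubernetes API."""
    # A directly specified host wins over everything else.
    direct = env.get("MODEL_REGISTRY_HOST")  # hypothetical variable name
    if direct:
        return direct
    # Otherwise combine the service name with a namespace taken from the
    # caller, the environment, or the upstream default, in that order.
    ns = namespace or env.get("MODEL_REGISTRY_NAMESPACE") or "kubeflow"
    return f"{name}.{ns}.svc.cluster.local"
```

Because `env` is just a mapping, the values can come from a configmap injected into the notebook as environment variables, and tests can pass a plain dict.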

dhirajsb · 2024-09-30T19:30:05Z

BTW, the Python client can do this without binding to OpenShift specific APIs using the K8s unstructured generic API.
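As a rough sketch of what that could look like: the function below reads the registries namespace from a DSC through a generic custom-objects call. It takes the API object as a parameter and expects it to behave like `kubernetes.client.CustomObjectsApi` (whose `list_cluster_custom_object(group, version, plural)` returns a plain dict). The group, version, plural, and the `spec.components.modelregistry.registriesNamespace` field path are assumptions to verify against the target cluster, and `registries_namespace` is a hypothetical name.

```python
from typing import Any, Optional

# Assumed identifiers for the ODH DataScienceCluster resource.
DSC_GROUP = "datasciencecluster.opendatahub.io"
DSC_VERSION = "v1"
DSC_PLURAL = "datascienceclusters"


def registries_namespace(api: Any) -> Optional[str]:
    """Return the registries namespace from the first DSC that sets it.

    Returns None when the resource type is absent, access is denied,
    or no DSC carries the field, so callers can fall back.
    """
    try:
        listing = api.list_cluster_custom_object(DSC_GROUP, DSC_VERSION, DSC_PLURAL)
    except Exception:  # real code would catch the client's API exception
        return None
    for dsc in listing.get("items", []):
        components = dsc.get("spec", {}).get("components", {})
        ns = components.get("modelregistry", {}).get("registriesNamespace")
        if ns:
            return ns
    return None
```

In a notebook this might be called as `registries_namespace(kubernetes.client.CustomObjectsApi())` after loading the in-cluster config; no OpenShift-specific package is involved.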

dhirajsb · 2024-09-30T19:33:17Z

The odh-dashboard role referenced by @isinyaaa is a workaround, not a requirement once Notebooks grants the right permissions midstream. That's being worked on by the odh notebooks team.

rareddy · 2024-10-01T03:25:03Z

Apologies, when I mentioned OpenShift I was referring to Kube in general. I'm saying that, in general, the Python client should not make any assumptions about where the MR is hosted, IMO. Thinking more in terms of what you said above:

> If we want to allow Python API to be service location agnostic, another way to do that would be by supporting an env variable for a default MR namespace, and also support specifying a direct service host. That way it can also be injected into the notebook from a user supplied configmap (which could be cluster/env specific) if needed.

Then some other automation can provide the config/env based on the environment.

rareddy · 2024-10-01T18:47:23Z

similar pattern by MLFlow https://mlflow.org/docs/latest/model-registry.html#id10

tarilabs · 2024-10-04T12:40:19Z

> similar pattern by MLFlow https://mlflow.org/docs/latest/model-registry.html#id10

I believe this is the link you wanted: https://mlflow.org/docs/latest/model-registry.html#databricks-unity-catalog-model-registry (the original link is an entry in a TOC)

Sounds nice. Following up from #447 (comment), what about using something analogous to Viper on the Go side, like https://pypi.org/project/12factor-configclasses/, that would support said injection?

This way:

for upstream KF, we can provide them in the Manifest for example
for midstream, folks can rely on their own Operators making the env or CM
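To illustrate the shape of such a settings object, here is a sketch using only the standard library's `dataclasses` rather than the linked package, whose API is not shown in this thread. The class name `RegistrySettings`, its fields, and the `MR_` prefix are all hypothetical.

```python
import os
from dataclasses import dataclass
from typing import Mapping, Optional

_TRUTHY = {"1", "true", "yes"}


@dataclass(frozen=True)
class RegistrySettings:
    """Typed settings filled from environment variables (hypothetical)."""

    namespace: str = "kubeflow"
    host: Optional[str] = None
    secure: bool = True

    @classmethod
    def from_env(
        cls, env: Mapping[str, str] = os.environ, prefix: str = "MR_"
    ) -> "RegistrySettings":
        defaults = cls()
        raw_secure = env.get(f"{prefix}SECURE")
        return cls(
            namespace=env.get(f"{prefix}NAMESPACE", defaults.namespace),
            host=env.get(f"{prefix}HOST") or None,
            # Unset keeps the default; otherwise parse common truthy strings.
            secure=defaults.secure
            if raw_secure is None
            else raw_secure.strip().lower() in _TRUTHY,
        )
```

An upstream manifest or a midstream operator would then only need to set the corresponding variables (directly or via a configmap), and the client reads them once through `RegistrySettings.from_env()`.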

Signed-off-by: Isabella do Amaral <idoamara@redhat.com>

google-oss-prow bot added the do-not-merge/work-in-progress label Sep 23, 2024

google-oss-prow bot requested review from andreyvelich, ckadner and rareddy September 23, 2024 19:48

github-actions bot added the Area/MR Python client label Sep 23, 2024

google-oss-prow bot added the size/L label Sep 23, 2024

isinyaaa force-pushed the push-uzukprvowwly branch 2 times, most recently from 39ed152 to 6cbc548 Compare September 23, 2024 20:25

isinyaaa marked this pull request as ready for review September 23, 2024 20:35

google-oss-prow bot removed the do-not-merge/work-in-progress label Sep 23, 2024

google-oss-prow bot requested a review from Tomcli September 23, 2024 20:35

tarilabs reviewed Sep 23, 2024

Three review threads on clients/python/src/model_registry/_client.py (outdated, resolved)

isinyaaa force-pushed the push-uzukprvowwly branch from 6cbc548 to 7139a22 Compare September 24, 2024 12:11

isinyaaa force-pushed the push-uzukprvowwly branch 4 times, most recently from 7f65be5 to f049130 Compare September 25, 2024 15:04

isinyaaa commented Sep 25, 2024

Review thread on clients/python/src/model_registry/_utils.py (outdated, resolved)

isinyaaa force-pushed the push-uzukprvowwly branch from f049130 to 92b96f1 Compare September 25, 2024 15:10

isinyaaa marked this pull request as draft September 25, 2024 16:04

google-oss-prow bot added the do-not-merge/work-in-progress label Sep 25, 2024

isinyaaa commented Sep 25, 2024

Review thread on clients/python/src/model_registry/_utils.py (outdated, resolved)

isinyaaa force-pushed the push-uzukprvowwly branch from 3b3b124 to 4280577 Compare September 25, 2024 16:19

tarilabs mentioned this pull request Oct 4, 2024

use deployed MR for python e2e tests #447

Merged

7 tasks

isinyaaa added 3 commits November 11, 2024 11:17

py: switch from deprecated get_event_loop

0d45688

Signed-off-by: Isabella do Amaral <idoamara@redhat.com>

py: add basic service URL resolver

ce0c6aa

Signed-off-by: Isabella do Amaral <idoamara@redhat.com>

py: abstract kc API

c01f397

Signed-off-by: Isabella do Amaral <idoamara@redhat.com>

isinyaaa force-pushed the push-uzukprvowwly branch from 4280577 to c01f397 Compare November 11, 2024 14:18
