feat: Support simple model serving #75
/cc @gaocegege
- name: KLEVER_MODEL_REGISTRY_PORT
  value: {{ .Values.service.Port }}
- name: ORMB_DOMAIN
We can use ORMB_DOMAIN I think.
pkg/registry/serving/composer.go
      MountPath: modelMountPath,
    },
  },
  ImagePullPolicy: corev1.PullAlways,
Can we avoid pulling always?
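One possible way to address this is to make the pull policy configurable and fall back to IfNotPresent. The sketch below is only an illustration: `resolvePullPolicy` is a hypothetical helper, not code from this PR, and the local `PullPolicy` type just mirrors the string values of `corev1.PullPolicy` so the example compiles without the Kubernetes modules.

```go
package main

import "fmt"

// PullPolicy mirrors the string values of corev1.PullPolicy
// ("Always", "IfNotPresent", "Never"). It is redeclared here only so the
// sketch is self-contained; real code would use k8s.io/api/core/v1.
type PullPolicy string

const (
	PullAlways       PullPolicy = "Always"
	PullIfNotPresent PullPolicy = "IfNotPresent"
	PullNever        PullPolicy = "Never"
)

// resolvePullPolicy is a hypothetical helper (an assumption, not part of
// the PR). It returns the configured policy when it is one of the three
// known values and falls back to IfNotPresent otherwise.
func resolvePullPolicy(configured string) PullPolicy {
	switch PullPolicy(configured) {
	case PullAlways, PullIfNotPresent, PullNever:
		return PullPolicy(configured)
	default:
		return PullIfNotPresent
	}
}

func main() {
	fmt.Println(resolvePullPolicy(""))       // empty config falls back
	fmt.Println(resolvePullPolicy("Always")) // explicit value is kept
}
```

Note that with IfNotPresent a node that already has the image will not re-pull it, so this default fits best when the initializer image is referenced by a fixed tag or digest.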
  Args: []string{p.Graph.ModelURI, modelMountPath},
  Env: []corev1.EnvVar{
    {
      Name: "AWS_ACCESS_KEY_ID",
Please add some comments about the KEY to let readers know why we use AWS_ACCESS_KEY_ID instead of ORMB_XXX.
@@ -25,7 +25,7 @@ export SHELLOPTS := errexit
 ROOT := github.com/kleveross/klever-model-registry

 # Target binaries. You can build multiple binaries for a single project.
-TARGETS := klever-model-registry klever-modeljob-operator
+TARGETS := model-registry modeljob-operator
I remember that we decided to have a separate binary for inference?
The inference binary is trtserver plus the openscoring jar package; it does not include any source code.
Comments are addressed, PTAL. Thanks.
Still reviewing? Can this be merged?
/lgtm
[APPROVALNOTIFIER] This PR is APPROVED
This pull-request has been approved by: gaocegege
The full list of commands accepted by this bot can be found here.
The pull request process is described here.