Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Training gateway plugin #718

Merged
merged 36 commits into from
Nov 14, 2022
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
36 commits
Select commit Hold shift + click to select a range
e6b0f89
Introduce gateway plugin for model training
AmartC Sep 29, 2022
43af72d
Update protobuf definitions for WorkloadResponse
AmartC Oct 3, 2022
30a9358
Update proto files to reflect workload log count
AmartC Oct 3, 2022
c4d04fa
Update model_training proto file as well as api, plugin and system files
AmartC Oct 5, 2022
42abcaf
Update modifications to api.go, plugin.go and system.go files
AmartC Oct 5, 2022
c6e26ca
Update plugin code
AmartC Oct 6, 2022
4c75294
Update grpc proto file for model training
AmartC Oct 6, 2022
187dc9e
Update plugin for model training to address bug
AmartC Oct 6, 2022
9423adc
Update plugin
AmartC Oct 10, 2022
4700022
Update API for training a model
AmartC Oct 10, 2022
b09779a
Update proto files for training controller gateway
AmartC Oct 12, 2022
e528af0
Update proto files for training controller gateway
AmartC Oct 12, 2022
535af3c
Add endpoint for checking GPU
AmartC Oct 13, 2022
d831d1b
Update model training api to search for GPU
AmartC Oct 13, 2022
39ae227
Add opensearch aggregation routine as part of training controller gat…
AmartC Oct 14, 2022
169e336
Update model training plugin to include service for aggregating data …
AmartC Oct 18, 2022
1b75114
Rename directory from modelTraining camel case to modeltraining
AmartC Oct 20, 2022
24ac0d1
Update model training directory names
AmartC Oct 20, 2022
99c9319
Fix up modeltraining plugin imports
AmartC Oct 20, 2022
9203e01
Update main.go file to reflect latest naming style
AmartC Oct 20, 2022
63a3974
Undo changes to opensearch image in repository
AmartC Oct 20, 2022
4c604b2
Update plugin to fetch Opensearch endpoint and credentials
AmartC Oct 25, 2022
bbe26f5
Update code to make sure Opensearch credentials are properly being fe…
AmartC Oct 25, 2022
be12430
Remove logging debug statements
AmartC Oct 25, 2022
5eff8ed
Update code to address PR comments
AmartC Oct 26, 2022
2f00962
Remove unused structs
AmartC Oct 26, 2022
4e5a847
Update code to follow gofmt
AmartC Oct 26, 2022
2af4b8d
Update proto to rename GPUInfo list to items
AmartC Oct 27, 2022
8d7deaa
Update proto files with PR comments
AmartC Oct 27, 2022
0147522
Update aggregation service to use context from ModelTrainingPlugin
AmartC Oct 27, 2022
bb49abb
Update proto file definitions to be more distinct for each endpoint
AmartC Oct 27, 2022
58ae269
Add error handling for api endpoints and update proto definitions
AmartC Oct 27, 2022
5a984b2
Update plugin file to check if os-workload-aggregation already exists…
AmartC Oct 28, 2022
be64f21
Update proto definitions to be more concise
AmartC Oct 28, 2022
a16f198
Update struct field formatting to make it consistent
AmartC Nov 4, 2022
5f51000
Update composite aggregation size to be from 4 records per scroll to …
AmartC Nov 4, 2022
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
2 changes: 1 addition & 1 deletion pkg/resources/gateway/rbac.go
Original file line number Diff line number Diff line change
Expand Up @@ -118,7 +118,7 @@ func (r *Reconciler) rbac() ([]resources.Resource, error) {
},
{
APIGroups: []string{""},
Resources: []string{"endpoints"},
Resources: []string{"endpoints", "nodes"},
Verbs: []string{
"get",
"list",
Expand Down
21 changes: 21 additions & 0 deletions plugins/modeltraining/main.go
Original file line number Diff line number Diff line change
@@ -0,0 +1,21 @@
package main

import (
"context"
"time"

"github.com/gin-gonic/gin"
"github.com/rancher/opni/pkg/plugins"
"github.com/rancher/opni/pkg/tracing"
"github.com/rancher/opni/pkg/util/waitctx"
modeltraining "github.com/rancher/opni/plugins/modeltraining/pkg/modeltraining"
)

func main() {
tracing.Configure("plugin_modeltraining")
gin.SetMode(gin.ReleaseMode)
ctx, ca := context.WithCancel(waitctx.Background())
plugins.Serve(modeltraining.Scheme(ctx))
ca()
waitctx.Wait(ctx, 5*time.Second)
}
Loading