Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Deploy Model DB with durable storage #1860

Closed
jlewi opened this issue Oct 25, 2018 · 7 comments
Closed

Deploy Model DB with durable storage #1860

jlewi opened this issue Oct 25, 2018 · 7 comments

Comments

@jlewi
Copy link
Contributor

jlewi commented Oct 25, 2018

In 0.4.0 we should be able to deploy ModelDB with durable storage so that it can be used for primitive model tracking.

Context:
kubeflow/katib#137
kubeflow/community#195
kubeflow/examples#134

I know there's discussion about redesigning model tracking from scratch (kubeflow/community#195 ) but I don't see that landing in 0.4.

ModelDB on the other hand covers a lot of the use cases we care about. I wouldn't expect it to be difficult to deploy ModelDB with durable storage (e.g. PVC). That would unblock a lot of useful scenarios as people could begin using it to track models.

Related:
ModelDB v2: goo.gl/NaDXpQ

/cc @mpvartak
/cc @garganubhav

@jlewi
Copy link
Contributor Author

jlewi commented Oct 30, 2018

I think there are two databases used by ModelDB

  1. There is an instance of MongoDB
    This is the dbDeployment
    https://github.com/kubeflow/kubeflow/blob/master/kubeflow/katib/modeldb.libsonnet#L108

  2. There is ModelDB itself which I think uses an SQLLite database
    https://github.com/kubeflow/kubeflow/blob/master/kubeflow/katib/modeldb.libsonnet#L44

So both of these need to be stored on PVC.

For MongoDB we are just using the standard MongoDB image; so I'm guessing there are command line arguments or environment variables that we can use to store the database on a PVC.

For ModelDB I think we might need: VertaAI/modeldb#310

It looks like that PR makes some changes to store the model in the directory "/db" which we could then mount on PV.

/cc @mpvartak @garganubhav

@jlewi
Copy link
Contributor Author

jlewi commented Nov 2, 2018

@mpvartak To integrate this into Kubeflow you'll want to port over the changes in VertaAI/modeldb#310 into the modeldb.libsonnet file at the locations mentioned in the previous comment.

Once you have the changes integrated you can follow the instructions for using kfctl
https://www.kubeflow.org/docs/started/getting-started-gke/

To deploy Kubeflow on GCP.

If you run it from your local repo; it will use whatever changes you've made to the Katib/Modeldb components.

@carmine carmine added this to the 0.4.0 milestone Nov 6, 2018
@jlewi
Copy link
Contributor Author

jlewi commented Nov 19, 2018

@mpvartak any update on this?

@jlewi
Copy link
Contributor Author

jlewi commented Dec 3, 2018

@mpvartak @YujiOshima @gaocegege; anyone working on this?

@mpvartak
Copy link
Contributor

mpvartak commented Dec 3, 2018 via email

@jlewi
Copy link
Contributor Author

jlewi commented Dec 17, 2018

@mpvartak Thanks!

Looks like #2050 is pending. Would be great to get that merged today so we can include it in the 0.4 release.

@jlewi
Copy link
Contributor Author

jlewi commented Jan 7, 2019

#2050 was merged so closing this out.

@jlewi jlewi closed this as completed Jan 7, 2019
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants