Unify manifest save path #322

PhilippeMoussalli · 2023-07-27T07:52:40Z

Related to #313

The local and remote runner store the manifest in different locations:

For the local runner, the manifest save path is set at compile time to a fixed path that we specify {base_path}/{component_name}/manifest.json"
For the remote runner (kfp), the output_manifest_path is set as an output artifact type. This is needed for chaining component together. The save path in this case is save internally within the VM /tmp/outputs/output_manifest_path/data and then mapped to minio storage after the component run (which then gets mapped to a cloud storage). The mapping saves the artifact to the specified base path followed by a fixed file structure that cannot be changed and also stored. It is also stored as zip file which contains the written text file (manifest)

Example:

minio://soy-audio-379412_kfp-artifacts/artifacts/datacomp-filtering-pipeline-wvglp/2023/07/26/datacomp-filtering-pipeline-wvglp-3788902962/load-from-hub-output_manifest_path.tgz

where /soy-audio-379412_kfp-artifacts is the artifact bucket specified when deploying KFP.

This PR unifies the manifest save path for both local and remote runner. It checks whether the save_path matches the expected kubeflow path and if so, save it both to the expected kfp artifact path (needed for chaining component) and the custom path that we require for caching.

It's not the most optimal solution since we're writing the file twice but I don't see any other clear cut solution.

@ChristiaensBert I think this might also fix some issues with the data explorer.

GeorgesLorre

Its not a perfect solution but more then good enough for now.

@ChristiaensBert

Related to #313 The local and remote runner store the manifest in different locations: * For the local runner, the manifest save path is set at compile time to a fixed path that we specify `{base_path}/{component_name}/manifest.json"` * For the remote runner (kfp), the `output_manifest_path` is set as an output artifact type. This is needed for chaining component together. The save path in this case is save internally within the VM `/tmp/outputs/output_manifest_path/data` and then mapped to minio storage after the component run (which then gets mapped to a cloud storage). The mapping saves the artifact to the specified base path followed by a fixed file structure that cannot be changed and also stored. It is also stored as zip file which contains the written text file (manifest) Example: ``` minio://soy-audio-379412_kfp-artifacts/artifacts/datacomp-filtering-pipeline-wvglp/2023/07/26/datacomp-filtering-pipeline-wvglp-3788902962/load-from-hub-output_manifest_path.tgz ``` where `/soy-audio-379412_kfp-artifacts` is the artifact bucket specified when deploying KFP. This PR unifies the manifest save path for both local and remote runner. It checks whether the `save_path` matches the expected kubeflow path and if so, save it both to the expected kfp artifact path (needed for chaining component) and the custom path that we require for caching. It's not the most optimal solution since we're writing the file twice but I don't see any other clear cut solution. @ChristiaensBert I think this might also fix some issues with the data explorer.

PhilippeMoussalli added 4 commits July 26, 2023 20:40

changes

9525a23

changes

38301a4

changes

ebb8653

refactor

166bf45

PhilippeMoussalli requested a review from GeorgesLorre July 27, 2023 07:52

PhilippeMoussalli self-assigned this Jul 27, 2023

PhilippeMoussalli added the Core Core framework label Jul 27, 2023

PhilippeMoussalli linked an issue Jul 27, 2023 that may be closed by this pull request

Unify manifest save path between local and remote runner #321

Closed

fix docs

c4e750a

GeorgesLorre approved these changes Jul 27, 2023

View reviewed changes

PhilippeMoussalli merged commit 72f9958 into main Jul 27, 2023

PhilippeMoussalli deleted the unify-manifest-save-path branch July 27, 2023 11:36

PhilippeMoussalli mentioned this pull request Aug 22, 2023

Redesign base path file structure #373

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Unify manifest save path #322

Unify manifest save path #322

PhilippeMoussalli commented Jul 27, 2023

GeorgesLorre left a comment

Unify manifest save path #322

Unify manifest save path #322

Conversation

PhilippeMoussalli commented Jul 27, 2023

GeorgesLorre left a comment

Choose a reason for hiding this comment