Note. This is very much a kedro-viz and kedro core issue but I've put it here since that's where most of the big experiment tracking discussions are currently.
There are several parts of experiment tracking that already exist or we have always anticipated adding but feel very uncertain/unachievable at the moment because they either don't have a design or we've deviated from the original designs. Some of these already have their own issues here, but I want to get the ball rolling about what the overall solution here might be. Several of the issues are very closely connected and their solutions will impact each other (e.g. how tracking.MetricsDataSet works will affect the use of the SQLite database, which will affect the multi-user experience). That doesn't mean we need to implement lots of new features all at once, but I think we need a holistic design here rather than building it piecemeal. At the moment I feel like we're a bit stuck on these questions and it would be great to get some clarity on them.
Open questions
1. What is SQLiteStore for?

In the original proposal @limdauto stated:
I think @idanov feels otherwise though:
SQLiteStore should always live in kedro-viz
SQLiteStore should not be used to record anything other than run metadata (like run command, timestamp, etc.)

Where the SQLiteStore code lives isn't such a big deal, but getting a clearer idea of what SQLiteStore is actually for is essential if we're going to add features like multi-user experience, searching by metric, etc.
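To make the "run metadata only" reading concrete, here is a minimal sketch of the kind of table such a store would hold. This is only an illustration under that assumption; the database filename, table and column names are invented and are not kedro-viz's actual schema.

```python
# Illustrative only: a store that records nothing beyond run metadata.
# Table, columns and values are hypothetical, not kedro-viz's real schema.
import sqlite3

conn = sqlite3.connect("session_store.db")
conn.execute(
    """
    CREATE TABLE IF NOT EXISTS runs (
        run_id      TEXT PRIMARY KEY,  -- e.g. the session timestamp
        cli_command TEXT,              -- the `kedro run ...` invocation
        started_at  TEXT,
        git_sha     TEXT
    )
    """
)
conn.execute(
    "INSERT OR REPLACE INTO runs VALUES (?, ?, ?, ?)",
    (
        "2022-07-01T10.30.00.000Z",
        "kedro run --pipeline=data_science",
        "2022-07-01T10:30:00",
        "abc1234",
    ),
)
conn.commit()
conn.close()
```

Under this reading, metrics and plots would stay purely in the (versioned) datasets on disk, and the store remains a thin index of runs.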
Related: settings.py to ones that work kedro#1538 - what should be suggested in settings.py?
2. What should happen to the tracking datasets?

The original proposal expected there to be three datasets for recording experiment tracking: tracking.MetricsDataSet (key-value pairs with numerical values), tracking.JSONDataSet (general JSON) and tracking.ArtifactDataSet (everything else). The first two of these exist but the third doesn't. Instead it was chosen to implement plots as versioned instances of the matplotlib.MatplotlibWriter dataset.

Copying my comments from kedro-org/kedro#1626 (comment):

While I agree with the "tracked plot = versioned dataset" approach, it does feel like an inconsistent and confusing UX given the already-existing tracking datasets:
Want to track JSON data? Change your dataset type to tracking.JSONDataSet.
Want to track a plot? Keep the same dataset type but set versioned: true.

Hence I think we do need to work out what happens with tracking.JSONDataSet and tracking.MetricsDataSet sooner rather than later. tracking.JSONDataSet could easily be deprecated in favour of json.JSONDataSet with versioned: true, but tracking.MetricsDataSet is trickier. To me this is directly coupled to questions like "how do I search runs by metric?" and "why not just do a log_metric call?" (which we decided against before). Overall, adding plots to experiment tracking sounds straightforward and I'm very happy to do it via versioned: true, but we need to work out a more holistic and complete solution here or experiment tracking becomes a bit of a mish-mash of different approaches.
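To spell the contrast out, here is a rough, invented example of the two styles side by side, written as the Python equivalent of the catalog.yml entries (entry names and filepaths are made up):

```python
# Rough illustration of the UX inconsistency: the JSON/metrics data is tracked
# by changing the dataset *type*, while the plot keeps its type and is tracked
# by flipping `versioned` on. This mirrors what would normally live in catalog.yml.
from kedro.io import DataCatalog

catalog = DataCatalog.from_config(
    {
        "model_metrics": {
            "type": "tracking.MetricsDataSet",       # tracked because of the type
            "filepath": "data/09_tracking/metrics.json",
        },
        "model_summary": {
            "type": "tracking.JSONDataSet",          # tracked because of the type
            "filepath": "data/09_tracking/summary.json",
        },
        "confusion_matrix": {
            "type": "matplotlib.MatplotlibWriter",   # tracked because versioned: true
            "filepath": "data/08_reporting/confusion_matrix.png",
            "versioned": True,
        },
    }
)
```

Both kinds of entry mean the same thing to the user ("track this output"), but they express it in two different ways depending on the dataset involved.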
3. How do we enable a search functionality?
This was always on the roadmap as a feature and now it's been requested by a user: #1039.
The linked issue has the relevant quotes on @limdauto's idea for implementing this, but they all rely on SQLiteStore being used to store metrics in some way. This is something I personally feel most uncertain about because I don't really have any idea how to build a search functionality in either scenario (metrics in SQLiteStore or not).
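To at least pin down what the "metrics in SQLiteStore" scenario would imply, here is a purely hypothetical sketch: if tracked metrics were written as (run_id, key, value) rows next to the run metadata, search-by-metric would reduce to a SQL filter. Neither the metrics table nor this query exists in kedro-viz today; all names and values are invented.

```python
# Hypothetical sketch of the "metrics live in SQLiteStore" scenario.
# Nothing like this is implemented; it only makes the scenario concrete.
import sqlite3

conn = sqlite3.connect(":memory:")  # throwaway database just for the sketch
conn.executescript(
    """
    CREATE TABLE runs (run_id TEXT PRIMARY KEY, cli_command TEXT);
    CREATE TABLE metrics (run_id TEXT, key TEXT, value REAL);
    INSERT INTO runs VALUES ('2022-07-01T10.30.00', 'kedro run');
    INSERT INTO metrics VALUES ('2022-07-01T10.30.00', 'accuracy', 0.93);
    """
)

# "Search runs by metric" then becomes a plain SQL filter over the metrics table.
rows = conn.execute(
    """
    SELECT r.run_id, m.value
    FROM runs AS r
    JOIN metrics AS m ON m.run_id = r.run_id
    WHERE m.key = 'accuracy' AND m.value > 0.9
    ORDER BY m.value DESC
    """
).fetchall()
print(rows)  # [('2022-07-01T10.30.00', 0.93)]
conn.close()
```

In the other scenario (metrics only in versioned tracking.MetricsDataSet files on disk), the same search would presumably mean globbing and parsing every versioned JSON file, which is why the ideas in the linked issue lean on SQLiteStore.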
4. How to enable multi-user experience?
Relevant issue: #1218
This is also very unclear to me currently and depends heavily on the role played by SQLiteStore.