Add support for storing the optimization targets and direction of an experiment #628

bpkroth · 2024-01-10T21:46:03Z

This PR is useful for mlos-viz and dabl wrapper (#624) to be able to automatically graph the results for a given optimization target, for instance via something like the following:

for opt_target in exp.objectives:
    dabl.plot(exp.results, opt_target)

Since the prior efforts on capturing this data in the Trial metadata are somewhat problematic (allow conflicting changes between runs of an experiment, don't support multi-objective), we extend them to also store values directly as a part of the Experiment, which is a somewhat more appropriate location. Upon retrieval, an attempt is also made to merge the two data sources for backwards compatibility.

This PR does not enforce strictness on that metadata, but future versions could (e.g., disallow resuming an Experiment if it looks like the objective targets have changed. In that case the prior Trial results can potentially still be used to prewarm a new Experiment's optimizer).

…xperiment

motus · 2024-01-10T22:07:33Z

wait, we already store that data in the trial_param table - see

MLOS/mlos_bench/mlos_bench/run.py

Lines 120 to 124 in 2032d79

    
           trial = exp.new_trial(tunables, config={ 
        
               "optimizer": opt.name, 
        
               "opt_target": opt.target, 
        
               "opt_direction": "min" if opt.is_min else "max", 
        
           })

you can add more key/value pairs to store with each trial there

bpkroth · 2024-01-11T15:34:01Z

wait, we already store that data in the trial_param table - see

MLOS/mlos_bench/mlos_bench/run.py

Lines 120 to 124 in 2032d79

trial = exp.new_trial(tunables, config={

"optimizer": opt.name,

"opt_target": opt.target,

"opt_direction": "min" if opt.is_min else "max",

})

you can add more key/value pairs to store with each trial there

We had a conversation about this offline. I think the conclusion was that:

The optimization configuration is really a property of the Experiment and not the Trial, so if something about the Experiment's optimization goal's configuration changes, we should really require a new Experiment, though previously we didn't store or enforce this.
That doesn't preclude reusing some TrialData from a previous Experiment to pre-warm a new one.
However, storing metadata about the Optimization goals used during a given Trial, which potentially redundant, may still be useful.
Though, with that in mind, we may also want to store other parameter data used too ...

I think with that in mind, I will still pursue this PR and a future one can enforce the strictness aspects of not changing things, perhaps after we make it easier to merge other "related" TrialData (though capturing which those are might itself be tricky).

mlos_bench/mlos_bench/storage/base_storage.py

mlos_bench/mlos_bench/tests/storage/trial_data_test.py

mlos_bench/mlos_bench/run.py

mlos_bench/mlos_bench/storage/sql/trial.py

mlos_bench/mlos_bench/storage/sql/experiment_data.py

mlos_bench/mlos_bench/storage/base_storage.py

Co-authored-by: Sergiy Matusevych <sergiy.matusevych@gmail.com>

motus

Looks good! I leave it up to you to keep the optimization direction nullable or roll back to the non-nullable variant

mlos_bench/mlos_bench/storage/base_experiment_data.py

Co-authored-by: Sergiy Matusevych <sergiy.matusevych@gmail.com>

Adds a basic `mlos_viz.plot(exp)` style API for simple visualizations of `ExperimentData` results relative to the experiment's objectives (building off of #628 and dabl/dabl#335). Note: this PR currently omits unit tests for the new module due to the complexity of testing visualizations. We intend to add this in future PRs. There is however, a working example of its use here right now: Microsoft-CISL/sqlite-autotuning#41 --------- Co-authored-by: Sergiy Matusevych <sergiy.matusevych@gmail.com>

#628 mistakenly included an early attempt at adding `optimization_target` and `optimization_direction` to the `experiment` table in the `mlos_bench.storage.sql` backend. In that PR we later moved it to its own `objectives` table to eventually support multi-objectives. Nothing accesses those columns now, however including them in the metadata makes it impossible to load storage backends previously created with the old schema since adjusting columns with sqlalchemy's `create_all()` API only considers table existence. On the contrary, the latter means that we will automatically support old storage backends with the new code for the `objectives` table. Removing these two columns in the `metadata` schema description simply allows that to proceed without error. See Also: #649 Co-authored-by: Sergiy Matusevych <sergiym@microsoft.com>

Add support for storing the optimization target and direction of an e…

fdcc3e3

…xperiment

bpkroth requested a review from a team as a code owner January 10, 2024 21:46

bpkroth added the WIP Work in progress - do not merge yet label Jan 10, 2024

bpkroth added 4 commits January 11, 2024 18:46

Rename and improve tunable vs config API and documentation

cf69674

comments

5cb4258

data checking

a0c21ca

actually do the insert, and add some todo comments

cfd3e4e

bpkroth marked this pull request as draft January 11, 2024 18:48

bpkroth added 8 commits January 11, 2024 18:50

consolidate logic

7536d59

wip: add a fallback

b14e710

pylint

3498ff2

TODOs

4fdf957

fixups

eeea732

Merge branch 'main' into store-and-expose-optimization-target-info

c26ba06

add objective info to storage schema

6e49010

todo comments

687501f

bpkroth marked this pull request as ready for review January 11, 2024 21:34

stubs for tests

aa9e545

bpkroth changed the title ~~Add support for storing the optimization target and direction of an experiment~~ Add support for storing the optimization targets and direction of an experiment Jan 11, 2024

bpkroth added 5 commits January 11, 2024 21:54

fixup

814a0dd

move some attrs to the base class

a2673da

fixups

fdf64d2

basic test

d77a806

add some more test handling

c15123b

bpkroth commented Jan 11, 2024

View reviewed changes

mlos_bench/mlos_bench/storage/base_storage.py Outdated Show resolved Hide resolved

bpkroth added ready for review Ready for review and removed WIP Work in progress - do not merge yet labels Jan 11, 2024

reorg

5ffcd67

bpkroth commented Jan 11, 2024

View reviewed changes

mlos_bench/mlos_bench/tests/storage/trial_data_test.py Show resolved Hide resolved

bpkroth mentioned this pull request Jan 12, 2024

Introduce basic experiment visualization module mlos_viz via dabl #624

Merged

bpkroth requested a review from motus January 12, 2024 23:21

motus reviewed Jan 16, 2024

View reviewed changes

bpkroth and others added 4 commits January 16, 2024 11:14

Update mlos_bench/mlos_bench/run.py

d6ba89e

Co-authored-by: Sergiy Matusevych <sergiy.matusevych@gmail.com>

making opt_direction optional

8683aad

adding todo comments and stubbing out weighted multi-objective support

8d12b72

Merge branch 'main' into store-and-expose-optimization-target-info

bd5a2a6

motus approved these changes Jan 16, 2024

View reviewed changes

mlos_bench/mlos_bench/storage/base_experiment_data.py Outdated Show resolved Hide resolved

bpkroth and others added 2 commits January 16, 2024 18:00

make optimization direction non-nullable

667119a

Update mlos_bench/mlos_bench/storage/base_experiment_data.py

d4d8f50

Co-authored-by: Sergiy Matusevych <sergiy.matusevych@gmail.com>

bpkroth enabled auto-merge (squash) January 16, 2024 18:02

bpkroth merged commit 221cee3 into microsoft:main Jan 16, 2024
11 checks passed

bpkroth deleted the store-and-expose-optimization-target-info branch January 16, 2024 18:11

bpkroth mentioned this pull request Jan 26, 2024

Remove mistakenly added columns #652

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add support for storing the optimization targets and direction of an experiment #628

Add support for storing the optimization targets and direction of an experiment #628

bpkroth commented Jan 10, 2024 •

edited

Loading

motus commented Jan 10, 2024 •

edited

Loading

bpkroth commented Jan 11, 2024

motus left a comment

Add support for storing the optimization targets and direction of an experiment #628

Add support for storing the optimization targets and direction of an experiment #628

Conversation

bpkroth commented Jan 10, 2024 • edited Loading

motus commented Jan 10, 2024 • edited Loading

bpkroth commented Jan 11, 2024

motus left a comment

Choose a reason for hiding this comment

bpkroth commented Jan 10, 2024 •

edited

Loading

motus commented Jan 10, 2024 •

edited

Loading