paradox/set6 integration #345

RaphaelS1 · 2021-03-24T11:29:23Z

@berndbischl @mllg @mb706 @be-marc summarising our discussions here and including action points.

Problem: Want to avoid two R6 parameter interfaces that perform a similar task (paradox and param6). param6 created by me as paradox did not work for use-cases that involved more complex sets and had significant bottlenecks.

Solution: Change R6->S3 for Param object in paradox and internally all Params wrap set6 objects. mlr packages will all use sugar functionality to make it easier to develop internal changes without breaking.

Comments: Given the size of the to-do list below and other use-cases paradox cannot currently handle, for example extracting parameters with the same IDs but ignoring prefixes (essential to distr6), I think we should consider converging in the 'other direction'. i.e. start with param6 (if needed we can change the name to paradox) and converge towards paradox, but I currently think param6 actually has more functionality so it could be quicker to change variable names and add sugar functions to param6, than it would be to change underlying objects in paradox.

If we agree that the final package is still called paradox and has at least the same amount of functionality as paradox, then it could be much quicker to start from the param6 code.

To-do (not complete):

The text was updated successfully, but these errors were encountered:

mb706 · 2021-03-25T23:22:58Z

Things that seem to be missing from set6 objects that we use in paradox:

$convert() function
qunif() function (possibly not needed)
tolerance for ParamDbl

also, how sure are we that param6's cnd mechanism are not constraints instead of dependencies?

RaphaelS1 · 2021-03-26T07:50:44Z

What does convert do? If qunif is needed it can be added easily but not sure if it's appropriate in the package ? And how does tol work in ParamDbl?

Edit: Looked at code for convert and tolerance. It seems this is another point why set6 is useful. ParamDbl with tol=0 is equivalent to a set6 interval with closed bounds, whereas with tol>0 this is equivalently to an interval with open bounds (can also be half-open). So all we need is to make the tolerance amount controllable as either a global option (currently) or local parameter in the Interval object. About convert, I don't fully understand why this is needed as either an element should or shouldn't be in a set. Also I think that conversion, if it is needed for any reason, should really take place when the value is set immediately and not as a method. If needed we can add it easily.

And about cnd it's actually both. I realised that many checks in distr6 could be handled much mor easily and cleanly by using the same mechanism. So my cnd does the same as paradox it you only use value-based and not variable-based dependencies, and has same set of conditions, as well as more functionality for comparing variables. It makes it super useful for checks like "is Param A lower than Param B"

mllg · 2021-03-26T08:40:53Z

AFAICT qunif() is only needed for sampling and can go to bbotk as (internal) S3.

mb706 · 2021-03-26T09:36:43Z

tol>0 this is equivalently to an interval with open bounds

the way I understand 'open bounds' in set6 is that it makes the set smaller by some small number (if you're lucky enough and your bounds are not so large that machine precision eats that number). What tol does in paradox is that it makes the acceptable range larger by some small number, so that numbers that are only off by machine imprecision are still accepted by the ParamSet. What convert then does is that it puts numbers that are outside of bounds by that small amount to exactly that bound. It furthermore converts numeric values to integer for ParamInt, as some methods (python or C based) make a distinction between these types when R otherwise generally doesn't.

mb706 · 2021-03-26T09:47:18Z

Does anything make any use of the constraints in cnd beyond give a yes/no answer about the feasibility of a configuration point as a whole? Otherwise it could be replaced by a more general mechanism, like a function that maps the configuration space to logical(1). What dependencies are used for in paradox is for (1) removing components that are not 'active' as per the dependencies and for (2) allowing hierarchical sampling of parameter values, depending on whether their dependencies are fulfilled. AFAIUI this would not be possible with more general conditions, where you would just do rejection sampling, which a more general function would also work for?

berndbischl · 2021-03-26T12:25:09Z

AFAICT qunif() is only needed for sampling and can go to bbotk as (internal) S3.

even there we don't need it that much

RaphaelS1 · 2021-03-26T23:12:50Z

the way I understand 'open bounds' in set6 is that it makes the set smaller by some small number (if you're lucky enough and your bounds are not so large that machine precision eats that number). What tol does in paradox is that it makes the acceptable range larger by some small number, so that numbers that are only off by machine imprecision are still accepted by the ParamSet. What convert then does is that it puts numbers that are outside of bounds by that small amount to exactly that bound. It furthermore converts numeric values to integer for ParamInt, as some methods (python or C based) make a distinction between these types when R otherwise generally doesn't.

Got it, we could easily add this to set6.

Does anything make any use of the constraints in cnd beyond give a yes/no answer about the feasibility of a configuration point as a whole?

I guess it asks the paradox question: can this parameter be set, as well as: if it can be set then how can it be set. And the check is only triggered if the value is non-NULL, so it has the same use-case as paradox, i.e. "let this value be set if a condition on the dependency is satisfied". This is still much faster than a general check over the full parameter set

mb706 · 2021-03-29T00:25:37Z

much faster than a general check over the full parameter set

So speed is the only reason here? Or is cnd used somewhere in your code for something beyond making a yes / no decision about feasibility?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

paradox/set6 integration #345

paradox/set6 integration #345

RaphaelS1 commented Mar 24, 2021 •

edited by mb706

Loading

mb706 commented Mar 25, 2021

RaphaelS1 commented Mar 26, 2021 •

edited

Loading

mllg commented Mar 26, 2021

mb706 commented Mar 26, 2021

mb706 commented Mar 26, 2021

berndbischl commented Mar 26, 2021

RaphaelS1 commented Mar 26, 2021

mb706 commented Mar 29, 2021

paradox/set6 integration #345

paradox/set6 integration #345

Comments

RaphaelS1 commented Mar 24, 2021 • edited by mb706 Loading

mb706 commented Mar 25, 2021

RaphaelS1 commented Mar 26, 2021 • edited Loading

mllg commented Mar 26, 2021

mb706 commented Mar 26, 2021

mb706 commented Mar 26, 2021

berndbischl commented Mar 26, 2021

RaphaelS1 commented Mar 26, 2021

mb706 commented Mar 29, 2021

RaphaelS1 commented Mar 24, 2021 •

edited by mb706

Loading

RaphaelS1 commented Mar 26, 2021 •

edited

Loading