Comparison matrix of models versus optimisers versus inference methods #1542
Replies: 7 comments
-
A direct count of the number of forward solves needed to reach an optimum or a converged posterior would be nice to have.
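One cheap way to get that count, sketched below assuming the standard `pints.ForwardModel` interface (`n_parameters()` / `simulate()`): wrap the model and increment a counter on every forward solve. The `CountingModel` name and wiring are illustrative, not anything in Pints itself.

```python
import pints

class CountingModel(pints.ForwardModel):
    """Wraps a pints.ForwardModel and counts forward solves (illustrative)."""

    def __init__(self, model):
        self._model = model
        self.n_solves = 0

    def n_parameters(self):
        return self._model.n_parameters()

    def simulate(self, parameters, times):
        # Each call to simulate() is one forward solve
        self.n_solves += 1
        return self._model.simulate(parameters, times)
```

Any problem/error measure built on the wrapped model then reports `n_solves` once the optimiser or sampler has finished.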
-
Good point! Relates to #203
-
Here are a couple of heat maps I did using the optimisers, just one run each, and 1% noise: [heat map images attached]
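For reference, here is roughly how such a heat map can be drawn from a matrix of scores with matplotlib; the model and optimiser names and the score values below are placeholders, not the actual results:

```python
import numpy as np
import matplotlib.pyplot as plt

# Placeholder data: scores[i, j] = best score for model i under optimiser j
optimisers = ['CMAES', 'XNES', 'SNES', 'PSO']
models = ['Logistic', 'Rosenbrock', 'FitzhughNagumo']
scores = np.random.rand(len(models), len(optimisers))

fig, ax = plt.subplots()
im = ax.imshow(scores, cmap='viridis')
ax.set_xticks(range(len(optimisers)))
ax.set_xticklabels(optimisers)
ax.set_yticks(range(len(models)))
ax.set_yticklabels(models)
fig.colorbar(im, ax=ax, label='best score')
plt.show()
```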
-
Just discussing whether, for optimisers, we want to show the mean score over multiple runs or the best score.
-
I think you really want to see the distribution of optimiser results; it will have a big impact on use whether they always get the same answer or produce a wide spread of results.
-
Yeah, I think you're right. I'm storing all the results from all the independent runs of the optimisers, and plotting the mean and minimum scores and the `execution_time`. But all the data is there, so we can post-process any other statistic you might wish for.
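For example, assuming the stored results sit in an array with one row per independent run and columns for best score and execution time (a hypothetical layout), any statistic falls out in a couple of lines:

```python
import numpy as np

# Hypothetical layout: one row per independent optimiser run,
# columns = (best score, execution_time in seconds)
runs = np.array([
    [1.2e-3, 4.1],
    [9.8e-4, 3.9],
    [5.6e-2, 4.3],  # an outlier run that got stuck
])

scores, times = runs[:, 0], runs[:, 1]
print('mean score :', scores.mean())
print('min score  :', scores.min())
print('score IQR  :', np.subtract(*np.percentile(scores, [75, 25])))
print('mean time  :', times.mean())
```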
-
@martinjrobins can this one be closed?
-
I'm writing a repo that simply takes all the toy models in pints, and all the methods (optimisers and inference), and then tests every method against every model. This will take a while, so it's doing it all on arcus-b (there is a lot of machine-specific stuff in there, so it's not suitable to put into Pints itself).
When I'm comparing optimisers, I compare using the following criteria:
I might also average these results over multiple runs of the optimiser, since some of them will be stochastic.
I'm less sure how to compare the inference methods; perhaps:
What other criteria do you all think are necessary, @MichaelClerx @ben18785 @sanmitraghosh @mirams @chonlei? I'm hoping this will give a bunch of heat maps comparing the performance of all of our methods, and it will go into the first paper. A rough sketch of the model-by-method loop follows below.
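A minimal sketch of that model-by-optimiser cross-product, assuming the current Pints API (`OptimisationController`, toy models with `suggested_parameters()` / `suggested_times()`); the model and method lists, the 1% noise model, and the perturbed starting point are illustrative choices, not the actual arcus-b code:

```python
import numpy as np
import pints
import pints.toy

# Illustrative subsets; the real run covers all toy models and all methods
models = [pints.toy.LogisticModel(), pints.toy.FitzhughNagumoModel()]
optimisers = [pints.CMAES, pints.XNES, pints.SNES, pints.PSO]

results = {}
for model in models:
    # Synthetic data from each model's suggested set-up, plus 1% noise
    real = np.asarray(model.suggested_parameters())
    times = model.suggested_times()
    values = model.simulate(real, times)
    values = values + np.random.normal(0, 0.01 * np.abs(values) + 1e-12)
    problem = (pints.SingleOutputProblem(model, times, values)
               if values.ndim == 1
               else pints.MultiOutputProblem(model, times, values))
    score = pints.SumOfSquaresError(problem)
    for method in optimisers:
        x0 = real * 1.1  # perturbed starting point (assumption)
        opt = pints.OptimisationController(score, x0, method=method)
        opt.set_log_to_screen(False)
        x_best, f_best = opt.run()
        results[(type(model).__name__, method.__name__)] = f_best

print(results)
```

The `results` dict (model, optimiser) -> best score is exactly the matrix a heat map needs; per-run statistics and execution times could be stored alongside it in the same loop.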