
Make ML config files more readable and add comments. #2455

Merged: 5 commits merged into cta-observatory:main from verbose_ml_configs on Nov 23, 2023

Conversation

LukasBeiske (Contributor):

I also added some more config options with their default values to make users aware of their existence.

```yaml
n_estimators: 10
max_depth: 10
n_jobs: -1
```

```diff
-log_target: True
+log_target: True # If true, norm(disp) models predict log(norm(disp)) (output = exp(prediction))
```
Contributor:


I'm not sure I understand what the last statement in parentheses means: it might be trying to say that scikit-learn's default is that the model's output gets wrapped by exp() before being passed to the user, though I'm not sure that is true...

LukasBeiske (Contributor, Author), Nov 15, 2023:


Ah no, this is not related to scikit-learn; we do this ourselves if log_target = True. I just want to say that the model outputs the correct values even if log_target = True.

I'm not sure if this is obvious, but writing a short sentence is probably better than this parenthetical.
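For illustration, such a sentence could sit right next to the option as a config comment. This is only a sketch; the DispReconstructor section name and the exact wording are assumptions, not taken from this PR:

```yaml
DispReconstructor:
  # The models are trained on log(norm(disp)) internally; the
  # prediction is transformed back with exp() before it is written out,
  # so users always receive norm(disp) directly.
  log_target: True
```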

Member:


I don't think it's a good idea to document the options in the config files; they are already available in the documentation:
https://ctapipe.readthedocs.io/en/latest/api/ctapipe.reco.DispReconstructor.html#ctapipe.reco.DispReconstructor.log_target

Duplicating this by hand carries the danger of the two getting out of sync.

Contributor:


@LukasBeiske I think Max has a good point that we shouldn't duplicate the existing documentation in the example config files.

Instead, I suggest you open a PR that updates DispReconstructor with the docstrings you suggest here, as I find them more useful.

LukasBeiske (Contributor, Author):


Ok, I can see that. This extends to all the config options of the ML reconstructor classes and the stereo combiner options as well, doesn't it?

I noticed that some docstrings are missing in DispReconstructor and the other ML-related classes, so I will put all of that in a separate PR 👍

LukasBeiske (Contributor, Author):


In that case, should I also remove the options for which I am only setting the default values here (e.g. prefix and StereoMeanCombiner.weights)?

LukasBeiske (Contributor, Author):


See #2456

Contributor:


> In that case, should I also remove the options for which I am only setting the default values here (e.g. prefix and StereoMeanCombiner.weights)?

I think for the example config it makes more sense to change disp to something custom (maybe "elite" ;)) as an example for the reader. The stereo combiner weights are something you choose for more principled reasons, so maybe remove them.
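That is, something along these lines; a sketch only, where "elite" is this comment's joking placeholder and the section name is assumed:

```yaml
DispReconstructor:
  prefix: "elite"  # custom prefix instead of the default "disp"
```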

Member:


I don't think we should change the prefixes by default. For most purposes, we should stick to the default prefixes (and properly define them).

Otherwise, we'll end up with different prefixes / data formats for the same things done by different users (or with all of them using the same one we now defined as the "new default" in the config file).

I'd say comment out the prefix completely and add a comment like "add a prefix here if you e.g. want to compare the same method applied with different settings".
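In config terms, roughly this sketch (the section name and indentation are assumed; only the commented-out option and the hint text come from this suggestion):

```yaml
DispReconstructor:
  # Add a prefix here if you e.g. want to compare the same method
  # applied with different settings:
  # prefix: "disp_variant"  # hypothetical placeholder value
```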

LukasBeiske (Contributor, Author):


> I think for the example config it makes more sense to change disp to something custom (maybe "elite" ;)) as an example for the reader. The stereo combiner weights are something you choose for more principled reasons, so maybe remove them.

I removed the default value for the weights and commented it out, just like prefix. I think leaving it in as a comment like this makes it a bit more obvious how one can set the options of StereoMeanCombiner or any other future stereo combiner.
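Schematically, the resulting example config then carries both options as commented-out hints. A sketch only; the nesting of StereoMeanCombiner under the reconstructor and the placeholder values are assumptions, not copied from the PR:

```yaml
DispReconstructor:
  # prefix: "disp"  # set e.g. to compare the same method applied with different settings
  StereoMeanCombiner:
    # weights: "none"  # placeholder; see the StereoMeanCombiner docs for the available schemes
```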

codecov bot commented Nov 17, 2023:

Codecov Report

All modified and coverable lines are covered by tests ✅

Comparison is base (49ab82b) 60.60% compared to head (933c22e) 60.60%.
Report is 2 commits behind head on main.

Additional details and impacted files
```
@@           Coverage Diff           @@
##             main    #2455   +/-   ##
=======================================
  Coverage   60.60%   60.60%
=======================================
  Files           3        3
  Lines          33       33
=======================================
  Hits           20       20
  Misses         13       13
```

maxnoe merged commit 2e1011b into cta-observatory:main on Nov 23, 2023
14 checks passed
LukasBeiske deleted the verbose_ml_configs branch on November 23, 2023 at 15:58