Extra srcdep parameters and limit the offset of gamma MC for training #1077

SeiyaNozaki · 2023-02-13T16:14:09Z

This PR implements extra two parameters for source-dep analysis defined in the following slides.
Any comments are welcome, especially the naming of the parameters...!
https://indico.cta-observatory.org/event/4631/contributions/38096/attachments/23113/33203/20230213_srcdep_update.pdf

In addition, this PR changes the default value of `src_r_cut of MC gamma for the RF training.

…rectness) for source-dep analysis and limit the offset of gamma MC for the RF training

maxnoe · 2023-02-13T16:23:45Z

lstchain/reco/dl1_to_dl2.py

@@ -478,10 +478,13 @@ def build_models(filegammas, fileprotons,
        # Apply the temporary disp norm regressor and sign classifier to the test set
        disp_norm = tmp_reg_disp_norm.predict(test[config['disp_regression_features']])
        disp_sign = tmp_cls_disp_sign.predict(test[config['disp_classification_features']])
+        disp_sign_proba = tmp_cls_disp_sign.predict_proba(test[config['disp_classification_features']])


There is no need to call predict and predict proba, that's wasteful. Since we now want the proba, you should change the disp_sign calculation to use the proba to not evaluate the random forest twice:

col = list(tmp_cls_disp_sign.classes_).index(1) disp_sign = np.where(disp_sign_proba[:, col] > 0.5, 1, -1)

Thanks for the comment! Actually, I just copy & paste the codes of gammaness computation, so this part can also be updated to reduce computation time
https://github.com/cta-observatory/cta-lstchain/blob/master/lstchain/reco/dl1_to_dl2.py#L652-L653

SeiyaNozaki · 2023-02-24T13:44:09Z

CI failed since only a single event survives after off-axis cut (src_r < 1 degree) ...
Is it possible to include several low off-axis events in the test MC gamma dataset?

rlopezcoto · 2023-02-27T06:04:19Z

CI failed since only a single event survives after off-axis cut (src_r < 1 degree) ... Is it possible to include several low off-axis events in the test MC gamma dataset?

I think it was @maxnoe the one creating those datasets, right? Would it be possible to create one including more inner events now that we have seen an improvement in the performance of src-dependent analysis?
Otherwise we could modify the tests, @SeiyaNozaki aren't there any events surviving because of the cuts (just relax them) or overall without applying any additional constraints?

rlopezcoto · 2023-05-04T15:10:25Z

so @maxnoe would it be possible to get a different test dataset with more off-axis events or shall we change the tests for them to pass after these conditions are applied?

…to zeros) to keep events after src_r cut)

…ining step

maxnoe · 2023-05-10T07:53:53Z

so @maxnoe would it be possible to get a different test dataset with more off-axis events or shall we change the tests for them to pass after these conditions are applied?

No sure what simulations you are talking about, but I can't remember creating any.

If you need a new small test file with specific settings, you should probably ask the simulation team to create it, like I did here:
cta-observatory/lst-sim-config#51

SeiyaNozaki · 2023-05-10T07:57:41Z

ah I see Max's comment now, but now I prepared fake DL1 MC gamma file (like fake proton MC) by changing src_x and src_y to zeros to keep events after src_r cut. So now it should work with the current test data.

codecov · 2023-05-10T08:15:50Z

Codecov Report

Patch coverage: 100.00% and project coverage change: +0.10 🎉

Comparison is base (8fb5cb5) 74.02% compared to head (f8c3347) 74.12%.

Additional details and impacted files

@@            Coverage Diff             @@
##           master    #1077      +/-   ##
==========================================
+ Coverage   74.02%   74.12%   +0.10%     
==========================================
  Files         123      123              
  Lines       11869    11915      +46     
==========================================
+ Hits         8786     8832      +46     
  Misses       3083     3083

Impacted Files	Coverage Δ
lstchain/conftest.py	`97.77% <100.00%> (+0.19%)`	⬆️
lstchain/io/tests/test_config.py	`100.00% <100.00%> (ø)`
lstchain/reco/dl1_to_dl2.py	`81.66% <100.00%> (+0.76%)`	⬆️
lstchain/reco/tests/test_utils.py	`100.00% <100.00%> (ø)`
lstchain/reco/utils.py	`76.38% <100.00%> (+0.86%)`	⬆️
lstchain/scripts/tests/test_lstchain_scripts.py	`99.65% <100.00%> (+<0.01%)`	⬆️

☔ View full report in Codecov by Sentry.
📢 Do you have feedback about the report comment? Let us know in this issue.

SeiyaNozaki added 2 commits February 13, 2023 17:02

add additional parameters (reco_disp_norm_diff and reco_disp_sign_cor…

4321660

…rectness) for source-dep analysis and limit the offset of gamma MC for the RF training

bugfix

dc38f5b

maxnoe reviewed Feb 13, 2023

View reviewed changes

compute disp_sign using disp_sign_proba

36e95ad

SeiyaNozaki added 4 commits May 9, 2023 17:11

update pytest (prepare fake gamma dl1 file (src_x, src_y are changed …

474510b

…to zeros) to keep events after src_r cut)

make a function to apply src_r cut and call this function in a RF tra…

ef1da9b

…ining step

add a pytest of apply_src_r_cut function

649ec0d

merge master after solving conflict

f8c3347

rlopezcoto approved these changes May 11, 2023

View reviewed changes

rlopezcoto merged commit 33446c8 into master May 11, 2023

SeiyaNozaki deleted the extra_srcdep_params branch May 11, 2023 11:52

SeiyaNozaki mentioned this pull request Feb 9, 2024

Update dl1 to dl2 script (Less required memory and less required step) #1224

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Extra srcdep parameters and limit the offset of gamma MC for training #1077

Extra srcdep parameters and limit the offset of gamma MC for training #1077

SeiyaNozaki commented Feb 13, 2023

maxnoe Feb 13, 2023 •

edited

Loading

SeiyaNozaki Feb 24, 2023

SeiyaNozaki commented Feb 24, 2023

rlopezcoto commented Feb 27, 2023

rlopezcoto commented May 4, 2023

maxnoe commented May 10, 2023

SeiyaNozaki commented May 10, 2023

codecov bot commented May 10, 2023

Extra srcdep parameters and limit the offset of gamma MC for training #1077

Extra srcdep parameters and limit the offset of gamma MC for training #1077

Conversation

SeiyaNozaki commented Feb 13, 2023

maxnoe Feb 13, 2023 • edited Loading

Choose a reason for hiding this comment

SeiyaNozaki Feb 24, 2023

Choose a reason for hiding this comment

SeiyaNozaki commented Feb 24, 2023

rlopezcoto commented Feb 27, 2023

rlopezcoto commented May 4, 2023

maxnoe commented May 10, 2023

SeiyaNozaki commented May 10, 2023

codecov bot commented May 10, 2023

Codecov Report

maxnoe Feb 13, 2023 •

edited

Loading