ENH: add find_empty_room step #629

larsoner · 2022-10-12T19:14:58Z

@agramfort experienced some "caching slowdowns" of unknown origin. When I ran a dataset locally, I also noticed the maxwell_filter step taking 5-10 sec per subject just to say "cached, skipping".

I tracked this down to the filename resolution step of scripts/preprocessing/_01_maxfilter::get_input_fnames_maxwell_filter, specifically this line:

in_files["raw_er"] = ref_bids_path.find_empty_room()

This is because, in order to find the empty room file, mne-bids will at least sometimes resort to reading info from all of the empty room files to find the one with the closest measurement date. (I think ideally this would be stored during the BIDS-ification step, but it's not always done, and we should accommodate these use cases if we can.)
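
As a rough illustration of the closest-date matching described above, here is a minimal sketch. It is not the mne-bids implementation: the function name `closest_empty_room`, the file names, and the dates are all made up for the example. The selection itself is cheap; the cost in the slow path comes from having to open every empty-room file just to build the table of measurement dates.

```python
from datetime import datetime, timezone


def closest_empty_room(ref_date, er_dates):
    """Return the name of the empty-room recording closest in time to ref_date.

    er_dates maps a (hypothetical) file name to its measurement date.
    Returns None when there are no candidates.
    """
    if not er_dates:
        return None
    # Pick the candidate with the smallest absolute time difference.
    return min(er_dates, key=lambda name: abs(er_dates[name] - ref_date))


# Toy data: made-up file names and measurement dates.
ref = datetime(2022, 10, 12, 9, 0, tzinfo=timezone.utc)
candidates = {
    "sub-emptyroom_ses-20221010_task-noise_meg.fif": datetime(
        2022, 10, 10, 8, 0, tzinfo=timezone.utc
    ),
    "sub-emptyroom_ses-20221012_task-noise_meg.fif": datetime(
        2022, 10, 12, 7, 30, tzinfo=timezone.utc
    ),
    "sub-emptyroom_ses-20221015_task-noise_meg.fif": datetime(
        2022, 10, 15, 8, 0, tzinfo=timezone.utc
    ),
}
best = closest_empty_room(ref, candidates)
```

In a real dataset, filling `candidates` would mean reading the header of each empty-room recording, which is presumably where the 5-10 sec per subject goes.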

To avoid this, I propose we add a scripts/init/_01_match_empty_room.py file that does this once ahead of time for datasets that need it, and then saves the match in some new file. This step can be cached just like all the others, and should only need to be run once per dataset (unless the raw data actually changes, which fortunately sane caching logic should take care of for us).

The only open question in this implementation is: where can we store the mapping from subject->empty room file? Can we create a new sidecar file somewhere? @hoechenberger

larsoner · 2022-10-12T21:57:36Z

> The only open question in this implementation is: where can we store the mapping from subject->empty room file? Can we create a new sidecar file somewhere? @hoechenberger

... From working a bit on #631 I'm thinking that we should just write a .json for each subject that gives the mapping to the empty-room file, if relevant. Then based on this:

$ git grep find_empty_room
mne_bids_pipeline/scripts/preprocessing/_01_maxfilter.py:            in_files["raw_er"] = ref_bids_path.find_empty_room()
mne_bids_pipeline/scripts/preprocessing/_02_frequency_filter.py:                in_files["raw_er"] = ref_bids_path.find_empty_room()

We can just modify the in_files for these two steps to get these from the JSON file (when relevant).
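
A minimal sketch of what such a per-subject JSON could look like, assuming a hypothetical file layout and hypothetical helper names (`write_empty_room_match`, `read_empty_room_match`); none of these are taken from the pipeline itself. The idea is that the slow lookup runs once, its result is written under the derivatives root, and later steps only read the small JSON file.

```python
import json
import tempfile
from pathlib import Path


def _match_path(deriv_root, subject):
    # Hypothetical location and file name for the stored match.
    return Path(deriv_root) / f"sub-{subject}" / f"sub-{subject}_emptyroommatch.json"


def write_empty_room_match(deriv_root, subject, raw_er_fname):
    """Store the subject -> empty-room match in a small JSON file."""
    out_path = _match_path(deriv_root, subject)
    out_path.parent.mkdir(parents=True, exist_ok=True)
    # None means "no empty-room recording applies to this subject".
    payload = {"fname": None if raw_er_fname is None else str(raw_er_fname)}
    out_path.write_text(json.dumps(payload, indent=2))
    return out_path


def read_empty_room_match(deriv_root, subject):
    """Return the stored empty-room file name, or None if none was matched."""
    return json.loads(_match_path(deriv_root, subject).read_text())["fname"]


# Usage: write once in an init step, then read when building in_files.
with tempfile.TemporaryDirectory() as tmp:
    write_empty_room_match(
        tmp, "01", "sub-emptyroom_ses-20221012_task-noise_meg.fif"
    )
    in_files = {}
    raw_er = read_empty_room_match(tmp, "01")
    if raw_er is not None:
        in_files["raw_er"] = raw_er
```

Storing an explicit `null` for subjects without a match lets the reading side distinguish "no empty-room file" from "match step not run yet" (the latter would raise because the JSON file is missing).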

agramfort · 2022-10-13T10:49:37Z

It's related to mne-tools/mne-bids#795, so there is a way to store this in BIDS to speed up the process.

hoechenberger · 2022-10-13T10:58:06Z

Problem is we don't want to modify the input dataset

agramfort · 2022-10-13T11:05:39Z

true but at least it can be fast if sidecar is complete

larsoner · 2022-10-13T11:10:08Z

> true but at least it can be fast if sidecar is complete

> Problem is we don't want to modify the input dataset

And if the sidecar is not complete -- as seems to be the case for my dataset, and at least one of yours -- I propose we just store the match in a little JSON file (EDIT: in the mne-bids-pipeline derivatives). Then we get good speed in all cases without modifying the root dataset, or requiring people to update their root dataset.

hoechenberger · 2022-10-13T11:15:03Z

> true but at least it can be fast if sidecar is complete

Yes but we already are because we're using the sidecar if it's present

Check the use_sidecar_only parameter :)

larsoner · 2022-10-13T15:04:15Z

From a brief chat with @agramfort he's okay with adding a step to store the matching in our own file, so I'll make a PR for that. Hopefully this can be done jointly with simplifications from #632 after #631 is merged, since #631 sets up the _io.py that I'll want to use, and #632 would greatly simplify/streamline the logic inside the freq filter and maxfilter functions that I'm going to be messing with anyway.

hoechenberger · 2022-10-13T15:39:16Z

Sounds good!

larsoner · 2022-10-13T16:15:38Z

I decided just to put the #632 logic changes in #633 for simplicity. I'll add the new find_empty_room step once #633 and #631 are in.

larsoner mentioned this issue Oct 14, 2022

ENH: Add find_empty_room step #634

Merged

hoechenberger closed this as completed in #634 Oct 16, 2022
