Remove unnecessary batching functionality from functions #718

alex-l-kong · 2022-09-20T19:42:57Z

Is your feature request related to a problem? Please describe.

We've found that in some cases, batching exists solely for the purpose of loading more image data in at once. We later loop over each FOV in that batch anyways, which doesn't add any significant optimization.

This additionally causes issues with cohorts with different image sizes, because loading a batch into a 1024x1024 or 2048x2048 array will cause dimension errors.

Describe the solution you'd like

We've already removed batching from generate_deepcell_input, the following functions also need to be modified in this regard:

data_utils.generate_and_save_pixel_cluster_masks (and by extension, data_utils.generate_pixel_cluster_mask)
data_utils.generate_and_save_cell_cluster_masks (and by extension, data_utils.label_cells_by_cluster)
marker_quantification.generate_cell_table
spatial_analysis.batch_channel_spatial_enrichment and spatial_analysis.batch_cluster_spatial_enrichment: talked to @ackagel about this, we can condense these to a per-FOV basis with negligible speed difference
spatial_analysis.create_neighborhood_matrix (and by extension, the neighborhood analysis notebook): currently, this process requires precomputing all the distance matrices prior to running then function, then an additional per-FOV loop during neighborhood analysis. We should condense all this down to one per-FOV loop in the neighborhood matrix function.
spatial_analysis_utils.calc_dist_matrix: after the neighborhood analysis process is updated, the dist matrix function will no longer be receiving a batch of FOVs to process over, so it should be condensed to process just one FOV.

The text was updated successfully, but these errors were encountered:

ngreenwald · 2022-09-21T01:12:36Z

Looks good, free to proceed one at a time as we discussed, or whatever feels like the right amount for one PR

alex-l-kong added the enhancement New feature or request label Sep 20, 2022

alex-l-kong self-assigned this Sep 20, 2022

alex-l-kong mentioned this issue Sep 26, 2022

Remove batching functionality from cell and pixel mask generation #727

Merged

alex-l-kong mentioned this issue Oct 13, 2022

Update calc_dist_matrix so it preprocesses prior to running each type of spatial analysis #770

Closed

3 tasks

alex-l-kong mentioned this issue Nov 3, 2022

Refactor distance matrix saving #803

Merged

ngreenwald closed this as completed in #803 Nov 9, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Remove unnecessary batching functionality from functions #718

Remove unnecessary batching functionality from functions #718

alex-l-kong commented Sep 20, 2022 •

edited

Loading

ngreenwald commented Sep 21, 2022

Remove unnecessary batching functionality from functions #718

Remove unnecessary batching functionality from functions #718

Comments

alex-l-kong commented Sep 20, 2022 • edited Loading

ngreenwald commented Sep 21, 2022

alex-l-kong commented Sep 20, 2022 •

edited

Loading