Temporarily remove pixel preprocessing normalization for Candace's paper #913

alex-l-kong · 2023-02-14T21:20:24Z

What is the purpose of this PR?

@cliu72 will need to remove channel_norm_df normalization and pixel_thresh from the pixel preprocessing step of Pixie.

How did you implement your changes

Explicitly remove both of these processes.

Remaining issues

See discussion thread.

…essing

alex-l-kong · 2023-02-14T21:21:48Z

@cliu72 is it fine to leave in the computations of channel_norm_df and pixel_thresh_df as long as they're not used for preprocessing, or would the reviewers need all references to these completely purged?

… temp_norm_purge

…g to MapDataToNodes

cliu72 · 2023-02-16T00:54:35Z

@cliu72 is it fine to leave in the computations of channel_norm_df and pixel_thresh_df as long as they're not used for preprocessing, or would the reviewers need all references to these completely purged?

I think it's okay to leave in the computations to make it easier for later, but can we remove writing those files? The files with the 99.9% value and the pixel threshold value. I think having those files written will be confusing.

…sh feather files

alex-l-kong · 2023-02-17T00:02:49Z

@cliu72 we're aiming to get this PR merged in by Friday so we can close out as many open Pixie issues as possible before your submission.

cliu72

Looks good!

ngreenwald

Lets wait to merge this in until all of the other PRs are merged in, as well as after all the testing candace does, to make sure that undoing this is as simple as possible.

alex-l-kong · 2023-03-02T20:13:56Z

@ngreenwald @cliu72 with the overwrite branch now merged into main, I think that concludes all the dependencies for this task. If we're ready, we can merge this PR in and tag this as the release for Candace's paper.

cliu72 · 2023-03-02T21:43:10Z

I'm going to do one more check over everything, which I'll do hopefully today/tomorrow. I think we can wait to merge this PR then after that.

ngreenwald

Merge conflicts with the refactoring branch. Once those are resolved this is good to go.

ngreenwald

Not sure what happened here, but none of the changes are present anymore

alex-l-kong · 2023-03-13T22:54:28Z

Oh that's weird, let me fix that up again.

alex-l-kong · 2023-03-13T23:12:17Z

Looks like GitHub had trouble mapping the changes into pixie_preprocessing.py. Should be good to go now!

cliu72 · 2023-03-13T23:57:26Z

I don't think all the changes have been re-implemented after the merge conflicts. These lines should be gone (were removed in the original review):

ark-analysis/src/ark/phenotyping/pixie_preprocessing.py

Lines 68 to 70 in 8efc712

    
           # remove any rows with channels with a sum below the threshold 
        
           rowsums = pixel_mat[channels].sum(axis=1) 
        
           pixel_mat = pixel_mat.loc[rowsums > pixel_thresh_val, :].reset_index(drop=True)

ark-analysis/src/ark/phenotyping/pixie_preprocessing.py

Lines 152 to 157 in 8efc712

    
           # create vector for normalizing image data 
        
           norm_vect = channel_norm_df.iloc[0].values 
        
           norm_vect = np.array(norm_vect).reshape([1, 1, len(norm_vect)]) 
        
           # normalize image data 
        
           img_data = img_data / norm_vect

After thinking on it some more, I think it would be best to remove these too (to avoid confusion):

ark-analysis/src/ark/phenotyping/pixie_preprocessing.py

Lines 258 to 284 in 8efc712

    
           # define path to channel normalization values 
        
           channel_norm_path = os.path.join( 
        
               base_dir, pixel_output_dir, 'channel_norm.feather' 
        
           ) 
        
           # define path to pixel normalization values 
        
           pixel_thresh_path = os.path.join( 
        
               base_dir, pixel_output_dir, 'pixel_thresh.feather' 
        
           ) 
        
           # reset entire cohort if channels provided are different from ones in existing channel_norm 
        
           if os.path.exists(channel_norm_path): 
        
               channel_norm_df = feather.read_dataframe(channel_norm_path) 
        
               if set(channel_norm_df.columns.values) != set(channels): 
        
                   print("New channels provided: overwriting whole cohort") 
        
                   # delete the existing data in data_dir and subset_dir 
        
                   rmtree(os.path.join(base_dir, data_dir)) 
        
                   os.mkdir(os.path.join(base_dir, data_dir)) 
        
                   rmtree(os.path.join(base_dir, subset_dir)) 
        
                   os.mkdir(os.path.join(base_dir, subset_dir)) 
        
                   # delete the existing channel_norm.feather and pixel_thresh.feather 
        
                   os.remove(channel_norm_path) 
        
                   os.remove(pixel_thresh_path)

ark-analysis/src/ark/phenotyping/pixie_preprocessing.py

Lines 322 to 347 in 8efc712

    
           # load existing channel_norm_path if exists, otherwise generate 
        
           if not os.path.exists(channel_norm_path): 
        
               # compute channel percentiles 
        
               channel_norm_df = pixel_cluster_utils.calculate_channel_percentiles( 
        
                   tiff_dir=tiff_dir, 
        
                   fovs=fovs, 
        
                   channels=channels, 
        
                   img_sub_folder=img_sub_folder, 
        
                   percentile=channel_percentile 
        
               ) 
        
           else: 
        
               # load previously generated output 
        
               channel_norm_df = feather.read_dataframe(channel_norm_path) 
        
           # load existing pixel_thresh_path if exists, otherwise generate 
        
           if not os.path.exists(pixel_thresh_path): 
        
               # compute pixel percentiles 
        
               pixel_thresh_val = pixel_cluster_utils.calculate_pixel_intensity_percentile( 
        
                   tiff_dir=tiff_dir, fovs=fovs, channels=channels, 
        
                   img_sub_folder=img_sub_folder, channel_percentiles=channel_norm_df 
        
               ) 
        
               pixel_thresh_df = pd.DataFrame({'pixel_thresh_val': [pixel_thresh_val]}) 
        
           else: 
        
               pixel_thresh_df = feather.read_dataframe(pixel_thresh_path) 
        
               pixel_thresh_val = pixel_thresh_df['pixel_thresh_val'].values[0]

Remove channel_norm_df and pixel_thresh_val as inputs to preprocess_fov

alex-l-kong · 2023-03-14T00:18:50Z

@cliu72 OK this should do the trick.

cliu72

Looks good!

Temporarily purge certain normalization steps for Pixie pixel preproc…

8cc13f0

…essing

alex-l-kong self-assigned this Feb 14, 2023

alex-l-kong added 8 commits February 14, 2023 14:18

Pin pyFlowSOM to 0.1.14

f109a43

Move back to 0.1.12 (this will be addressed by a different PR)

beedc79

Never mind, pin to 0.1.14

e6fdd1e

Remove deterministic flag for testing

6bdac58

Pin pyFlowSOM at 0.1.13

dfde9c0

pyFlowSOM.som requires dtype float64, explicitly cast

bf38338

Merge branch 'main' of https://github.com/angelolab/ark-analysis into…

fdc232e

… temp_norm_purge

Explicitly cast weights and external data to np.float64 before passin…

499b03a

…g to MapDataToNodes

alex-l-kong and others added 5 commits February 15, 2023 17:21

Remove saving the channel and pixel feather files

8a06816

Adjust tests so they don't need to handle channel_norm and pixel_thre…

3c47ed5

…sh feather files

Merge branch 'main' into temp_norm_purge

52a6eb3

Remove more commented code

20cff9c

Merge branch 'main' into temp_norm_purge

79ac387

alex-l-kong requested a review from cliu72 February 16, 2023 22:09

alex-l-kong added 2 commits February 16, 2023 14:56

Merge branch 'main' into temp_norm_purge

562b49f

Merge branch 'main' into temp_norm_purge

207e8ca

Merge branch 'main' into temp_norm_purge

c685d66

cliu72 approved these changes Feb 21, 2023

View reviewed changes

ngreenwald requested changes Feb 21, 2023

View reviewed changes

alex-l-kong added 2 commits February 23, 2023 14:04

Merge branch 'main' into temp_norm_purge

bdfc47c

Merge branch 'main' into temp_norm_purge

78cc23c

alex-l-kong requested a review from ngreenwald March 2, 2023 20:13

cliu72 mentioned this pull request Mar 12, 2023

Pixie notebook text changes #942

Merged

ngreenwald requested changes Mar 13, 2023

View reviewed changes

Merge branch 'main' into temp_norm_purge

7e5efde

alex-l-kong requested a review from ngreenwald March 13, 2023 22:22

ngreenwald requested changes Mar 13, 2023

View reviewed changes

Fix errors caused by GitHub merge tool

8efc712

alex-l-kong requested a review from ngreenwald March 13, 2023 23:12

ngreenwald requested a review from cliu72 March 13, 2023 23:39

alex-l-kong added 2 commits March 13, 2023 17:16

Remove channel_norm_df and pixel_thresh_val

c6751c3

Remove more references in pixie_preprocessing_test.py

96aa412

cliu72 approved these changes Mar 14, 2023

View reviewed changes

alex-l-kong merged commit 1059061 into main Mar 14, 2023

alex-l-kong deleted the temp_norm_purge branch March 14, 2023 00:56

alex-l-kong mentioned this pull request Apr 28, 2023

Re-add channel normalization and pixel thresholding to Pixie #980

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Temporarily remove pixel preprocessing normalization for Candace's paper #913

Temporarily remove pixel preprocessing normalization for Candace's paper #913

alex-l-kong commented Feb 14, 2023

alex-l-kong commented Feb 14, 2023

cliu72 commented Feb 16, 2023

alex-l-kong commented Feb 17, 2023

cliu72 left a comment

ngreenwald left a comment

alex-l-kong commented Mar 2, 2023 •

edited

Loading

cliu72 commented Mar 2, 2023

ngreenwald left a comment

ngreenwald left a comment

alex-l-kong commented Mar 13, 2023

alex-l-kong commented Mar 13, 2023

cliu72 commented Mar 13, 2023

alex-l-kong commented Mar 14, 2023

cliu72 left a comment

Temporarily remove pixel preprocessing normalization for Candace's paper #913

Temporarily remove pixel preprocessing normalization for Candace's paper #913

Conversation

alex-l-kong commented Feb 14, 2023

alex-l-kong commented Feb 14, 2023

cliu72 commented Feb 16, 2023

alex-l-kong commented Feb 17, 2023

cliu72 left a comment

Choose a reason for hiding this comment

ngreenwald left a comment

Choose a reason for hiding this comment

alex-l-kong commented Mar 2, 2023 • edited Loading

cliu72 commented Mar 2, 2023

ngreenwald left a comment

Choose a reason for hiding this comment

ngreenwald left a comment

Choose a reason for hiding this comment

alex-l-kong commented Mar 13, 2023

alex-l-kong commented Mar 13, 2023

cliu72 commented Mar 13, 2023

alex-l-kong commented Mar 14, 2023

cliu72 left a comment

Choose a reason for hiding this comment

alex-l-kong commented Mar 2, 2023 •

edited

Loading