Pix preprocessing #579
Conversation
@ngreenwald ok, I'll take a look at this.
@ngreenwald just tested it on Candace's dataset on my end, had to make one change to account for all-zero channels in
Okay cool, let me know once it's ready to look at.
Couple comments
ark/phenotyping/som_utils_test.py
provided_chans=chans)

# assert no rows sum to 0
assert np.all(sample_pixel_mat.loc[:, ['chan0', 'chan1']].sum(axis=1).values != 0)
Is there a more precise check we could add here to ensure that this addition is working? This test would pass with both the previous version and this version.
@ngreenwald because we're dividing by row sums, we can change this test to ensure that all rows sum to 1, which would test normalization with different pixel_norm_val parameters. Is this what you were thinking of, or something more specific?
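A minimal sketch of that row-sum check, using a hypothetical two-channel matrix (sample_pixel_mat here is a stand-in built inline, not the fixture from the test suite):

```python
import numpy as np
import pandas as pd

# hypothetical raw pixel matrix with two channels
raw = pd.DataFrame({'chan0': [1.0, 5.0, 2.0], 'chan1': [3.0, 5.0, 6.0]})

# divide each row by its row sum, as described in the comment above
sample_pixel_mat = raw.div(raw.sum(axis=1), axis=0)

# after normalization, every row should sum to 1
assert np.allclose(sample_pixel_mat.sum(axis=1).values, 1)
```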
I mean something that checks that passing pixel_norm_val is working as intended. For example, an expected decrease in the total number of pixels included in the df, or something like that.
@ngreenwald oh ok. I can do something along the lines of:

assert sample_pixel_mat.shape[0] < (sample_img_data.shape[0] * sample_img_data.shape[1])

This will assert that we actually generated fewer pixels in sample_pixel_mat than exist in sample_img_data.

The opposite test:

assert sample_pixel_mat.shape[0] == (sample_img_data.shape[0] * sample_img_data.shape[1])

would be good for the other 2 tests where no pixels are removed by pixel_norm_val.
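A self-contained toy version of the first assertion (everything here is a stand-in: a random image, and a simple bottom-5%-of-total-counts threshold standing in for the pixel_norm_val filtering):

```python
import numpy as np

rng = np.random.default_rng(0)
sample_img_data = rng.random((32, 32, 2))  # hypothetical rows x cols x channels image
flat = sample_img_data.reshape(-1, 2)      # one row per pixel
row_sums = flat.sum(axis=1)

# keep only pixels above the bottom 5% of total counts
# (a stand-in for the pixel_norm_val-based filtering in the PR)
sample_pixel_mat = flat[row_sums > np.quantile(row_sums, 0.05)]

# the proposed assertion: filtering actually removed some pixels
assert sample_pixel_mat.shape[0] < (sample_img_data.shape[0] * sample_img_data.shape[1])
```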
@ngreenwald just one clarification question, otherwise should be good to go.
@ngreenwald OK the above comment about testing pixel filtering with
Gonna test this out myself a bit more before merging it in to make sure nothing got missed.
Looks good, once @cliu72 approves I'll merge it in.
Just a few small comments. Otherwise looks good to me.
What is the purpose of this PR?
Adds in functionality to normalize each channel of image data separately prior to pixel clustering. This helps ensure that markers with different intensity ranges are treated equally right from the beginning of the clustering process.
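A sketch of what per-channel normalization could look like. The normalization statistic here (each channel's own 99.9th percentile) is an assumption for illustration; the actual statistic used in the PR may differ:

```python
import numpy as np

rng = np.random.default_rng(0)
# hypothetical image whose channels live on very different intensity scales
img_data = rng.random((64, 64, 3)) * np.array([1.0, 10.0, 100.0])

# per-channel normalization: divide each channel by its own high percentile
norm_vals = np.quantile(img_data.reshape(-1, 3), 0.999, axis=0)
img_norm = img_data / norm_vals

# after normalization, all channels share a comparable scale
```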
In addition, it changes the filtering from removing pixels with 0 total counts to removing pixels in the bottom 5% of total counts in the image. This better matches the format of the data following Rosetta, where there are very few true zeros.
Remaining issues
The testing I put together is very basic. @alex-l-kong, if you could go in and double-check that everything is working as intended, and add more tests if needed, that would be great. Also feel free to change the organization/saving structure if you think another layout would be better. I didn't add any new tests for create_pixel_matrix; that will likely need to be checked as well.