Pixel Subset in Pixie #1121

matthew-lee1 · 2024-02-27T19:58:18Z

matthew-lee1
Feb 27, 2024

Hello,

In your nature communications paper in the supplementary information, you show that 10% subset of pixels still works. I am curious if you tried other values as well such as 7.5%, 5%, etc. Asking because at 10% I run into memory errors (120k x 60k image), but I am able to run 5% subset successfully.

cliu72 · 2024-02-29T18:14:28Z

cliu72
Feb 29, 2024
Collaborator

Hi @matthew-lee1! Yes, we have tested subsetting lower than 10%. For around a billion pixels, we have done down to 1% and shown that results are still good. So for your use case, 5% should be fine. That being said, this can be specific to each dataset (i.e. how many markers you have, and more importantly how many phenotypes you would "expect" and how well represented these phenotypes are in your image). If you have certain markers that are super rare and aren't sampled in the 5% subset, you could run into problems with capturing those phenotypes. However, in our experience, for most well-designed panels, a random 5% subset is fine. The best way to evaluate is to look at the resulting pixel clusters with your markers and confirming that they reflect the underlying expression well. Hope this helps!

0 replies

matthew-lee1 · 2024-03-01T02:06:19Z

matthew-lee1
Mar 1, 2024
Author

Thanks so much! Additionally, if all follow up analyses will be cell-based, is it ok to cluster only on pixels within cell segmentation masks?

0 replies

cliu72 · 2024-03-02T20:20:39Z

cliu72
Mar 2, 2024
Collaborator

Yup, if you only care about cells, it's ok to cluster only on pixels within segmentation masks.

0 replies

matthew-lee1 · 2024-03-03T00:04:54Z

matthew-lee1
Mar 3, 2024
Author

Perfect thanks! Actually I've implemented and think I might've found a bug. In practice what I did was for any pixel labelled 0 on the segmentation mask (any non-cell pixel), I set all of the channels expression to 0. I did this since the code already filters out for pixels that are all 0 and thought this would be the easiest. What I found was that for FOVs where there were no pixels being used (which could happen either because of no cells OR if all pixels sum to 0 in that FOV, so not just my particular use case), the .feather file written out would include a 0 for the pixel_som_cluster. Later on, this would introduce a new "cluster" of index 0 (a 101th cluster), which I'm sure you know causes problem downstream since everything should be 1 indexed. My solution was to include at least 1 pixel from every FOV even if they all sum to 0.

0 replies

cliu72 · 2024-03-04T21:21:14Z

cliu72
Mar 4, 2024
Collaborator

Thanks for catching that. It makes sense that it has never come up for us because all of our images are non-zero. Because we typically include all pixels (not just those within cells), we never have empty images. Your solution sounds like it works fine, but another solution is to just manually exclude all images that are 0. You can change the list of FOVs with the fovs parameter in the notebook (default is to include all fovs, but you can change it).

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Pixel Subset in Pixie #1121

{{title}}

Replies: 5 comments

{{title}}

{{title}}

{{title}}

{{title}}

{{title}}

Select a reply

Pixel Subset in Pixie #1121

matthew-lee1 Feb 27, 2024

Replies: 5 comments

cliu72 Feb 29, 2024 Collaborator

matthew-lee1 Mar 1, 2024 Author

cliu72 Mar 2, 2024 Collaborator

matthew-lee1 Mar 3, 2024 Author

cliu72 Mar 4, 2024 Collaborator

matthew-lee1
Feb 27, 2024

cliu72
Feb 29, 2024
Collaborator

matthew-lee1
Mar 1, 2024
Author

cliu72
Mar 2, 2024
Collaborator

matthew-lee1
Mar 3, 2024
Author

cliu72
Mar 4, 2024
Collaborator