1566 shape feature extract #381

raylim · 2023-07-31T21:55:36Z

No description provided.

armaank · 2023-08-03T14:15:21Z

This looks very good, good call on computing log ratios

A few items to address:

You can remove all features extracted for Detection probability, these are unneeded and can be dropped from the fx extraction phase
You can remove features associated w/ unclassified regions, those can be dropped from the fx extraction phase
The whole slide features are redundant in this output format, since they are the same no matter the cell type, so we have double the number of features for whole slide features
A lot of the whole slide features we probably don't need, here is a list of features that don't need to be extracted: anything related to bounding boxes (bbox), anything related to centroids, anything related to inertia tensors, whole slide label, anything related to moments, whole slide area in microns (we already have whole slide area measured in pixels, and it kind of makes sense to keep all of our measurements in pixels rather than just converting area to units of microns)

I also think it makes sense to restructure the output format. Even though it'll be a little wide, it's more convenient for downstream use to have all of these features in a single column, so the columns will be structured as variable_parent_class . That way for an entire cohort we can concat all of the single-row results into a dataframe suitable for downstream use in a scikit-learn or pytorch model.

tomp

Approved

chore: 377, 375, 374, 379, 382 package versions feat: extract_tile_shape_features: add options for limiting variables in final output, clean up output a bit feat: add extra features to extract_tile_shape_features

This is just Armaan's original notebook, updated for the current state of luna. It still uses the CLIs.

1. Added function to pull output file names from metadata.yml files. 2. Cast Path objects to strings in a couple of places. 3. Check for both `aperio.MPP` and `openslide.mpp-x` in the slide image properties, and log a warning if neither is found. 4. Added new notebook to the tutorial docs 5. Renamed notebook to spatial_stats.ipynb

This lets us run a docs server on one of the compute servers and view the docs site on our laptops.

raylim requested a review from armaank July 31, 2023 21:55

tomp self-requested a review August 24, 2023 15:33

tomp approved these changes Aug 24, 2023

View reviewed changes

raylim and others added 7 commits August 24, 2023 09:21

feat: dsa_viz extract joint object tile

8bbc729

build: update shapely

2c1d520

feat: extract tile shape features cli

e2d3fd6

chore: 377, 375, 374, 379, 382 package versions feat: extract_tile_shape_features: add options for limiting variables in final output, clean up output a bit feat: add extra features to extract_tile_shape_features

Added spatial analysis tutorial notebook

b9cd41d

This is just Armaan's original notebook, updated for the current state of luna. It still uses the CLIs.

Allow non-local connections to the docs server

6a6b3e4

This lets us run a docs server on one of the compute servers and view the docs site on our laptops.

tests: update test_extract_shape_features

0cfc6be

raylim force-pushed the 1566-shape-feature-extract branch from 665c877 to 0cfc6be Compare August 24, 2023 16:23

raylim merged commit 7f37682 into dev Aug 24, 2023
2 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

1566 shape feature extract #381

1566 shape feature extract #381

raylim commented Jul 31, 2023

armaank commented Aug 3, 2023

tomp left a comment

1566 shape feature extract #381

1566 shape feature extract #381

Conversation

raylim commented Jul 31, 2023

armaank commented Aug 3, 2023

tomp left a comment

Choose a reason for hiding this comment