Formatting tool for user friendly converting to spaceTX format #1318

shanaxel42 · 2019-05-07T19:06:17Z

Define file name convention
Have tile fetcher example that works with above convention
Document in readthedocs

ttung · 2019-05-17T21:04:28Z

Proposal

files are to be named:

<image_type>-f<fov_number>-r<round_number>-c<ch_number>-z<zplane_number>.<image_extension>

image_type is like primary, dots, etc.
*_number: self explanatory
image_extension: has to be one of the supported image extensions, case insensitive

coordinates are to be read from a specified csv file, where the columns are filename, xmin, xmax, ymin, ymax, zmin, zmax.

images where num_zplanes==1 should declare num_zplane to be any value they desire, and assign NaN to the z coordinates.

@njmei

Need this for the generalized experiment formatter (#1318), and @njmei needs it for #1322. Test plan: `pytest -v -n4 starfish/core/experiment/builder/test/`

@njmei

Need this for the generalized experiment formatter (#1318), and @njmei needs it for #1322. Test plan: `pytest -v -n4 starfish/core/experiment/builder/test/`

ttung · 2019-06-06T01:59:09Z

codebook is still the hardest part of this, IMO...

ambrosejcarr · 2019-06-06T02:18:29Z

There are a few formalisms for codebooks that we might want to support conversion from:

csv where each row is <gene>,ACCGTC where the position of each nucleotide is taken to be a sequential round, and a mapping is provided from nucleotide to channel.

csv where each row is <gene>,0010134 where the position of each number represents a sequential round, and the numbers are taken to be the channels.

ambrosejcarr · 2019-06-06T02:23:41Z

I don't have a good solution for codebooks that aren't 1-hot. Fortunately 1-hot appears to be very common.

ttung · 2019-06-06T17:12:01Z

I would prefer just doing:

<target>,r0_c0, r0_c1, ...,  rn_cn
SCUBE2,0,1, ..., 0

neuromusic · 2019-06-06T20:22:31Z

@ttung so each column defines a coordinate?

I would find the tidy version more intuitive, where each row corresponds to a single value in the codebook...

target,round,channel,value
SCUBE2,0,0,1
SCUBE2,0,1,1
BRCA,0,0,1
BRCA,1,1,1
ACTB,0,1,1
ACTB,1,0,1

the xarray to_dataframe() method would return the data formatted something like this (except with zero values, as well). see http://xarray.pydata.org/en/stable/pandas.html#dataset-and-dataframe

neuromusic · 2019-06-06T21:04:41Z

example roundtrip from codebook to csv and back (technically to xarray on the return) https://gist.github.com/neuromusic/87267d7e20279585517c8cd46a0c5601

ttung · 2019-06-06T21:48:38Z

so each column defines a coordinate?

correct. for one-hot encodings, it's very easy to sanity check (sums across rows or columns should always be 1)

I would find the tidy version more intuitive, where each row corresponds to a single value in the codebook...

that's a lot more "rows". riskier to get it wrong, potentially? not sure.

shanaxel42 added this to the SpaceTX milestone May 7, 2019

shanaxel42 assigned ttung May 7, 2019

shanaxel42 mentioned this issue May 7, 2019

Intuitive SpaceTx format conversion tools #1286

Closed

shanaxel42 added the feature label May 7, 2019

shanaxel42 unassigned ttung May 10, 2019

ttung self-assigned this May 16, 2019

ttung pushed a commit that referenced this issue May 24, 2019

Add the ability to write labeled experiments

0602e53

Need this for the generalized experiment formatter (#1318), and @njmei needs it for #1322. Test plan: `pytest -v -n4 starfish/core/experiment/builder/test/`

ttung mentioned this issue May 24, 2019

Add the ability to write labeled experiments #1374

Merged

ttung mentioned this issue Jun 24, 2019

data set formatter with fixed filenames #1421

Merged

ttung closed this as completed in #1421 Jul 10, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Formatting tool for user friendly converting to spaceTX format #1318

Formatting tool for user friendly converting to spaceTX format #1318

shanaxel42 commented May 7, 2019

ttung commented May 17, 2019 •

edited

Loading

ttung commented Jun 6, 2019

ambrosejcarr commented Jun 6, 2019 •

edited

Loading

ambrosejcarr commented Jun 6, 2019

ttung commented Jun 6, 2019

neuromusic commented Jun 6, 2019 •

edited

Loading

neuromusic commented Jun 6, 2019

ttung commented Jun 6, 2019

Formatting tool for user friendly converting to spaceTX format #1318

Formatting tool for user friendly converting to spaceTX format #1318

Comments

shanaxel42 commented May 7, 2019

ttung commented May 17, 2019 • edited Loading

Proposal

ttung commented Jun 6, 2019

ambrosejcarr commented Jun 6, 2019 • edited Loading

ambrosejcarr commented Jun 6, 2019

ttung commented Jun 6, 2019

neuromusic commented Jun 6, 2019 • edited Loading

neuromusic commented Jun 6, 2019

ttung commented Jun 6, 2019

ttung commented May 17, 2019 •

edited

Loading

ambrosejcarr commented Jun 6, 2019 •

edited

Loading

neuromusic commented Jun 6, 2019 •

edited

Loading