Create organized starfish.spacetx bucket #1314

shanaxel42 · 2019-05-06T20:13:48Z

We need to define how we want to organize both the published datasets @dganguli will be working with and the spaceTX datasets and results for the benchmarking.

ambrosejcarr · 2019-05-06T20:16:33Z

I think #1287 is relevant to the discussion of the formatted/processed separation.

ttung · 2019-05-08T21:49:06Z

My proposal:

Top level: A list of datasets. NOT ASSAYS.
Second level: {raw, spacetx-formatted, outputs}.
Each second-level folder has timestamped folders.
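
A minimal sketch of how a key prefix under this proposal could be assembled. The helper name `dataset_prefix`, the `YYYYMMDD` timestamp format, and the example dataset name are assumptions for illustration; the proposal itself only fixes the three levels.

```python
from datetime import date

# Second-level folders named in the proposal above.
STAGES = ("raw", "spacetx-formatted", "outputs")


def dataset_prefix(dataset: str, stage: str, when: date) -> str:
    """Build '<dataset>/<stage>/<timestamp>/' (hypothetical helper)."""
    if stage not in STAGES:
        raise ValueError(f"unknown stage {stage!r}; expected one of {STAGES}")
    # The timestamp format is an assumption; the proposal only says "timestamped".
    return f"{dataset}/{stage}/{when:%Y%m%d}/"


print(dataset_prefix("example-dataset", "raw", date(2019, 5, 8)))
# prints: example-dataset/raw/20190508/
```

Rejecting unknown stage names up front is one way to keep ad hoc second-level folders from creeping in.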

ambrosejcarr · 2019-05-08T21:54:30Z

Makes sense. Do we want a naming convention for the datasets? What information should be encoded in the name?

Hoping to avoid things like:

ISS_1
ISS_ambrose
ISS_deep
...

ttung · 2019-05-08T21:55:35Z

I would say something more about the tissue type and chemistry?

shanaxel42 · 2019-05-14T22:34:14Z

proposal!

starfish.data.published
        assays
            datasets (some sort of schema for naming tissue type and chemistry)
                raw
                    raw_data
                formatted
                    date
                        experiment.json
                processed
                    date
                        decoded.csv (or whatever)
starfish.data.unpublished
        assays
            datasets (some sort of schema for naming tissue type and chemistry)
                raw
                    raw_data
                formatted
                    date
                        experiment.json
                processed
                    date
                        decoded.csv (or whatever)
starfish.data.spaceTX
        assays
            datasets (some sort of schema for naming tissue type and chemistry)
                raw
                    raw_data
                formatted
                    date
                        experiment.json
                processed
                    date
                        decoded.csv (or whatever)

we hand off everything underneath the spaceTX directory
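
A rough sketch of how a key inside one of these buckets could be checked against the tree above. The helper `parse_key`, the `KeyParts` type, the assay name `ISS`, the dataset name, and the date format are all assumptions for illustration; only the level names (`raw`, `formatted`, `processed`) come from the proposal.

```python
from typing import NamedTuple, Optional


class KeyParts(NamedTuple):
    assay: str
    dataset: str
    stage: str            # raw, formatted, or processed
    date: Optional[str]   # None for raw, which has no date level in the tree
    rest: str


def parse_key(key: str) -> KeyParts:
    """Split '<assay>/<dataset>/<stage>/[<date>/]<rest>' (hypothetical helper)."""
    parts = key.strip("/").split("/")
    if len(parts) < 4:
        raise ValueError(f"key too short for the layout: {key!r}")
    assay, dataset, stage = parts[:3]
    if stage == "raw":
        return KeyParts(assay, dataset, stage, None, "/".join(parts[3:]))
    if stage in ("formatted", "processed"):
        if len(parts) < 5:
            raise ValueError(f"missing date or file under {stage!r}: {key!r}")
        return KeyParts(assay, dataset, stage, parts[3], "/".join(parts[4:]))
    raise ValueError(f"unknown stage {stage!r} in {key!r}")


print(parse_key("ISS/example-dataset/formatted/20190514/experiment.json"))
```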

ttung · 2019-05-14T22:59:24Z

I'm generally supportive of this approach.

What's with the top-level starfish thing?

shanaxel42 · 2019-05-14T23:02:15Z

> What's with the top-level starfish thing?

Oh, that's just whatever the top level is, so I guess right now it's starfish.data.public, but it could be whatever. I don't really have a preference for that.

ambrosejcarr · 2019-06-05T14:01:38Z

I am downloading some slide-seq data to work with. Based on the above proposal, I intend to put it in:

starfish.data.public/published/rodriques_science_2019_slide-seq_perkinje-cerebellum/20190605/<data>

The "datasets" level corresponds to <first-author-last-name>_<journal>_<year>_<assay_type>_<tissue-type> (in the example above, the journal is "science").

How does this sound for the "datasets" schema?
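
For illustration, a small sketch of composing a name under this schema. The helper `dataset_name` and its normalization rules (lowercasing, hyphens inside a field) are assumptions, not something agreed in this thread, and the second field is read here as the journal name based on the example path.

```python
import re


def dataset_name(author: str, journal: str, year: int, assay: str, tissue: str) -> str:
    """Join the five schema fields with underscores (hypothetical helper)."""
    fields = [author, journal, str(year), assay, tissue]
    cleaned = []
    for field in fields:
        # Lowercase, and turn whitespace and underscores into hyphens so the
        # underscore stays unambiguous as the field separator (an assumption).
        cleaned.append(re.sub(r"[\s_]+", "-", field.strip().lower()))
    return "_".join(cleaned)


print(dataset_name("Rodriques", "Science", 2019, "slide-seq", "perkinje cerebellum"))
# prints: rodriques_science_2019_slide-seq_perkinje-cerebellum
```

Keeping underscores only between fields means a name can be split back into its five parts with a plain `split("_")`.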

shanaxel42 · 2019-06-05T16:23:53Z

Scheme sounds fine, but it should go in starfish.data.published, not starfish.data.public; the former is the new bucket.

shanaxel42 · 2019-06-13T21:15:49Z

starfish.data.spacetx/ now contains all the current spaceTX data and results we have in an organized structure. The structure, as well as the original locations the data was copied from, is described here:

ISS_30
	mouse		
		formatted: 
			https://console.aws.amazon.com/s3/buckets/spacetx.starfish.data.upload/xiaoyan_qian/

	human
		formatted: 
			https://console.aws.amazon.com/s3/buckets/spacetx.starfish.data.upload/xiaoyan_qian/ISS_human_HCA_07_MultiFOV/main_files/?region=us-east-1&tab=overview

		starfish_results: 
			https://console.aws.amazon.com/s3/buckets/spacetx.starfish.data.upload/xiaoyan_qian/ISS_human_HCA_07_MultiFOV/main_files/*iss_spacetx_*

ISS_120 
	mouse		
		formatted: 
			https://console.aws.amazon.com/s3/object/spacetx.starfish.data.upload/xiaoyan_qian/ISS_m_brain_03/README.txt?region=us-east-1&tab=overview

	human
		formatted: 
			https://console.aws.amazon.com/s3/buckets/spacetx.starfish.data.upload/xiaoyan_qian/ISS_h_brain_03/?region=us-east-1&tab=overview

FISSEQ
	mouse: 
		contributor results: 
			spacetx.starfish.data.upload/samuel_inverso/20181203-mouse-71


BaristaSEQ: 
	mouse:  
		formatted: 
			https://console.aws.amazon.com/s3/buckets/spacetx.starfish.data.public/browse/formatted/20190319/baristaseq/?region=us-east-1&tab=overview

		contributor_results: 
			https://console.aws.amazon.com/s3/object/spacetx.starfish.data.upload/xiaoyin_chen/resultsandcode.zip?region=us-east-1&tab=overview

seqFISH
	mouse:
		formatted_multiplexed: 
			https://console.aws.amazon.com/s3/buckets/spacetx.starfish.data.upload/nico_pierson/multiplexed/

		formatted_sequential:
			spacetx.starfish.data.upload/nico_pierson/sequential/

		contributor_results_multiplexed: 
			spacetx.starfish.data.upload/nico_pierson/multiplexed/output/ 

smFISH
	mouse: 
		formatted:
			https://console.aws.amazon.com/s3/buckets/starfish.data.spacetx/smFISH/mouse/formatted/20190214/?region=us-east-1&tab=overview


spatial transcriptomics
	mouse: 
		formatted:
			Ambrose TODO
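
The locations listed above mix S3 console URLs with bare `bucket/prefix` strings. A minimal sketch, assuming only the two console path forms that appear in the listing (`/s3/buckets/...` and `/s3/object/...`), of normalizing both into `s3://` URIs; the helper name `to_s3_uri` is hypothetical, and console-specific path segments such as `browse/` are passed through untouched rather than interpreted.

```python
from urllib.parse import urlparse


def to_s3_uri(location: str) -> str:
    """Convert a console URL or a bare 'bucket/prefix' into 's3://bucket/prefix'."""
    location = location.strip()
    if "://" not in location:
        # Bare form used in the listing, e.g. 'bucket/some/prefix/'.
        return "s3://" + location
    path = urlparse(location).path  # drops the '?region=...&tab=...' query
    for marker in ("/s3/buckets/", "/s3/object/"):
        if path.startswith(marker):
            return "s3://" + path[len(marker):]
    raise ValueError(f"not a recognized S3 console URL: {location!r}")


print(to_s3_uri(
    "https://console.aws.amazon.com/s3/buckets/starfish.data.spacetx/"
    "smFISH/mouse/formatted/20190214/?region=us-east-1&tab=overview"
))
# prints: s3://starfish.data.spacetx/smFISH/mouse/formatted/20190214/
```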

shanaxel42 self-assigned this May 6, 2019

shanaxel42 added this to the SpaceTX milestone May 7, 2019

shanaxel42 added the feature New work label May 7, 2019

ambrosejcarr mentioned this issue May 29, 2019

Format complete ISS experiment and expose in starfish.data #1316

Merged

shanaxel42 changed the title ~~Organize aws buckets~~ Create organized starfish.spacetx bucket Jun 13, 2019

shanaxel42 closed this as completed Jun 13, 2019
