How-to guide: uploading a dataset to S3

If you have data you would like to share with the group, submit a PR with a new readme.md in the data directory. The recommended general structure is outlined below.

NOTE: If this is a fully curated/structured dataset, data.world may be a better home. S3 is for raw or work-in-progress data.

Document directory structure

Let people know where your data will be housed, for example:

far-right/
|--sub-folder/
  |--raw-data/
    |--input.csv
    |--input2.csv
  |--processed-data/
    |--output.csv
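
If the folder already exists locally, a listing in this style can be generated rather than typed by hand. The following is a minimal sketch, not part of the original guide: the function names `render_tree` and `describe` are hypothetical, and entries are sorted alphabetically, so the order may differ from a hand-written listing.

```python
from pathlib import Path


def render_tree(root: Path, indent: int = 0) -> list:
    """Return one line per entry under root, using the |-- style shown above."""
    lines = []
    # Sort by name so the listing is stable between runs.
    for entry in sorted(root.iterdir(), key=lambda p: p.name):
        suffix = "/" if entry.is_dir() else ""
        lines.append(" " * indent + "|--" + entry.name + suffix)
        if entry.is_dir():
            # Nested entries are indented two more spaces per level.
            lines.extend(render_tree(entry, indent + 2))
    return lines


def describe(root: Path) -> str:
    """Render the root folder name followed by its contents."""
    return "\n".join([root.name + "/"] + render_tree(root))
```

Calling `print(describe(Path("far-right")))` from the parent directory would print a listing that can be pasted into the readme.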

Overview

Give a brief overview and explain why this data is important.

Document collection methodologies

How and when was the data gathered?
What data cleaning steps or processes were applied? Link to the code used, if available.
What assumptions made during analysis may inform future analysis?
Read about data governance here

Help others get started

Links to existing notebooks using the data
Questions you hope to ask of the data / answers you seek.

Questions?

Join #assemble and ask in Slack.
Contact @bstarling to arrange access.
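
Once access has been arranged, an upload that mirrors the documented folder layout could look roughly like the sketch below. This is an illustration under stated assumptions, not the group's established procedure: the bucket name `example-bucket` and the helper names `s3_keys_for` and `upload_tree` are hypothetical, and it assumes `boto3` is installed and AWS credentials are already configured.

```python
from pathlib import Path

BUCKET = "example-bucket"  # hypothetical; use the bucket you were given access to


def s3_keys_for(local_root: Path, prefix: str) -> dict:
    """Map each local file under local_root to an S3 key under prefix."""
    keys = {}
    for path in sorted(local_root.rglob("*")):
        if path.is_file():
            # Keep the local folder structure in the key, with forward slashes.
            relative = path.relative_to(local_root).as_posix()
            keys[path] = f"{prefix.strip('/')}/{relative}"
    return keys


def upload_tree(local_root: Path, prefix: str, bucket: str = BUCKET) -> None:
    """Upload every file under local_root, preserving the folder layout."""
    import boto3  # imported here so the key mapping can be checked without boto3

    s3 = boto3.client("s3")
    for path, key in s3_keys_for(local_root, prefix).items():
        s3.upload_file(str(path), bucket, key)
```

Running `s3_keys_for` on its own first is a cheap way to check that the keys match the directory structure documented in the readme before anything is uploaded.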
