Skip to content

Latest commit

 

History

History
34 lines (27 loc) · 1.31 KB

how_to_submit_dataset.md

File metadata and controls

34 lines (27 loc) · 1.31 KB

How-to guide to upload dataset to s3

If you have data you would like to share with the group. Submit a PR with a new readme.md in the data directory. General recommended structure is outlined below.

NOTE: if this is a fully curated/structured dataset data.world may be a better home. S3 is for raw or work in progress data.

Document directory structure

Let people know where your data will be housed

far-right/  
|--sub-folder/
  |--raw-data/
    |--input.csv
    |--input2.csv
  |--processed-data/
      |--output.csv

Overview

  • Give a brief overview and explain why this data is important

Document collection methodologies

  • How/when was the data gathered
  • Data cleaning steps/process. Links to code used?
  • Assumptions made during analysis that may inform future analysis
  • Read about data governance here

Help others get started

  • Links to existing notebooks using the data
  • Questions you hope to ask of the data / answers you seek.

Questions?