Skip to content

Commit

Permalink
Merge pull request #372 from cidgoh/ambr-update
Browse files Browse the repository at this point in the history
ambr and CanCoGeN schema updates
  • Loading branch information
ddooley authored Jan 24, 2023
2 parents 8ba0c00 + 56b730b commit b17f798
Show file tree
Hide file tree
Showing 20 changed files with 1,544 additions and 1,192 deletions.
551 changes: 318 additions & 233 deletions web/templates/ambr/schema.json

Large diffs are not rendered by default.

531 changes: 331 additions & 200 deletions web/templates/ambr/schema.yaml

Large diffs are not rendered by default.

2 changes: 1 addition & 1 deletion web/templates/ambr/schema_core.yaml
Original file line number Diff line number Diff line change
@@ -1,7 +1,7 @@
id: https://example.com/AMBR
name: AMBR
description: ""
version: 1.0.0
version: 2.1.1
imports:
- 'linkml:types'
prefixes:
Expand Down
1,019 changes: 516 additions & 503 deletions web/templates/ambr/schema_enums.tsv

Large diffs are not rendered by default.

115 changes: 57 additions & 58 deletions web/templates/ambr/schema_slots.tsv

Large diffs are not rendered by default.

3 changes: 2 additions & 1 deletion web/templates/canada_covid19/exampleInput/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -2,7 +2,8 @@

This directory contains example input/test data for the Canadian COVID Genomics Network (CanCOGeN) DataHarmonizer application template: `CanCOGeN Covid-19`. This data is appropriate for testing up to the version appended to the end of the file name, for example:

- `validTestData_1-0-0.csv` is _valid_ for version `1-0-0` of the DataHarmonizer.
- `validTestData_2-1-2.csv` is _valid_ for version `2-1-2` of the DataHarmonizer template.
- `invalidTestData_1-0-0.csv` is still _invalid_ for version `2-1-2` of the DataHarmonizer template, and thus can still be used to explore validation features.

## Valid Test Data

Expand Down

This file was deleted.

This file was deleted.

Binary file not shown.
Binary file not shown.
Original file line number Diff line number Diff line change
@@ -0,0 +1,3 @@
Database Identifiers,,,,,,,,,,,,Sample collection and processing,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,Host Information,,,,,,,,,,,,,,,,,Host vaccination information,,,,,,,,,,,Host exposure information,,,,,,,,,,,,,,,Host reinfection information,,,,,,Sequencing,,,,,,,,,,,,Bioinformatics and QC metrics,,,,,,,,,,,,,,,,,,,,,Lineage and Variant information,,,,,,Pathogen diagnostic testing,,,,,,,,,Contributor acknowledgement,
specimen collector sample ID,third party lab service provider name,third party lab sample ID,case ID,Related specimen primary ID,IRIDA sample name,umbrella bioproject accession,bioproject accession,biosample accession,SRA accession,GenBank accession,GISAID accession,sample collected by,sample collector contact email,sample collector contact address,sequence submitted by,sequence submitter contact email,sequence submitter contact address,sample collection date,sample collection date precision,sample received date,geo_loc_name (country),geo_loc_name (state/province/territory),geo_loc_name (city),organism,isolate,purpose of sampling,purpose of sampling details,NML submitted specimen type,Related specimen relationship type,anatomical material,anatomical part,body product,environmental material,environmental site,collection device,collection method,collection protocol,specimen processing,specimen processing details,lab host,passage number,passage method,biomaterial extracted,host (common name),host (scientific name),host health state,host health status details,host health outcome,host disease,host age,host age unit,host age bin,host gender,host residence geo_loc name (country),host residence geo_loc name (state/province/territory),host subject ID,symptom onset date,signs and symptoms,pre-existing conditions and risk factors,complications,host vaccination status,number of vaccine doses received,vaccination dose 1 vaccine name,vaccination dose 1 vaccination date,vaccination dose 2 vaccine name,vaccination dose 2 vaccination date,vaccination dose 3 vaccine name,vaccination dose 3 vaccination date,vaccination dose 4 vaccine name,vaccination dose 4 vaccination date,vaccination history,location of exposure geo_loc name (country),destination of most recent travel (city),destination of most recent travel (state/province/territory),destination of most recent travel (country),most recent travel departure date,most recent travel return date,travel point of entry type,border testing test day type,travel history,travel history availability,exposure event,exposure contact level,host role,exposure setting,exposure details,prior SARS-CoV-2 infection,prior SARS-CoV-2 infection isolate,prior SARS-CoV-2 infection date,prior SARS-CoV-2 antiviral treatment,prior SARS-CoV-2 antiviral treatment agent,prior SARS-CoV-2 antiviral treatment date,purpose of sequencing,purpose of sequencing details,sequencing date,library ID,amplicon size,library preparation kit,flow cell barcode,sequencing instrument,sequencing protocol name,sequencing protocol,sequencing kit number,amplicon pcr primer scheme,raw sequence data processing method,dehosting method,consensus sequence name,consensus sequence filename,consensus sequence filepath,consensus sequence software name,consensus sequence software version,breadth of coverage value,depth of coverage value,depth of coverage threshold,r1 fastq filename,r2 fastq filename,r1 fastq filepath,r2 fastq filepath,fast5 filename,fast5 filepath,number of base pairs sequenced,consensus genome length,Ns per 100 kbp,reference genome accession,bioinformatics protocol,lineage/clade name,lineage/clade analysis software name,lineage/clade analysis software version,variant designation,variant evidence,variant evidence details,gene name 1,diagnostic pcr protocol 1,diagnostic pcr Ct value 1,gene name 2,diagnostic pcr protocol 2,diagnostic pcr Ct value 2,gene name 3,diagnostic pcr protocol 3,diagnostic pcr Ct value 3,authors,DataHarmonizer provenance
lab_12345,,,,,,,,,,,,Alberta Precision Labs (APL),,,Alberta Precision Labs (APL),,,2023-01-02,day,,Canada,Alberta,,Severe acute respiratory syndrome coronavirus 2,hCov-19/CANADA/BC-prov_rona_99/2023,Diagnostic testing,Not Applicable,Not Applicable,,Not Applicable,Nasopharynx (NP),Not Applicable,Not Applicable,Not Applicable,Swab,Not Applicable,,,,,,,,,Homo sapiens,,,,COVID-19,23,year,20 - 29,Female,,,,,,,,,,,,,,,,,,,,,,,,,,,,Travel history available,,,,,,,,,,,,Baseline surveillance (random sampling),Not Applicable,2023-01-05,,,,,Illumina NextSeq 550,,"Genomes were generated through amplicon sequencing of 1200 bp amplicons with Freed schema primers. Libraries were created using Illumina DNA Prep kits, and sequence data was produced using Miseq Micro v2 (500 cycles) sequencing kits.",,Midnight 1200,ncov-tools 1.2.3,ncov-tools 1.2.3,,,,iVar 1.2.3,iVar 1.2.3,,,,,,,,,,,,,,ncov-tools 1.2.3,,,,,,,E gene (orf4),,23.5,,,,,,,,
Loading

0 comments on commit b17f798

Please sign in to comment.