Skip to content

Latest commit

 

History

History
25 lines (13 loc) · 1.43 KB

Annotation.md

File metadata and controls

25 lines (13 loc) · 1.43 KB

^Front Page

<Assembly QC

Annotation

The following are some basic steps for assembly annotation. These are by no means comprehensive, as your assembly annotation needs will differ depending on the downstream uses of the given assembly. As with assembly QC, all steps are run using Annotation_script.sh [Sample] [Assembler] [Mode].

Repeat Annotation

RepeatMasker run as part of TETools can be used to annotate repeatitive seqeunce in your genoem assembly. Note that human repeat elements are assumed (--species human), and the script will need to be tweak if annotating a non-human genome.

Annotation_RepeatMasker.sh [Sample] [Assembler] [Mode]

Transcript Annotation

LiftOff can be used to lift over GFF information from a reference sequence to your genome assembly (assuming the same or a closely related species).

Annotation_LiftOff.sh [Sample] [Assembler] [Mode]

Reference Sequence Annotation and Misjoin Check

Minimap2 can be used to create a .paf aligment file from a given reference sequence to your genome assembly. In addition, this script checks for misjoins based on the produced .paf file. Note that this script is currently mislabelled as 'CHM13' it should work on any reference sequence given.

Annotation_CHM13Alignment.sh [Sample] [Assembler] [Mode]