This repository has been archived by the owner on Jun 21, 2023. It is now read-only.
-
Notifications
You must be signed in to change notification settings - Fork 4
/
README
47 lines (31 loc) · 1.76 KB
/
README
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
** Depreciated, check out GRIPper2: https://github.com/adamewing/GRIPper2 **
gripper README.txt
Adam Ewing <adam.ewing@gmail.com>
GRIPper is a tool to identify non-reference gene retrocopy insertion polymorphisms
(GRIPs) from paired-end whole-genome sequence data, specifically tuned for Illumina
reads.
Prerequisites:
bwa (http://bio-bwa.sourceforge.net/)
samtools (http://samtools.sourceforge.net/)
tabix (http://samtools.sourceforge.net/tabix.shtml)
pysam ( https://github.com/pysam-developers/pysam)
parallelpython (http://www.parallelpython.com/)
Usage:
There are a number of auxilliary files that specify details of the various
GRIPs one expects to find. Currently, gripper is configured to analyse human
genomes; this can be changed by substituting the appropriate annotations for
another species of interest.
* To begin, human_sample.cfg will have to be modified to assign
'hg19' to a copy of hg19 that has been indexed by `bwa index -a stdsw`
* By default, GRIPper expects chromosome names to be prefixed by 'chr'. If this is
not the case for your reference genome, use 'usechr' to 'False' in the config
file (e.g. human_sample.cfg).
* To increase sensitivity, the parameter 'minPeakSize' may be decreased from the
default of 8 reads to a minmum of 4 reads.
* If there is more than one sample (BAM file) specified in gripper.py, changing the
-p command line flag will run multiple jobs in parallel.
Start by trying the included example (test/example.bam) with the following command:
./gripper.py -s samplelist_example.txt -c human_sample.cfg -o test
output will end up in test/example, the most relevant files are
uncategorized.tab.txt and uncategorizedbreaks.tab.txt, the former contains the
coordinates and annotations, the latter contains breakpoint information.