Lifting over bam

Sometimes for amplicon sequencings, we would like to map reads to the amplicon sequence only but bringing them back to genomic coordinates for easy variant calling and viewing.

Let's say we have a gene in chr1:100-1000, and we would first extract this locus from the genome fasta file to make a new fasta record with name >chr1:100-1000, this can be done with:

echo "chr1:100-1000" | samtools faidx -r - genome.fa > gene.fa

and map the reads to this single gene fasta file with bwa or bowtie2 to make a bam alignment file:

bwa mem gene.fa query.fq | samtools view -b > gene.bam

So what if you want to put these alignments back to the genomic coordinates after that?

The liftover_bam.liftover function is trying to solve this problem in pure python!

gene_bam="gene.bam"
genome_bam="any.bam_file_that_maps_to_the_genome"
out_bam="where_you_want_your_output_bam_file_to_be"
liftover(gene_bam, genome_bam, out_bam)

Name		Name	Last commit message	Last commit date
Latest commit History 56 Commits
.github/workflows		.github/workflows
alignments		alignments
liftover_bam		liftover_bam
ref		ref
test		test
.pre-commit-config.yaml		.pre-commit-config.yaml
README.md		README.md
makefile		makefile
mypy.ini		mypy.ini
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml
query.fa		query.fa
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Lifting over bam

About

Releases 1

Packages

Languages

wckdouglas/liftover_bam

Folders and files

Latest commit

History

Repository files navigation

Lifting over bam

About

Topics

Resources

Stars

Watchers

Forks

Releases 1

Packages 0

Languages

Packages