read-mapping

Illumina (and SOLiD) sensitive read mapping tool (cloned from svn://scm.gforge.inria.fr/svnroot/storm/, original code from @marta- , with some work done by @yoann-dufresne)

incremental fasta fastq-format illumina fastq indels sequence-alignment illumina-sequencing read-mapping spaced-seed simd-intrinsics fasta-format openmp-parallelization sam-file sam-output

Updated Nov 16, 2017
C

acgtun / perm2

Star

DNA read mapping (seed-extension-aligner)

sequence-alignment read-mapping spaced-seed

Updated Dec 18, 2017
C++

achacond / gem-cutter

Star

Highly optimized genomic resources for GPUs

bioinformatics cuda fm-index read-mapping text-distance

Updated Sep 23, 2018
C

Schre / Read-Mapping

Star

Mapping reads to a corresponding genome using a suffix tree (linear time construction) and Smith-Waterman local alignment algorithm

computational-biology smith-waterman alignment read-mapping ukkonen-algorithm

Updated Nov 16, 2018
C++

kalyaniasthana / FindingMutationsInDNAAndProteins_BioinformaticsVI

Star

coding problems from course 6 of the Bioinformatics specialization

hmm bioinformatics trie sequence-alignment read-mapping hmm-model hmm-viterbi-algorithm burrows-wheeler-transform bioinformatics-algorithms trie-tree trie-structure sequence-alignment-algoirthms trie-coloring profile-hmm

Updated Jun 17, 2020
Python

Lightweight single-html-file-based Genome Segments playground for Visualize genome features cluster(gene arrow map or other features), add synteny among genome fragments or add crosslink among features, add short(PE/MP)/long reads(pacbio or nanopore) mapping or snpindel in vcf(not support complex sv yet), support all CIGAR of sam alignment, dire…

Updated Jul 12, 2020
HTML

Mangul-Lab-USC / review-technology-dictates-algorithms

Star

A systematic survey of algorithmic foundations and methodologies across 107 alignment methods (1988-2021), for both short and long reads. We provide a rigorous experimental evaluation of 11 read aligners to demonstrate the effect of these underlying algorithms on speed and efficiency of read alignment. Described by Alser et al. at https://arxiv.…

hts smith-waterman needleman-wunsch heuristics genome-analysis sequence-alignment illumina-sequencing read-mapping nanopore-sequencing hifi-read pacbio-sequencing read-alignments

Updated Jul 7, 2021
Jupyter Notebook

berkalpyakici / mscrm

Star

Reference-based read-mapper which performs ungapped alignment of sample reads on reference sequence.

read-mapping

Updated Dec 4, 2021
Java

CMU-SAFARI / GenStore

Star

GenStore is the first in-storage processing system designed for genome sequence analysis that greatly reduces both data movement and computational overheads of genome sequence analysis by exploiting low-cost and accurate in-storage filters. Described in the ASPLOS 2022 paper by Mansouri Ghiasi et al. at https://people.inf.ethz.ch/omutlu/pub/GenS…

ftl ssd sequence-alignment read-mapping long-reads hardware-accelerator near-data-processing pre-alignment-filtering in-storage-processing exact-matching

Updated Apr 6, 2022
C

CMU-SAFARI / Molecules2Variations

Star

The first work to provide a comprehensive survey of a prominent set of algorithmic improvement and hardware acceleration efforts for the entire genome analysis pipeline used for the three most prominent sequencing data, short reads (Illumina), ultra-long reads (ONT), and accurate long reads (HiFi). Described in arXiv (2022) by Alser et al. https…

Updated Jun 14, 2022

CMU-SAFARI / GenASM

Star

Source code for the software implementations of the GenASM algorithms proposed in our MICRO 2020 paper: Senol Cali et. al., "GenASM: A High-Performance, Low-Power Approximate String Matching Acceleration Framework for Genome Sequence Analysis" at https://people.inf.ethz.ch/omutlu/pub/GenASM-approximate-string-matching-framework-for-genome-analys…

approximate-string-matching read-mapping hw-sw-co-design read-alignment bitap-algorithm pre-alignment-filtering genome-sequence-analysis

Updated Dec 19, 2022
C

CMU-SAFARI / BLEND

Star

BLEND is a mechanism that can efficiently find fuzzy seed matches between sequences to significantly improve the performance and accuracy while reducing the memory space usage of two important applications: 1) finding overlapping reads and 2) read mapping. Described by Firtina et al. (published in NARGAB https://doi.org/10.1093/nargab/lqad004)

bioinformatics genome-analysis genome-assembly blend read-mapping de-novo-assembly minimizers strobemers seed-matching fuzzy-seeds read-overlapping spaced-seeds

Updated May 10, 2023
C

CMU-SAFARI / GateSeeder

Star

GateSeeder is the first near-memory CPU-FPGA co-design for alleviating both the compute-bound and memory-bound bottlenecks in short and long-read mapping. GateSeeder outperforms Minimap2 by up to 40.3×, 4.8×, and 2.3× when mapping real ONT, HiFi, and Illumina reads, respectively.

bioinformatics fpga genomics indexing seeding hbm sequence-alignment read-mapping long-reads minimap2 minimizers short-reads genome-on-diet

Updated Oct 3, 2023
C

CMU-SAFARI / SequenceLab

Star

SequenceLab is a benchmark suite for evaluating computational methods for comparing genomic sequences, such as pre-alignment filters and pairwise sequence alignment algorithms. SequenceLab is described by Rumpf et al. at https://arxiv.org/abs/2310.16908

benchmark sequence-alignment read-mapping