Bioinformatic Random Seed

Keep up with what's happening in Bioinformatics and Machine Learning (^ω^)

Back to main page

Alignment

Last update: 06/05/2020
By: Huitian (Yolanda) Diao

1. Concept

1.1 Alignment

In NGS, alignment means to compare short reads with reference genome. It generate information including the corresponding location of the reads, and quality of alignment.

Input: short reads (fastq format)
Output: aligned files (SAM / BAM)
Goal: Aquire abundance information of genetic features in short reads

1.2 Aligner v.s. Mapper

Aligners try to find exact location of reads by base mapping, while mappers assign the reads to features without requirement of exact alignment. Mappers are usually faster that aligners. Aligners: Bowtie, Tophat, STAR
Mapper: Kallisto, Salmon

1.3 Global v.s. local

Global alignment attemps to align the entire sequence, while local alignment trys to find regions with highest level of similarity between sequences. Local alignments are suitable for comaparing dissimilar sequences that has similar regions.

1.4 Parameters

1.4.1 Read clipping
1.4.2 Splicing
1.4.3 Insert size
1.4.4 Inner distance

2. Algorithms

2.1 Burrows-Wheeler Transformation (BWT)

BWT is a lossless data compression method, which works by rearranging character string into runs of similar characters. BWT is basis for alignment tools such as Bowtie, BWA and SOAP2.

3. Tools

Single cell alignment

References

  1. Wikibooks - Next Generation Sequencing(NGS) / Alignment
  2. UC Davis Bioinformatics training
  3. Bleidorn C. (2017) Alignment and Mapping. In: Phylogenomics. Springer, Cham
  4. Wikipedia - Burrows-Wheeler_transform