Powerpoint | ||
---|---|---|
|
What is RNA-seq?
RNA-seq (RNA sequencing) is a powerful and widely used technique for analysing the quantity and sequences of RNA in a sample. It provides a snapshot of the entire transcriptome—the complete set of RNA transcripts produced by the genome at a given time.
...
In practice, both pipelines are highly regarded, and the best choice often comes down to the specific requirements of your research project and the resources available.
Overview of pipeline tools:
source: https://nf-co.re/rnaseq/3.14.0
Merge re-sequenced FastQ files (
cat
)Auto-infer strandedness by subsampling and pseudoalignment (
fq
,Salmon
)Read QC (
FastQC
)UMI extraction (
UMI-tools
)Adapter and quality trimming (
Trim Galore!
)Removal of genome contaminants (
BBSplit
)Removal of ribosomal RNA (
SortMeRNA
)Choice of multiple alignment and quantification routes:
STAR
->Salmon
STAR
->RSEM
HiSAT2
-> NO QUANTIFICATION
Sort and index alignments (
SAMtools
)UMI-based deduplication (
UMI-tools
)Duplicate read marking (
picard MarkDuplicates
)Transcript assembly and quantification (
StringTie
)Create bigWig coverage files (
BEDTools
,bedGraphToBigWig
)Extensive quality control:
RSeQC
Qualimap
dupRadar
Preseq
DESeq2
Kraken2
->Bracken
on unaligned sequences; optional
Pseudoalignment and quantification (
Salmon
or ‘Kallisto’; optional)Present QC for raw read, alignment, gene biotype, sample similarity, and strand-specificity checks (
MultiQC
,R
)