Data download using module load sra-tools

Aim:

Download FASTQ files for samples of interest.

SRA website: https://www.google.com/search?client=safari&rls=en&q=NCBI++SRA&ie=UTF-8&oe=UTF-8

Search for NGS data of interest

In the ‘search box’, enter one of the following identifiers:

  • Project accession (e.g., PRJNA229998)

  • Study accession (e.g., SRP033351)

  • Experiment accession (e.g., SRX384360)

  • Run accession (e.g., SRR1039508)

  • or search for keywords, for example, “RNA-seq mouse”
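The identifier types above can be told apart by their prefix. The following is a minimal sketch, not part of the SRA tooling: `accession_type` is a hypothetical helper name, and the patterns assume the usual NCBI/EBI/DDBJ prefixes (PRJNA/PRJEB/PRJDB for projects; SRP/ERP/DRP, SRX/ERX/DRX and SRR/ERR/DRR for studies, experiments and runs).

```shell
#!/bin/bash
# Hypothetical helper: classify an SRA identifier by its prefix.
# The patterns are shell globs and only check the start of the string.
accession_type() {
    case "$1" in
        PRJ[EDN][A-Z][0-9]*) echo "project" ;;
        [SED]RP[0-9]*)       echo "study" ;;
        [SED]RX[0-9]*)       echo "experiment" ;;
        [SED]RR[0-9]*)       echo "run" ;;
        *)                   echo "unknown" ;;
    esac
}

# Classify the example identifiers used on this page.
for id in PRJNA229998 SRP033351 SRX384360 SRR1039508; do
    printf '%s\t%s\n' "$id" "$(accession_type "$id")"
done
```

Only run accessions (the SRR-style identifiers) go into the download list prepared in the next step.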

 

Prepare a list of the SRA run accessions to download and save it in a file, for example, ‘SRR_Acc_List.txt’:

SRR1039508
SRR1039509
SRR1039510
SRR1039511
SRR1039512
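One way to create the list from the command line and check it before submitting a job is sketched below. The check is an assumption about what a valid entry looks like (one SRR/ERR/DRR run accession per line); it is not something sra-tools requires.

```shell
#!/bin/bash
# Write the example run accessions to the list file, one per line.
cat > SRR_Acc_List.txt <<'EOF'
SRR1039508
SRR1039509
SRR1039510
SRR1039511
SRR1039512
EOF

# Count the lines that do NOT look like a run accession.
# grep exits non-zero when the count is 0, hence the "|| true".
bad=$(grep -Evc '^[SED]RR[0-9]+$' SRR_Acc_List.txt || true)
total=$(grep -c . SRR_Acc_List.txt)
echo "accessions: $total, malformed lines: $bad"
```

A non-zero count of malformed lines usually means a stray header, blank line or project/study accession has slipped into the file.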

Prepare a PBS Pro submission script (saved here as launch_fetch_SRAfiles.pbs, the name used in the qsub command further down) as follows:

#!/bin/bash -l
#PBS -N rna
#PBS -l select=1:ncpus=1:mem=8gb
#PBS -l walltime=24:00:00

#Enable the container modules
source /pkg/shpc/enable

#Load the SRA-TOOLS module
module load sra-tools/3.0.5--h9f5acd7_1

#work on current directory (folder)
cd $PBS_O_WORKDIR

for i in $(cat SRR_Acc_List.txt); do
    echo $i
    prefetch.3 $i
    fasterq-dump.3 --split-files $i
done

gzip *fastq
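If a job is resubmitted after hitting the walltime, the loop in the script starts again from the first accession. The sketch below is one possible variation, not the script used on this page: it skips accessions whose FASTQ files already exist and can be dry-run on a login node. `run`, `fetch_list` and the `DRY_RUN` variable are hypothetical names introduced for illustration; `prefetch.3` and `fasterq-dump.3` are the command names from the script above, and the output file naming (`<accession>*.fastq`) is an assumption.

```shell
#!/bin/bash
# DRY_RUN=1 (the default here) only prints the commands.
# Set DRY_RUN=0 inside a real job, after loading the sra-tools module.
DRY_RUN="${DRY_RUN:-1}"

# Either print or execute the given command.
run() {
    if [ "$DRY_RUN" = "1" ]; then
        echo "would run: $*"
    else
        "$@"
    fi
}

# Download every accession in the list file given as $1,
# skipping those that already have FASTQ output in the current folder.
fetch_list() {
    local list="$1" acc
    for acc in $(cat "$list"); do
        if compgen -G "${acc}*.fastq*" > /dev/null; then
            echo "skip $acc (FASTQ already present)"
            continue
        fi
        run prefetch.3 "$acc"
        run fasterq-dump.3 --split-files "$acc"
    done
}

# Only act when the list file from the previous step exists.
if [ -f SRR_Acc_List.txt ]; then
    fetch_list SRR_Acc_List.txt
fi
```

Running this with the default `DRY_RUN=1` prints the `prefetch.3` and `fasterq-dump.3` commands that would be executed, which is a cheap way to check the list before using up queue time.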

Submit the PBS script to the HPC cluster:

qsub launch_fetch_SRAfiles.pbs

Monitor job progression
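The page does not say which command to use for monitoring. On a PBS Pro cluster, `qstat` is the standard client for this, so a minimal sketch could look like the following. `show_my_jobs` is a hypothetical helper name, and the fallback message is there only so the snippet also runs on machines without the PBS client tools.

```shell
#!/bin/bash
# Hypothetical helper: list your own jobs if the PBS client tools are available.
show_my_jobs() {
    if command -v qstat > /dev/null 2>&1; then
        # -u restricts the listing to jobs owned by the given user
        qstat -u "${USER:-$(id -un)}"
    else
        echo "qstat not found: run this on a cluster login node"
    fi
}

show_my_jobs
```

For the details of a single job, `qstat -f` followed by the job ID printed by `qsub` can be used. Once the job finishes, PBS normally writes its standard output and error to files named after the job (here `rna.o` and `rna.e` followed by the job number) in the submission directory.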

 

 
