Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

For this exercise we will use the epi2me-labs/wf-human-variation pipeline. Find information on the pipeline at https://labs.epi2me.io/workflows/wf-human-variation/

...

Approximate run time: Variable depending on whether it is targeted sequencing or whole genome sequencing, as well as coverage and the individual analyses requested. For instance, a 90X human sample run (options: --snp --sv --mod --str --cnv --phased --sex male) takes less than 8h with recommended resources.

NOTE: in contrast to the nf-core/sarek pipeline that we used in session 2, the epi2me-labs/wf-human-variation pipeline runs in ‘local’ mode (needs large amount of CPUs and RAM memory), while the nf-core/sarek pipeline will use a ‘pbspro’ mode, where the pipeline will submit individual jobs to the HPC cluster and define the CPUs and memory for each task individually.

...

Code Block
#!/bin/bash -l
#PBS -N run3_epi2me_WF-HV
#PBS -l select=1:ncpus=16:mem=32gb
#PBS -l walltime=48:00:00
#PBS -m abe

module load java

export NXF_OPTS='-Xms1g -Xmx4g'
cd $PBS_O_WORKDIR

nextflow run epi2me-labs/wf-human-variation \
    -r v2.2.0 \
    -profile singularity \
    --bam "/training/ONTvariants/runs/run2_mapping/SRR17138639_mapped_chr20.bam" \
    --basecaller_cfg "dna_r10.4.1_e8.2_400bps_hac_prom" \
    --mod \
    --ref "/training/ONTvariants/data/chr20.fasta" \
    --sample_name "SRR17138639" \
    --snp \
    --sv \
    --bam_min_coverage 15 \
    --out_dir "results"

Note:

  • Line 1: Defines that the script is a bash script.

  • Lines 2-5: Are commented out with “#” at the beginning and are ignored by bash, however, these PBS lines tell the scholar (PBS Pro) the name of the job (line 2), the number of CPUs and RAM memory to use (line 3), the time to run the script (line 4) and report if there are any errors (line 5).

  • Line 7: load java required to run cextflow pipelines

  • Line 8: assign up to 4GB memory for the nextflow initial script to use

  • Line 9: Tells the job to run on the current directory.

  • Lines 11-22: Parameters to run the epi2me-labs/wf-human-variation pipeline (refer above for details on each parameter)

Submit the QC job to the HPC cluster:

Code Block
qsub launch_ONTvariants_mapping.pbs

Monitor the progress of the job:

Code Block
qjobs