Data
Experiment Accession | sample | FASTQ | Experiment Title | Organism Name | Instrument | Submitter | Study Accession | Study Title | Sample Accession | Total Size, Mb | Total Spots | Total Bases | Library Strategy | Library Source | Library Selection |
SRX14748451 | S1 | SRR18645307 | Homo sapiens | Homo sapiens | MinION | Drexel University | SRP367676 | Multiplex structural variant detection by whole-genome mapping and nanopore sequencing. | SRS12509856 | 821.1 | 348226 | 972620520 | OTHER | GENOMIC | other |
SRX19406878 | S2 | SRR23513621 | NA12878 DNA sequencing from nanopore WSG consortium - basecalled sequences (Guppy 6.1.3 super accuracy) | Homo sapiens | MinION | Garvan Institute of Medical Research | SRP421403 | Curated publicly available nanopore datasets | SRS16801715 | 78526.8 | 11173458 | 97545895593 | WGS | GENOMIC | RANDOM |
ERX8211413 | S3 | ERR8578833 | MinION sequencing | Homo sapiens | MinION | the university of hong kong | ERP135493 | Target enrichment sequencing and variant calling on medical exome using ONT MinION | ERS10590135 | 8961.02 | 9636172 | 10382057986 | Targeted-Capture | GENOMIC | PCR |
ERX8211414 | S4 | ERR8578834 | MinION sequencing | Homo sapiens | MinION | the university of hong kong | ERP135493 | Target enrichment sequencing and variant calling on medical exome using ONT MinION | ERS10590135 | 10669.72 | 10644000 | 12212807287 | Targeted-Capture | GENOMIC | PCR |
SRX13322984 | S5 | SRR17138639 | Nanopore targeted sequencing (ReadUntil/ReadFish) of NA12878-HG001- basecalled sequences | Homo sapiens | MinION | Garvan Institute of Medical Research | SRP349335 | Comprehensive genetic diagnosis of tandem repeat expansion disorders with programmable targeted nanopore sequencing | SRS11230712 | 6629.97 | 5513156 | 7815960904 | WGS | GENOMIC | other |
SRX13323057 | S6 | SRR17138566 | Nanopore targeted sequencing (ReadUntil/ReadFish) of NA12878-HG001- basecalled sequences | Homo sapiens | MinION | Garvan Institute of Medical Research | SRP349335 | Comprehensive genetic diagnosis of tandem repeat expansion disorders with programmable targeted nanopore sequencing | SRS11230747 | 17107.98 | 12278391 | 20238395479 | WGS | GENOMIC | other |
Mapping
Let’s run the --help option of the pipeline to get information on the available parameters
module load java nextflow run epi2me-labs/wf-alignment -profile singularity --help
N E X T F L O W ~ version 23.12.0-edge Launching `https://github.com/epi2me-labs/wf-alignment` [nostalgic_galileo] DSL2 - revision: e1fd7a51dc [master] WARN: Config setting `prov.formats` is not defined, no provenance reports will be produced |||||||||| _____ ____ ___ ____ __ __ _____ _ _ |||||||||| | ____| _ \_ _|___ \| \/ | ____| | | __ _| |__ ___ ||||| | _| | |_) | | __) | |\/| | _| _____| |/ _` | '_ \/ __| ||||| | |___| __/| | / __/| | | | |__|_____| | (_| | |_) \__ \ |||||||||| |_____|_| |___|_____|_| |_|_____| |_|\__,_|_.__/|___/ |||||||||| wf-alignment v1.1.2-ge1fd7a5 -------------------------------------------------------------------------------- Typical pipeline command: nextflow run epi2me-labs/wf-alignment \ --fastq 'wf-alignment-demo/fastq' \ --references 'wf-alignment-demo/references' Input Options --fastq [string] FASTQ files to use in the analysis. --bam [string] BAM or unaligned BAM (uBAM) files to use in the analysis. --analyse_unclassified [boolean] Analyse unclassified reads from input directory. By default the workflow will not process reads in the unclassified directory. --references [string] Path to a directory containing FASTA reference files. --reference_mmi_file [string] Path to an MMI index file to be used as reference. --counts [string] Path to a CSV file containing expected counts as a control. Sample Options --sample_sheet [string] A CSV file used to map barcodes to sample aliases. The sample sheet can be provided when the input data is a directory containing sub-directories with FASTQ files. --sample [string] A single sample name for non-multiplexed data. Permissible if passing a single .fastq(.gz) file or directory of .fastq(.gz) files. Output Options --out_dir [string] Directory for output of all workflow results. [default: output] --prefix [string] Optional prefix attached to each of the output filenames. Advanced options --depth_coverage [boolean] Calculate depth coverage statistics and include them in the report. [default: true] --minimap_preset [choice] Pre-defined parameter sets for `minimap2`, covering most common use cases. [default: dna] * dna * rna --minimap_args [string] String of command line arguments to be passed on to `minimap2`. Miscellaneous Options --threads [integer] Number of CPU threads to use for the alignment step. [default: 4] --disable_ping [boolean] Enable to prevent sending a workflow ping. Other parameters --monochrome_logs [boolean] null --validate_params [boolean] null [default: true] --show_hidden_params [boolean] null !! Hiding 4 params, use --show_hidden_params to show them !! -------------------------------------------------------------------------------- If you use epi2me-labs/wf-alignment for your analysis please cite: * The nf-core framework https://doi.org/10.1038/s41587-020-0439-x