Refer to Data Download for additional information.
Public data: https://www.ebi.ac.uk/ena/browser/view/PRJNA961731
Manuscript:
Son S, Kim B, Yang J, Kim VN. Role of the proline-rich disordered domain of DROSHA in intronic microRNA processing. Genes Dev. 2023 May 1;37(9-10):383-397.
doi: 10.1101/gad.350275.122. Epub 2023 May 26. PMID: 37236670; PMCID: PMC10270192. https://genesdev.cshlp.org/content/37/9-10/383.long
...
(2) HCT116 WT cells cultured with media supplemented with 9.1 % or 0.1 % FBS.
Sample metadata:
sample | group | treatment | cell | replicate |
SRR24302008 | WT | 0.1 % FBS, 72 hr | HCT116 | rep3 |
SRR24302009 | WT | 0.1 % FBS, 72 hr | HCT116 | rep2 |
SRR24302010 | WT | 0.1 % FBS, 72 hr | HCT116 | rep1 |
SRR24302011 | WT | 9.1 % FBS, 72 hr | HCT116 | rep3 |
SRR24302012 | WT | 9.1 % FBS, 72 hr | HCT116 | rep2 |
SRR24302013 | WT | 9.1 % FBS, 72 hr | HCT116 | rep1 |
SRR24302014 | DROSHA_KO | GFP | HCT116 | rep1 |
SRR24302015 | WT | no treatment | HCT116 | rep1 |
SRR24302016 | DROSHA_KO | DROSHA-p140 | HEK293E | rep3 |
SRR24302017 | DROSHA_KO | DROSHA-p140 | HEK293E | rep2 |
SRR24302018 | DROSHA_KO | DROSHA-p140 | HEK293E | rep1 |
SRR24302019 | DROSHA_KO | full-length DROSHA | HEK293E | rep3 |
SRR24302020 | DROSHA_KO | full-length DROSHA | HEK293E | rep2 |
SRR24302021 | DROSHA_KO | full-length DROSHA | HEK293E | rep1 |
SRR24302022 | DROSHA_KO | DROSHA-p140 | HCT116 | rep3 |
SRR24302023 | DROSHA_KO | DROSHA-p140 | HCT116 | rep2 |
SRR24302024 | DROSHA_KO | DROSHA-p140 | HCT116 | rep1 |
SRR24302025 | DROSHA_KO | full-length DROSHA | HCT116 | rep3 |
SRR24302026 | DROSHA_KO | full-length DROSHA | HCT116 | rep2 |
SRR24302027 | DROSHA_KO | full-length DROSHA | HCT116 | rep1 |
Download data
Prepare the following PBS Pro script to submit the request to download of files to the HPC cluster.
Code Block |
---|
#!/bin/bash -l #PBS -N download _ENA _data #PBS -l select=1:ncpus=2:mem=8gb #PBS -l walltime=24:00:00 #work on current directory (folder) cd $PBS_O_WORKDIR wget -nc ftp://ftp.sra.ebi.ac.uk/vol1/fastq/SRR243/023/SRR24302023/SRR24302023.fastq.gz wget -nc ftp://ftp.sra.ebi.ac.uk/vol1/fastq/SRR243/017/SRR24302017/SRR24302017.fastq.gz wget -nc ftp://ftp.sra.ebi.ac.uk/vol1/fastq/SRR243/019/SRR24302019/SRR24302019.fastq.gz wget -nc ftp://ftp.sra.ebi.ac.uk/vol1/fastq/SRR243/012/SRR24302012/SRR24302012.fastq.gz wget -nc ftp://ftp.sra.ebi.ac.uk/vol1/fastq/SRR243/026/SRR24302026/SRR24302026.fastq.gz wget -nc ftp://ftp.sra.ebi.ac.uk/vol1/fastq/SRR243/027/SRR24302027/SRR24302027.fastq.gz wget -nc ftp://ftp.sra.ebi.ac.uk/vol1/fastq/SRR243/013/SRR24302013/SRR24302013.fastq.gz wget -nc ftp://ftp.sra.ebi.ac.uk/vol1/fastq/SRR243/015/SRR24302015/SRR24302015.fastq.gz wget -nc ftp://ftp.sra.ebi.ac.uk/vol1/fastq/SRR243/021/SRR24302021/SRR24302021.fastq.gz wget -nc ftp://ftp.sra.ebi.ac.uk/vol1/fastq/SRR243/024/SRR24302024/SRR24302024.fastq.gz wget -nc ftp://ftp.sra.ebi.ac.uk/vol1/fastq/SRR243/010/SRR24302010/SRR24302010.fastq.gz wget -nc ftp://ftp.sra.ebi.ac.uk/vol1/fastq/SRR243/018/SRR24302018/SRR24302018.fastq.gz wget -nc ftp://ftp.sra.ebi.ac.uk/vol1/fastq/SRR243/022/SRR24302022/SRR24302022.fastq.gz wget -nc ftp://ftp.sra.ebi.ac.uk/vol1/fastq/SRR243/011/SRR24302011/SRR24302011.fastq.gz wget -nc ftp://ftp.sra.ebi.ac.uk/vol1/fastq/SRR243/009/SRR24302009/SRR24302009.fastq.gz wget -nc ftp://ftp.sra.ebi.ac.uk/vol1/fastq/SRR243/016/SRR24302016/SRR24302016.fastq.gz wget -nc ftp://ftp.sra.ebi.ac.uk/vol1/fastq/SRR243/020/SRR24302020/SRR24302020.fastq.gz wget -nc ftp://ftp.sra.ebi.ac.uk/vol1/fastq/SRR243/008/SRR24302008/SRR24302008.fastq.gz wget -nc ftp://ftp.sra.ebi.ac.uk/vol1/fastq/SRR243/025/SRR24302025/SRR24302025.fastq.gz wget -nc ftp://ftp.sra.ebi.ac.uk/vol1/fastq/SRR243/014/SRR24302014/SRR24302014.fastq.gz |
...
Monitor progress of job:
Code Block |
---|
qjobs |
Preparing “sample sheet” index file:
Code Block |
---|
#fetch samples names
ls --color=never *.gz -1 | awk -F "." '{print $1}' > sample.names
#fetch the path to each FASTQ file
for i in `ls --color=never *.gz`; do echo $i; readlink -f $i >> sample.path ; done
#merge the sample names and FASTQ path
echo "sample,fastq_1" > samplesheet.csv
paste sample.names sample.path | awk '{print $1 "," $2}' >> samplesheet.csv
#remove intermediate files
rm sample.names sample.path |