Skip to end of metadata
Go to start of metadata

You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 7 Next »

Refer to Data Download for additional information.

Public data: https://www.ebi.ac.uk/ena/browser/view/PRJNA961731

Manuscript:

Son S, Kim B, Yang J, Kim VN. Role of the proline-rich disordered domain of DROSHA in intronic microRNA processing. Genes Dev. 2023 May 1;37(9-10):383-397.

doi: 10.1101/gad.350275.122. Epub 2023 May 26. PMID: 37236670; PMCID: PMC10270192. https://genesdev.cshlp.org/content/37/9-10/383.long

Background:

DROSHA serves as a gatekeeper of the microRNA (miRNA) pathway by processing primary transcripts (pri-miRNAs). While the functions of structured domains of DROSHA have been well-documented, the contribution of N-terminal proline-rich disordered domain (PRD) remains elusive. Here we show that the PRD promotes the processing of miRNA hairpins within introns. We identified a DROSHA isoform (p140) lacking the PRD produced by proteolytic cleavage. Small RNA sequencing revealed that p140 is significantly impaired in the maturation of intronic miRNAs. Consistently, our minigene constructs demonstrated that PRD enhances the processing of intronic hairpins, but not those in exons. Splice site mutations did not affect the PRD’s enhancing effect on intronic constructs, suggesting that the PRD acts independently of splicing reaction by interacting with sequences residing within introns. The N-terminal regions from zebrafish and Xenopus DROSHA can replace the human counterpart, indicating functional conservation despite poor sequence alignment. Moreover, we found that rapidly evolving intronic miRNAs are generally more dependent on PRD than conserved ones, suggesting a role of PRD in miRNA evolution. Our study reveals a new layer of miRNA regulation mediated by a low-complexity disordered domain that senses the genomic contexts of miRNA loci.

Overall design:

Comparative miRNA expression profiling analysis of small RNA-seq data for:

(1) full-length DROSHA or DROSHA-p140 rescue in HCT116 or HEK293E DROSHA knockout (KO) cells and

(2) HCT116 WT cells cultured with media supplemented with 9.1 % or 0.1 % FBS.

Download data

Prepare the following PBS Pro script to submit the request to download of files to the HPC cluster.

#!/bin/bash -l
#PBS -N download_ENA_data
#PBS -l select=1:ncpus=2:mem=8gb
#PBS -l walltime=24:00:00

#work on current directory (folder)
cd $PBS_O_WORKDIR

wget -nc ftp://ftp.sra.ebi.ac.uk/vol1/fastq/SRR243/023/SRR24302023/SRR24302023.fastq.gz
wget -nc ftp://ftp.sra.ebi.ac.uk/vol1/fastq/SRR243/017/SRR24302017/SRR24302017.fastq.gz
wget -nc ftp://ftp.sra.ebi.ac.uk/vol1/fastq/SRR243/019/SRR24302019/SRR24302019.fastq.gz
wget -nc ftp://ftp.sra.ebi.ac.uk/vol1/fastq/SRR243/012/SRR24302012/SRR24302012.fastq.gz
wget -nc ftp://ftp.sra.ebi.ac.uk/vol1/fastq/SRR243/026/SRR24302026/SRR24302026.fastq.gz
wget -nc ftp://ftp.sra.ebi.ac.uk/vol1/fastq/SRR243/027/SRR24302027/SRR24302027.fastq.gz
wget -nc ftp://ftp.sra.ebi.ac.uk/vol1/fastq/SRR243/013/SRR24302013/SRR24302013.fastq.gz
wget -nc ftp://ftp.sra.ebi.ac.uk/vol1/fastq/SRR243/015/SRR24302015/SRR24302015.fastq.gz
wget -nc ftp://ftp.sra.ebi.ac.uk/vol1/fastq/SRR243/021/SRR24302021/SRR24302021.fastq.gz
wget -nc ftp://ftp.sra.ebi.ac.uk/vol1/fastq/SRR243/024/SRR24302024/SRR24302024.fastq.gz
wget -nc ftp://ftp.sra.ebi.ac.uk/vol1/fastq/SRR243/010/SRR24302010/SRR24302010.fastq.gz
wget -nc ftp://ftp.sra.ebi.ac.uk/vol1/fastq/SRR243/018/SRR24302018/SRR24302018.fastq.gz
wget -nc ftp://ftp.sra.ebi.ac.uk/vol1/fastq/SRR243/022/SRR24302022/SRR24302022.fastq.gz
wget -nc ftp://ftp.sra.ebi.ac.uk/vol1/fastq/SRR243/011/SRR24302011/SRR24302011.fastq.gz
wget -nc ftp://ftp.sra.ebi.ac.uk/vol1/fastq/SRR243/009/SRR24302009/SRR24302009.fastq.gz
wget -nc ftp://ftp.sra.ebi.ac.uk/vol1/fastq/SRR243/016/SRR24302016/SRR24302016.fastq.gz
wget -nc ftp://ftp.sra.ebi.ac.uk/vol1/fastq/SRR243/020/SRR24302020/SRR24302020.fastq.gz
wget -nc ftp://ftp.sra.ebi.ac.uk/vol1/fastq/SRR243/008/SRR24302008/SRR24302008.fastq.gz
wget -nc ftp://ftp.sra.ebi.ac.uk/vol1/fastq/SRR243/025/SRR24302025/SRR24302025.fastq.gz
wget -nc ftp://ftp.sra.ebi.ac.uk/vol1/fastq/SRR243/014/SRR24302014/SRR24302014.fastq.gz

Submit the download script to the cluster:

qsub launch_ENA_download.pbs

Monitor progress of job:

qjobs

Preparing “sample sheet” index file:

#fetch samples names
ls --color=never *.gz -1 | awk -F "." '{print $1}' > sample.names

#fetch the path to each FASTQ file
for i in `ls --color=never *.gz`; do echo $i; readlink -f $i >> sample.path ; done

#merge the sample names and FASTQ path
echo "sample,fastq_1" > samplesheet.csv 
paste sample.names sample.path | awk '{print $1 "," $2}' >> samplesheet.csv

#remove intermediate files
rm sample.names sample.path
  • No labels