Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

  • Assemble the genome of two M. alpina strains (Mortierella alpina and Mortierella sp.) sequenced at QUT.

  • Identify DAG (Diacyl-glycerol) and Phopholipase (PLA, PLB, PLC and/or PLD) genes

...

Code Block
#step1: decompress file
gzip -d GCA_015679415.1_UCR_MalpAD072_1.0_protein.faa.gz

#step2: wrap sequence to one line
python2.7 /work/speight_team/projects/yeast_genomes/scripts/extract_seqs.py GCA_015679415.1_UCR_MalpAD072_1.0_protein.faa 0 | sed 's/lcl|//' > GCA_015679415.1_UCR_MalpAD072_1.0_protein.mod.faa

#step3" check file
less -S GCA_015679415.1_UCR_MalpAD072_1.0_protein.mod.faa

#use grep to fetch name of interest
grep -A 1 "Phospholipase" GCA_015679415.1_UCR_MalpAD072_1.0_protein.mod.faa | sed '/^--$/d' > Phospholipase_proteins.fasta

...