{"type":"doc","content":[{"type":"paragraph"},{"type":"extension","attrs":{"layout":"default","extensionType":"com.atlassian.confluence.macro.core","extensionKey":"toc","parameters":{"macroParams":{},"macroMetadata":{"macroId":{"value":"352d0cc1-966d-4265-8860-9d3fbcb907a8"},"schemaVersion":{"value":"1"},"title":"Table of Contents"}},"localId":"aa0a2a5e-6b2d-4f9e-b767-becc35f34fbb"}},{"type":"heading","attrs":{"level":2},"content":[{"text":"1. What is functional annotation?","type":"text"}]},{"type":"paragraph","content":[{"text":"Many types of genetic analysis will output a set of genes that are associated with a specific experimental condition. The classic example of this is RNA-Seq, which outputs a set of genes that are differentially expressed between experimental conditions. But micro RNA, epigenetics (e.g. differential methylation), variant calling and various other analysis types can also generate a set of condition-based genes.","type":"text"}]},{"type":"paragraph","content":[{"text":"Functional annotation uses a set of genes (such as differentially expressed genes) to examine enrichment of these genes in ","type":"text"},{"text":"Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways","type":"text","marks":[{"type":"link","attrs":{"href":"https://www.genome.jp/kegg/"}}]},{"text":" and ","type":"text"},{"text":"Gene Ontology (GO) terms","type":"text","marks":[{"type":"link","attrs":{"href":"http://geneontology.org/"}}]},{"text":".","type":"text"}]},{"type":"heading","attrs":{"level":3},"content":[{"text":"KEGG","type":"text"}]},{"type":"blockquote","content":[{"type":"paragraph","content":[{"text":".. is a database resource for understanding high-level functions and utilities of the biological system, such as the cell, the organism and the ecosystem, from genomic and molecular-level information. 
It is a computer representation of the biological system, consisting of molecular building blocks of genes and proteins (genomic information) and chemical substances (chemical information) that are integrated with the knowledge on molecular wiring diagrams of interaction, reaction and relation networks (systems information). It also contains disease and drug information (health information) as perturbations to the biological system.","type":"text"}]}]},{"type":"heading","attrs":{"level":3},"content":[{"text":"GO","type":"text"}]},{"type":"blockquote","content":[{"type":"paragraph","content":[{"text":".. provides a computational representation of our current scientific knowledge about the functions of genes (or, more properly, the protein and non-coding RNA molecules produced by genes) from many different organisms, from humans to bacteria. It is widely used to support scientific research, and has been cited in tens of thousands of publications.","type":"text"}]}]},{"type":"blockquote","content":[{"type":"paragraph","content":[{"text":"Understanding gene function—how individual genes contribute to the biology of an organism at the molecular, cellular and organism levels—is one of the primary aims of biomedical research. 
Moreover, experimental knowledge obtained in one organism is often applicable to other organisms, particularly if the organisms share the relevant genes because they inherited them from their common ancestor.","type":"text"}]}]},{"type":"blockquote","content":[{"type":"paragraph","content":[{"text":"Associations of gene products to GO terms are statements that describe","type":"text"}]}]},{"type":"blockquote","content":[{"type":"paragraph","content":[{"text":"Molecular Function: the molecular activities of individual gene products ","type":"text"}]},{"type":"paragraph","content":[{"text":"Cellular Component: where the gene products are active","type":"text"}]},{"type":"paragraph","content":[{"text":"Biological Process: the pathways and larger processes to which that gene product’s activity contributes","type":"text"}]}]},{"type":"paragraph"},{"type":"heading","attrs":{"level":2},"content":[{"text":"2. R Packages","type":"text"}]},{"type":"paragraph","content":[{"text":"We’ll be using two main R packages:","type":"text"}]},{"type":"paragraph","content":[{"text":"Functional enrichment for KEGG pathways and GO terms was completed using the package ","type":"text"},{"type":"inlineCard","attrs":{"url":"https://bioconductor.org/packages/release/bioc/html/clusterProfiler.html"}},{"text":" ","type":"text"}]},{"type":"paragraph","content":[{"text":"You can read more about clusterProfiler’s statistical and analysis methods here: ","type":"text"},{"type":"inlineCard","attrs":{"url":"https://yulab-smu.top/biomedical-knowledge-mining-book/index.html"}},{"text":" ","type":"text"}]},{"type":"paragraph","content":[{"text":"Annotated KEGG pathway maps are generated using the package ","type":"text"},{"type":"inlineCard","attrs":{"url":"https://www.bioconductor.org/packages/release/bioc/html/pathview.html"}},{"text":" ","type":"text"}]},{"type":"paragraph"},{"type":"heading","attrs":{"level":2},"content":[{"text":"3. 
Connect to an rVDI virtual desktop machine","type":"text"}]},{"type":"paragraph","content":[{"text":"As with the differential expression analyses in sessions 3 and 4, we will run this analysis in RStudio on an rVDI virtual machine. The reason is the same as before: the required R packages are pre-installed on these virtual machines, which saves time. And, as before, you can copy and paste this script to RStudio on your local computer and adapt it to your own dataset.","type":"text"}]},{"type":"paragraph","content":[{"text":"To access and run an rVDI virtual desktop:","type":"text","marks":[{"type":"strong"}]}]},{"type":"paragraph","content":[{"text":"Go to ","type":"text"},{"text":"https://rvdi.qut.edu.au/","type":"text","marks":[{"type":"link","attrs":{"href":"https://rvdi.qut.edu.au/"}}]}]},{"type":"paragraph","content":[{"text":"Click on ‘","type":"text"},{"text":"VMware Horizon HTML Access","type":"text","marks":[{"type":"strong"}]},{"text":"’","type":"text"}]},{"type":"paragraph","content":[{"text":"Log on with your QUT username and password","type":"text"}]},{"type":"paragraph","content":[{"text":"*","type":"text"},{"text":"NOTE","type":"text","marks":[{"type":"strong"}]},{"text":": you need to be connected to the QUT network first, either on campus or remotely via VPN.","type":"text"}]},{"type":"paragraph"},{"type":"heading","attrs":{"level":2},"content":[{"text":"4. Preparing your data","type":"text"}]},{"type":"paragraph","content":[{"text":"We’ll be using the RNA-Seq differential expression results you generated in session 3.","type":"text"}]},{"type":"paragraph","content":[{"text":"These will be in a file called ‘","type":"text"},{"text":"DE_genes_Basal_cells_Vs_Differentiated_cells.csv","type":"text","marks":[{"type":"strong"}]},{"text":"’ in the ‘","type":"text"},{"text":"Tables","type":"text","marks":[{"type":"strong"}]},{"text":"’ output folder from session 3. 
This file contains a list of differentially expressed genes that you generated using DESeq2. This list of DE genes will be used as input for functional annotation.","type":"text"}]},{"type":"paragraph"},{"type":"paragraph","content":[{"text":"a. In windows explorer, go to: ","type":"text","marks":[{"type":"strong"}]},{"text":"H:\\workshop\\RNAseq","type":"text","marks":[{"type":"textColor","attrs":{"color":"#36b37e"}},{"type":"strong"}]}]},{"type":"paragraph"},{"type":"paragraph","content":[{"text":"b. In this folder, create a new folder called ‘","type":"text","marks":[{"type":"strong"}]},{"text":"functional_annotation","type":"text","marks":[{"type":"textColor","attrs":{"color":"#36b37e"}},{"type":"strong"}]},{"text":"’ (case-sensitive)","type":"text","marks":[{"type":"strong"}]}]},{"type":"paragraph"},{"type":"paragraph","content":[{"text":"c. Open RStudio and create a new R script (‘File’ → “New File” → “R script”). Now hit ‘File’ → ‘Save’ and save the script in the ","type":"text","marks":[{"type":"strong"}]},{"text":"H:\\workshop\\RNAseq\\functional_annotation","type":"text","marks":[{"type":"textColor","attrs":{"color":"#36b37e"}},{"type":"strong"}]},{"text":" folder you created. Save the script file as ‘","type":"text","marks":[{"type":"strong"}]},{"text":"functional_annotation.R","type":"text","marks":[{"type":"textColor","attrs":{"color":"#ff991f"}},{"type":"strong"}]},{"text":"’","type":"text","marks":[{"type":"strong"}]}]},{"type":"paragraph"},{"type":"paragraph","content":[{"text":"In the following sections you will be copying and running the R code into your ","type":"text"},{"text":"functional_annotation.R","type":"text","marks":[{"type":"textColor","attrs":{"color":"#ff991f"}},{"type":"strong"}]},{"text":" script.","type":"text"}]},{"type":"paragraph"},{"type":"heading","attrs":{"level":2},"content":[{"text":"5. 
Installing packages","type":"text"}]},{"type":"paragraph","content":[{"text":"IMPORTANT: don’t run this code in this rVDI session, the packages are already installed. This code is only here if you need to run the analysis on your computer.","type":"text","marks":[{"type":"textColor","attrs":{"color":"#bf2600"}},{"type":"strong"}]}]},{"type":"table","attrs":{"layout":"default","width":1800.0,"localId":"0537d395-8cea-4186-8bbd-4e6d922e7a47"},"content":[{"type":"tableRow","content":[{"type":"tableHeader","attrs":{"colspan":1,"background":"#fff0b3","rowspan":1},"content":[{"type":"paragraph","content":[{"text":"#### 5. Installing required packages ####","type":"text"}]},{"type":"paragraph","content":[{"text":" ","type":"text"}]},{"type":"paragraph","content":[{"text":"bioconductor_packages <- c(\"clusterProfiler\", \"pathview\", \"AnnotationHub\", \"org.Mm.eg.db\")","type":"text"}]},{"type":"paragraph","content":[{"text":"cran_packages <- c(\"tidyverse\", \"ggplot2\", \"plyr\", \"readxl\", \"scales\")","type":"text"}]},{"type":"paragraph","content":[{"text":"# Compares installed packages to above packages and returns a vector of missing packages","type":"text"}]},{"type":"paragraph","content":[{"text":"new_packages <- bioconductor_packages[!(bioconductor_packages %in% installed.packages()[,\"Package\"])]","type":"text"}]},{"type":"paragraph","content":[{"text":"new_cran_packages <- cran_packages[!(cran_packages %in% installed.packages()[,\"Package\"])]","type":"text"}]},{"type":"paragraph","content":[{"text":"# Install missing bioconductor packages","type":"text"}]},{"type":"paragraph","content":[{"text":"if (!requireNamespace(\"BiocManager\", quietly = TRUE))","type":"text"}]},{"type":"paragraph","content":[{"text":" install.packages(\"BiocManager\")","type":"text"}]},{"type":"paragraph","content":[{"text":"BiocManager::install(new_packages)","type":"text"}]},{"type":"paragraph","content":[{"text":"# Install missing cran 
packages","type":"text"}]},{"type":"paragraph","content":[{"text":"if (length(new_cran_packages)) install.packages(new_cran_packages, repos = \"","type":"text"},{"text":"http://cran.us.r-project.org","type":"text","marks":[{"type":"underline"},{"type":"link","attrs":{"href":"http://cran.us.r-project.org/"}}]},{"text":"\")","type":"text"}]},{"type":"paragraph","content":[{"text":"# Update the packages listed above to the latest version","type":"text"}]},{"type":"paragraph","content":[{"text":"update.packages(oldPkgs = bioconductor_packages, ask = FALSE, repos = BiocManager::repositories())","type":"text"}]},{"type":"paragraph","content":[{"text":"update.packages(oldPkgs = cran_packages, ask = FALSE, repos = \"","type":"text"},{"text":"http://cran.us.r-project.org","type":"text","marks":[{"type":"underline"},{"type":"link","attrs":{"href":"http://cran.us.r-project.org/"}}]},{"text":"\")","type":"text"}]}]}]}]},{"type":"heading","attrs":{"level":2},"content":[{"text":"6. Loading packages","type":"text"}]},{"type":"paragraph","content":[{"text":"Copy and paste this code into your R script, then run it (do the same with the code in all following sections).","type":"text"}]},{"type":"codeBlock","content":[{"text":"#### 6. Loading required packages ####\n\n# This section needs to be run every time\n# Load packages\nbioconductor_packages <- c(\"clusterProfiler\", \"pathview\", \"AnnotationHub\", \"org.Mm.eg.db\")\ncran_packages <- c(\"tidyverse\", \"ggplot2\", \"plyr\", \"readxl\", \"scales\")\nlapply(cran_packages, require, character.only = TRUE)\nlapply(bioconductor_packages, require, character.only = TRUE)","type":"text"}]},{"type":"paragraph"},{"type":"heading","attrs":{"level":2},"content":[{"text":"7. Gene ID conversion","type":"text"}]},{"type":"paragraph","content":[{"text":"KEGG pathways are based on ","type":"text"},{"text":"Entrez gene IDs","type":"text","marks":[{"type":"link","attrs":{"href":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1761442/"}}]},{"text":", NCBI’s numeric gene identifiers. 
Therefore, to run this workflow you need to convert your input gene IDs (in our case gene symbols) into Entrez IDs.","type":"text"}]},{"type":"paragraph","content":[{"text":"NOTE","type":"text","marks":[{"type":"strong"}]},{"text":": You need to have the correct path for the ","type":"text"},{"text":"DE_genes_Basal_cells_Vs_Differentiated_cells.csv","type":"text","marks":[{"type":"code"}]},{"text":" file that you generated in session 3. It should be in ","type":"text"},{"text":"H:/workshop/RNAseq/DE_analysis_workshop/Tables","type":"text","marks":[{"type":"code"}]},{"text":" for most people. If not, change this to the correct path containing your DE genes table.","type":"text"}]},{"type":"paragraph","content":[{"text":"If you’ve deleted your ","type":"text"},{"text":"DE_genes_Basal_cells_Vs_Differentiated_cells.csv","type":"text","marks":[{"type":"code"}]},{"text":" file or didn’t attend session 3, you can download the file here: ","type":"text"},{"type":"mediaInline","attrs":{"id":"8e6fca17-9646-42ae-822b-2e1357bf000c","collection":"contentId-2036269057"}},{"text":" Then create the ","type":"text"},{"text":"H:/workshop/RNAseq/DE_analysis_workshop/Tables","type":"text","marks":[{"type":"code"}]},{"text":" folders and put the file in the Tables folder.","type":"text"}]},{"type":"paragraph"},{"type":"codeBlock","content":[{"text":"\n#### 7. Convert to Entrez gene IDs ####\n\n# Set your working directory\nsetwd(\"H:/workshop/RNAseq/functional_annotation\")\n\n# Import your DE genes data:\n# NOTE: THE DIRECTORY THAT CONTAINS YOUR RESULTS DATA MAY VARY. YOU NEED TO LOOK IN 'H:/workshop/RNAseq' TO SEE WHERE YOUR 'Tables' SUBDIRECTORY IS, AND CHANGE THE BELOW PATH TO REFLECT THAT.\ndat <- read.csv(\"H:/workshop/RNAseq/DE_analysis_workshop/Tables/DE_genes_Basal_cells_Vs_Differentiated_cells.csv\", row.names = 1)\n\n# Convert the gene symbol column to Entrez IDs\n# NOTE: 'OrgDb=' needs to be your organism database. 
We're using mouse here, so we use 'OrgDb=org.Mm.eg.db'; if your species were human you'd use 'OrgDb=org.Hs.eg.db', etc\ngene_list <- bitr(gene = dat$SYMBOL, fromType=\"SYMBOL\", toType=\"ENTREZID\", OrgDb=org.Mm.eg.db, drop=TRUE)\n","type":"text"}]},{"type":"paragraph"},{"type":"heading","attrs":{"level":2},"content":[{"text":"8. KEGG pathway enrichment","type":"text"}]},{"type":"paragraph","content":[{"text":"Now we can run the clusterProfiler function ","type":"text"},{"text":"enrichKEGG","type":"text","marks":[{"type":"code"}]},{"text":" to match your set of Entrez IDs to the KEGG pathways for your species.","type":"text"}]},{"type":"paragraph","content":[{"text":"NOTE","type":"text","marks":[{"type":"strong"}]},{"text":": our data is from mouse, so we’re using the KEGG species ID for mouse (","type":"text"},{"text":"organism = \"mmu\"","type":"text","marks":[{"type":"code"}]},{"text":"). If your data is from another species, change this code (e.g. ","type":"text"},{"text":"organism = \"hsa\"","type":"text","marks":[{"type":"code"}]},{"text":" for human). The full list of KEGG species IDs is here:","type":"text"}]},{"type":"paragraph","content":[{"text":"https://www.genome.jp/kegg/catalog/org_list.html","type":"text","marks":[{"type":"link","attrs":{"href":"https://www.genome.jp/kegg/catalog/org_list.html"}}]}]},{"type":"codeBlock","content":[{"text":"#### 8. KEGG pathway enrichment ####\n\n# Use clusterProfiler's enrichKEGG() function to match your genes list to KEGG pathways\nkk <- enrichKEGG(gene = gene_list$ENTREZID, organism = \"mmu\", pvalueCutoff = 0.05, qvalueCutoff = 0.2)\n\n# Now you can save this as a table of enriched KEGG pathways\n# Create a 'results' subdirectory where all figures will be output\ndir.create(\"results\", showWarnings = FALSE)\nwrite.csv(as.data.frame(kk), \"./results/Enriched_KEGG_pathways.csv\")\n","type":"text"}]},{"type":"paragraph"},{"type":"heading","attrs":{"level":2},"content":[{"text":"9. 
Plotting enriched KEGG pathways","type":"text"}]},{"type":"paragraph","content":[{"text":"Here we’ll plot the enriched KEGG pathways. We’ll generate 3 plots (bar plot, dot plot, network plot) to visualise the same results in 3 different ways. This gives you options to present the data how you choose.","type":"text"}]},{"type":"paragraph","content":[{"text":"All of these plotting functions are part of the clusterProfiler package and can be modified to change the number of pathways shown and also change a wide array of other plot characteristics, such as plot colours, labels, legend, etc. More details can be seen here: ","type":"text"},{"type":"inlineCard","attrs":{"url":"https://yulab-smu.top/biomedical-knowledge-mining-book/index.html"}},{"text":" ","type":"text"}]},{"type":"paragraph"},{"type":"codeBlock","content":[{"text":"#### 9. Plotting KEGG results ####\n\n# Remove some text (' - Mus musculus (house mouse)') from each pathway description, for brevity\nkk@result$Description <- gsub(\" - Mus musculus (house mouse)\", \"\", kk@result$Description, fixed = T)\n\n# Create a barplot\np <- barplot(kk, showCategory = 14, font.size = 14)\np\n\n# Export as a 300dpi tiff\ntiff_exp <- \"./results/KEGG_bar.tiff\"\nggsave(file = tiff_exp, dpi = 300, compression = \"lzw\", device = \"tiff\", plot = p, width = 20, height = 20, units = \"cm\")\n\n# Export as a pdf\npdf_exp <- \"./results/KEGG_bar.pdf\"\nggsave(file = pdf_exp, device = \"pdf\", plot = p, width = 20, height = 20, units = \"cm\")\n\n\n# Dot plot\np <- dotplot(kk, showCategory = 8, font.size = 14)\np\n\n# Export as a 300dpi tiff\ntiff_exp <- \"./results/KEGG_dot.tiff\"\nggsave(file = tiff_exp, dpi = 300, compression = \"lzw\", device = \"tiff\", plot = p, width = 20, height = 20, units = \"cm\")\n\n# Export as a pdf\npdf_exp <- \"./results/KEGG_dot.pdf\"\nggsave(file = pdf_exp, device = \"pdf\", plot = p, width = 20, height = 20, units = \"cm\")\n\n\n# Network plot\np <- cnetplot(kk, showCategory = 8, colorEdge = 
FALSE, node_label = \"category\")\np\n\n# Export as a 300dpi tiff\ntiff_exp <- \"./results/KEGG_network.tiff\"\nggsave(file = tiff_exp, dpi = 300, compression = \"lzw\", device = \"tiff\", plot = p, width = 20, height = 20, units = \"cm\")\n\n# Export as a pdf\npdf_exp <- \"./results/KEGG_network.pdf\"\nggsave(file = pdf_exp, device = \"pdf\", plot = p, width = 20, height = 20, units = \"cm\")\n","type":"text"}]},{"type":"paragraph"},{"type":"heading","attrs":{"level":2},"content":[{"text":"10. KEGG pathway maps","type":"text"}]},{"type":"paragraph","content":[{"text":"The package ","type":"text"},{"text":"pathview","type":"text","marks":[{"type":"link","attrs":{"href":"https://www.bioconductor.org/packages/release/bioc/html/pathview.html"}}]},{"text":" can pull down KEGG pathway maps from the KEGG database and annotate these with your input genes. It shades the genes by colour and intensity (e.g. using log fold change data, upregulated genes can be coloured green and downregulated genes coloured red).","type":"text"}]},{"type":"paragraph","content":[{"text":"NOTE: this only plots one of your enriched pathways at a time. You’ll need to enter one of your enriched KEGG pathway IDs into the ","type":"text"},{"text":"keggpath <- ..","type":"text","marks":[{"type":"code"}]},{"text":" line. You can see your set of enriched pathways in the file ","type":"text"},{"text":"H:\\workshop\\RNAseq\\functional_annotation\\results\\Enriched_KEGG_pathways.csv","type":"text","marks":[{"type":"textColor","attrs":{"color":"#36b37e"}},{"type":"strong"}]},{"text":" or by typing ","type":"text"},{"text":"as.data.frame(kk)$ID","type":"text","marks":[{"type":"code"}]},{"text":" into the RStudio console.","type":"text"}]},{"type":"paragraph","content":[{"text":"You can change the KEGG pathway to a different pathway and re-run this for as many of your pathways as you like.","type":"text"}]},{"type":"codeBlock","content":[{"text":"#### 10. 
KEGG pathway maps ####\n\n# pathview requires a named vector as input, so we first will pull out your numeric data (e.g. log fold change data) and name this using your gene IDs.\nlfc <- dat$log2FoldChange\nnames(lfc) <- dat$SYMBOL\n\n# Select the pathway you want to annotate from one of the significantly enriched pathways\n# To view these:\nas.data.frame(kk)$ID\n\n# Enter one of these pathways below\nkeggpath <- \"mmu04360\"\n\n# Run the pathview function to generate the pathway map\npview <- pathview(gene.data = lfc,\n pathway.id = keggpath,\n species = \"mmu\",\n gene.idtype = \"SYMBOL\",\n low = list(gene = \"red\"),\n mid = list(gene = \"gray\"),\n high = list(gene = \"green\"),\n out.suffix = \"KEGG_pathway_map\",\n limit = list(gene=max(abs(lfc)), cpd=1))\n","type":"text"}]},{"type":"paragraph"},{"type":"paragraph"},{"type":"heading","attrs":{"level":2},"content":[{"text":"11. GO term enrichment","type":"text"}]},{"type":"paragraph","content":[{"text":"GO terms are functional groupings of genes. 
We can look at enrichment of GO terms in the same way as we did with KEGG pathways.","type":"text"}]},{"type":"paragraph","content":[{"text":"NOTE","type":"text","marks":[{"type":"strong"}]},{"text":": GO terms are hierarchical, so we first need to choose one of the three top-level terms in ","type":"text"},{"text":"ont <- \"..\"","type":"text","marks":[{"type":"code"}]},{"text":". These three terms are ","type":"text"},{"text":"Biological Processes","type":"text","marks":[{"type":"link","attrs":{"href":"https://www.informatics.jax.org/vocab/gene_ontology/GO:0008150"}}]},{"text":" (","type":"text"},{"text":"ont <- \"BP\"","type":"text","marks":[{"type":"code"}]},{"text":"), ","type":"text"},{"text":"Molecular Functions","type":"text","marks":[{"type":"link","attrs":{"href":"https://www.informatics.jax.org/vocab/gene_ontology"}}]},{"text":" (","type":"text"},{"text":"ont <- \"MF\"","type":"text","marks":[{"type":"code"}]},{"text":") and ","type":"text"},{"text":"Cellular Components","type":"text","marks":[{"type":"link","attrs":{"href":"https://www.informatics.jax.org/vocab/gene_ontology/GO:0005575"}}]},{"text":" (","type":"text"},{"text":"ont <- \"CC\"","type":"text","marks":[{"type":"code"}]},{"text":"). Run the two following code blocks (enrichment and plotting) three times, once for each value of ","type":"text"},{"text":"ont <- \"..\"","type":"text","marks":[{"type":"code"}]},{"text":".","type":"text"}]},{"type":"paragraph"},{"type":"codeBlock","content":[{"text":"#### 11. GO term enrichment ####\n\n# First select which of the top level terms you want to examine. You can run the analysis for each top-level term by changing this to another term then re-running this code cell and then the following GO code cells. 
The three terms are \"BP\", \"MF\", and \"CC\".\nont <- \"BP\"\n\n# Then run the enrichGO function\ngg <- enrichGO(gene = as.character(gene_list$ENTREZID), OrgDb = org.Mm.eg.db, ont = ont, pvalueCutoff = 0.05, qvalueCutoff = 0.2)\n\n# Now you can save this as a table of enriched GO terms\nwrite.csv(as.data.frame(gg), paste0(\"./results/Enriched_GO_terms_\", ont, \".csv\"))\n","type":"text"}]},{"type":"paragraph"},{"type":"heading","attrs":{"level":2},"content":[{"text":"12. Plotting enriched GO terms ","type":"text"}]},{"type":"paragraph","content":[{"text":"As with the KEGG pathways, we can plot a bar plot, dot plot and network plot.","type":"text"}]},{"type":"paragraph","content":[{"text":"In addition, we’re adding a fourth plot type here: a ","type":"text"},{"text":"directed acyclic graph","type":"text","marks":[{"type":"link","attrs":{"href":"https://en.wikipedia.org/wiki/Directed_acyclic_graph"}}]},{"text":". ","type":"text"}]},{"type":"paragraph","content":[{"text":"As previously mentioned, GO terms are hierarchical, with individual terms being linked to parent or child levels. The previous plots combine all levels under each of three top level terms. A directed acyclic graph allows a visualisation of how the enriched GO terms fit into the GO hierarchy.","type":"text"}]},{"type":"paragraph"},{"type":"codeBlock","content":[{"text":"#### 12. 
Plotting GO results ####\n\n# Bar plot\np <- barplot(gg, showCategory = 14, font.size = 14)\np\n\n# Output as tiff\ntiff_exp <- paste0(\"./results/\", ont, \"_GO_bar.tiff\")\nggsave(file = tiff_exp, dpi = 300, compression = \"lzw\", device = \"tiff\", plot = p, width = 20, height = 20, units = \"cm\")\n\n# Output as pdf\npdf_exp <- paste0(\"./results/\", ont, \"_GO_bar.pdf\")\nggsave(file = pdf_exp, device = \"pdf\", plot = p, width = 20, height = 20, units = \"cm\")\n\n# Dot plot\np <- dotplot(gg, showCategory = 8, font.size = 14)\np\n\n# Output as tiff\ntiff_exp <- paste0(\"./results/\", ont, \"_GO_dot.tiff\")\nggsave(file = tiff_exp, dpi = 300, compression = \"lzw\", device = \"tiff\", plot = p, width = 20, height = 20, units = \"cm\")\n\n# Output as pdf\npdf_exp <- paste0(\"./results/\", ont, \"_GO_dot.pdf\")\nggsave(file = pdf_exp, device = \"pdf\", plot = p, width = 20, height = 20, units = \"cm\")\n\n# Network plot\np <- cnetplot(gg, showCategory = 8, colorEdge = TRUE, node_label = \"category\")\np\n\n# Output as tiff\ntiff_exp <- paste0(\"./results/\", ont, \"_GO_network.tiff\")\nggsave(file = tiff_exp, dpi = 300, compression = \"lzw\", device = \"tiff\", plot = p, width = 20, height = 20, units = \"cm\")\n\n# Output as pdf\npdf_exp <- paste0(\"./results/\", ont, \"_GO_network.pdf\")\nggsave(file = pdf_exp, device = \"pdf\", plot = p, width = 20, height = 20, units = \"cm\")\n\n# Directed acyclic graph\np <- goplot(gg, showCategory = 30)\np\n\ntiff_exp <- paste0(\"./results/\", ont, \"_GO_DAG.tiff\")\nggsave(file = tiff_exp, dpi = 300, compression = \"lzw\", device = \"tiff\", plot = p, width = 20, height = 20, units = \"cm\")\n\n# Output as pdf\npdf_exp <- paste0(\"./results/\", ont, \"_GO_DAG.pdf\")\nggsave(file = pdf_exp, device = \"pdf\", plot = p, width = 20, height = 20, units = 
\"cm\")\n\n","type":"text"}]},{"type":"paragraph"},{"type":"paragraph"},{"type":"paragraph"},{"type":"paragraph"},{"type":"paragraph"},{"type":"paragraph"},{"type":"paragraph"}],"version":1}
