Section 8 Pathway Analysis

In Section 7, we covered analysis at the individual feature level (protein, peptide, phosphoprotein, etc.). While this is useful, it is not without its own set of shortcomings. For instance, there may be no features that pass the significance threshold after correcting for multiple hypothesis testing. Alternatively, there may be many features that are statistically significant, and interpreting this list can be tedious and “prone to investigator bias toward a hypothesis of interest” (Maleki et al., 2020). Another issue is that differential analysis fails to detect subtle, yet coordinated changes in groups of related features (Subramanian et al., 2005).

In order to address these, and other, issues, pathway analysis instead examines a priori defined gene sets—groups of genes that participate in the same biological pathway, share the same cellular location, etc. In this section, we will explore some common annotation databases, as well as two pathway analysis methods: Over-Representation Analysis (ORA) and Gene Set Enrichment Analysis (GSEA).

References

Maleki, F., Ovens, K., Hogan, D. J. and Kusalik, A. J., Gene Set Analysis: Challenges, Opportunities, and Future Research, Frontiers in Genetics, vol. 11, p. 654, accessed September 7, 2021, from https://www.frontiersin.org/article/10.3389/fgene.2020.00654, 2020. DOI: 10.3389/fgene.2020.00654
Subramanian, A., Tamayo, P., Mootha, V. K., Mukherjee, S., Ebert, B. L., Gillette, M. A., Paulovich, A., et al., Gene Set Enrichment Analysis: A Knowledge-Based Approach for Interpreting Genome-Wide Expression Profiles, Proceedings of the National Academy of Sciences of the United States of America, vol. 102, no. 43, pp. 15545–50, accessed September 7, 2021, from https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1239896/, October 2005. DOI: 10.1073/pnas.0506580102