Abstract
Motivation. Predominant pathway analysis approaches treat pathways as collections of individual genes and consider all pathway members as equally informative. As a result, at times spurious and misleading pathways are inappropriately identified as statistically significant, solely due to components that they share with the more relevant pathways. Results. We introduce the concept of Pathway Gene-Pair Signatures (Pathway- GPS) as pairs of genes that, as a combination, are specific to a single pathway. We devised and implemented a novel approach to pathway analysis, Signature Overrepresentation Analysis (SIGORA), which focuses on the statistically significant enrichment of Pathway-GPS in a user-specified gene list of interest. In a comparative evaluation of several published datasets, SIGORA outperformed traditional methods by delivering biologically more plausible and relevant results. Availability. An efficient implementation of SIGORA, as an R package with precompiled GPS data for several human and mouse pathway repositories is available for download from http://sigora.googlecode.com/svn/.
Original language | English |
---|---|
Article number | 229 |
Pages (from-to) | e229 |
Number of pages | 27 |
Journal | PeerJ |
Volume | 2013 |
Issue number | 1 |
DOIs | |
Publication status | Published - 2013 |
Keywords
- Functional analysis
- High-throughput data
- Over-representation analysis
- Pathway analysis
- Shared components
- Systems biology