Gene Expression COmparison
Microarrays provide a high-throughput approach for identifying functionally related proteins. A clear signal linking microarray data to interacting proteins has previously been observed with the yeast dataset and the MIPS (MPACT) PPI dataset. For human, we use the E-TABM-185 compendium dataset of 6000 GCRMA-normalised HG-U133A Affymetrix microarrays assembled by ArrayExpress. At most 5 values were allowed to be missing from a given gene’s expression profile, using the masking function of the C Clustering Library. For the human HG-U133A Affymetrix chips, 14,500 genes are well characterised, giving a very large set of similarity scores.
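The missing-value filter and masked comparison described above can be illustrated with a minimal Python sketch. It is not the GECO implementation: the choice of Pearson correlation as the similarity score, the NumPy-based masking (standing in for the C Clustering Library mask), and the names filter_profiles, masked_pearson and n_pairs are all assumptions made for illustration. Only the limit of 5 missing values comes from the text.

```python
import numpy as np

MAX_MISSING = 5  # from the text: at most 5 missing values per gene profile


def filter_profiles(expr, max_missing=MAX_MISSING):
    """Drop genes (rows) with more than max_missing missing values (NaN).

    Returns the kept rows, a boolean mask (True = value present) for those
    rows, and a boolean vector marking which input rows were kept.
    """
    mask = ~np.isnan(expr)
    keep = (~mask).sum(axis=1) <= max_missing
    return expr[keep], mask[keep], keep


def masked_pearson(x, y, mask_x, mask_y):
    """Pearson correlation over the arrays where both profiles have a value.

    Pearson is an assumed similarity measure, used here only as an example.
    Returns NaN when fewer than 3 arrays are shared or a profile is constant.
    """
    both = mask_x & mask_y
    if both.sum() < 3:
        return float("nan")
    xs = x[both] - x[both].mean()
    ys = y[both] - y[both].mean()
    denom = np.sqrt((xs ** 2).sum() * (ys ** 2).sum())
    if denom == 0:
        return float("nan")
    return float((xs * ys).sum() / denom)


def n_pairs(n_genes):
    """Number of distinct gene pairs for an all-against-all comparison."""
    return n_genes * (n_genes - 1) // 2
```

If every gene is compared with every other gene, n_pairs(14500) gives 105,117,750 pairs, which indicates why the resulting set of similarity scores is so large.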
A technical report (PDF, 1MB) describes the GECO algorithm in detail.
