some set of genes in organisms exposed to at least two different conditions, and usually thentrying to determine which (if any) of these genes are actually changing in response to the changein experimental condition (2). Unfortunately, the random variation in microarrays is often largerelative to the changes trying to be detected. This makes it very hard to eliminate false positives,genes whose random fluctuations make them erroneously seem to respond to the experimentalcondition, and false negatives, genes whose random fluctuations mask the fact that they areactually responding to the experimental condition. It is standard statistical practice to handle this problem by using replicates, i.e., several microarrays run in identical settings (1-9). Althoughreplicates are highly effective, the expense of additional replicates makes it worthwhile tominimize the number needed by using more effective statistical tests.When there are only two conditions in an experiment, the conventional statistical test for differential expression is the t-test (2,6). But Long,
et. al.
(5, also 6,7) realized that by running aseparate t-test on each gene, valuable information is lost, namely the fact that the actual population standard deviation of all the genes is rather similar. Due to the small number of replicates in microarray experiments there are usually a few genes that, by chance, appear tohave extremely low standard deviations and thus register even a very small change as highlysignificant by a conventional t-test. In reality -- and as additional replicates would confirm -- thestandard deviations of these genes are much higher, and the change is not at all significant. Inresponse to this issue, Long
et. al.
derived using Bayesian statistics a method of transformingstandard deviations that, in essence, moves outlying standard deviations closer to the meanstandard deviation. Unfortunately, the resulting algorithm, which they call Cyber-T, works onlywith two condition experiments. Since this research is concerned with multiple conditionexperiments, it was necessary to expand the method to work with this type of experiment, thestandard statistical test for which is called ANOVA (ANalysis Of Variations) (8). Here we reportthe successful extension of the Cyber-T algorithm to multiple condition experiments, creating thealgorithm we call Cyber-ANOVA.In microarray experiments involving several conditions, the expression vector of eachgene formed by consecutive measurements of its expression level has a distinctive shape, whichcontains useful information. The more conditions in the experiment (and thus numbers in the
3
Leave a Comment