Abstract
The multiple testing problem attributed to gene expression analysis is challenging not only by its size, but also by possible dependence between the expression levels of different genes resulting from co-regulations of the genes. Furthermore, the measurement errors of these expression levels may be dependent as well since they are subjected to several technical factors. Multiple testing of such data faces the challenge of correlated test statistics. In such a case, the control of the False Discovery Rate (FDR) is not straightforward, and thus demands new approaches and solutions that will address multiplicity while accounting for this dependency. This paper investigates the effects of dependency between bormal test statistics on FDR control in two-sided testing, using the linear step-up procedure (BH) of Benjamini and Hochberg (1995). The case of two multiple hypotheses is examined first. A simulation study offers primary insight into the behavior of the FDR subjected to different levels of correlation and distance between null and alternative means. A theoretical analysis follows in order to obtain explicit upper bounds to the FDR. These results are then extended to more than two multiple tests, thereby offering a better perspective on the effect of the proportion of false null hypotheses, as well as the structure of the test statistics correlation matrix. An example from gene expression data analysis is presented.
Original language | English |
---|---|
Pages (from-to) | 107-126 |
Number of pages | 20 |
Journal | Biometrical Journal |
Volume | 49 |
Issue number | 1 |
DOIs | |
State | Published - 1 Jan 2007 |
Externally published | Yes |
Keywords
- Correlated test statistics
- False discovery rate
- Gene expression
- Microarrays
- Multiple comparisons
ASJC Scopus subject areas
- Statistics and Probability
- Statistics, Probability and Uncertainty