This document proposes a framework for addressing multiple testing dependence by modeling dependence at the data level rather than the p-value level. It introduces the concept of a dependence kernel, which captures dependence between tests, and shows that fitting a model which includes both primary variables of interest and a dependence kernel as covariates results in independent test statistics under the null hypothesis. This allows for use of standard multiple testing procedures without modification. The approach is demonstrated on simulated data, where it is shown to correct for dependence and provide more accurate control of false discovery rates and test statistic rankings compared to approaches that ignore dependence.
Related topics: