|国家预印本平台
首页|Finding Distributions that Differ, with False Discovery Rate Control

Finding Distributions that Differ, with False Discovery Rate Control

Finding Distributions that Differ, with False Discovery Rate Control

来源:Arxiv_logoArxiv
英文摘要

We consider the problem of comparing a reference distribution with several other distributions. Given a sample from both the reference and the comparison groups, we aim to identify the comparison groups whose distributions differ from that of the reference group. Viewing this as a multiple testing problem, we introduce a methodology that provides exact, distribution-free control of the false discovery rate. To do so, we introduce the concept of batch conformal p-values and demonstrate that they satisfy positive regression dependence across the groups [Benjamini and Yekutieli, 2001], thereby enabling control of the false discovery rate through the Benjamini-Hochberg procedure. The proof of positive regression dependence introduces a novel technique for the inductive construction of rank vectors with almost sure dominance under exchangeability. We evaluate the performance of the proposed procedure through simulations, where, despite being distribution-free, in some cases they show performance comparable to methods with knowledge of the data-generating normal distribution; and further have more power than direct approaches based on conformal out-of-distribution detection. Further, we illustrate our methods on a Hepatitis C treatment dataset, where they can identify patient groups with large treatment effects; and on the Current Population Survey dataset, where they can identify sub-population with long work hours.

Yonghoon Lee、Edgar Dobriban、Eric Tchetgen Tchetgen

医学研究方法数学

Yonghoon Lee,Edgar Dobriban,Eric Tchetgen Tchetgen.Finding Distributions that Differ, with False Discovery Rate Control[EB/OL].(2025-05-19)[2025-06-07].https://arxiv.org/abs/2505.13769.点此复制

评论