|国家预印本平台
首页|Local graph estimation: Interpretable network discovery for complex data

Local graph estimation: Interpretable network discovery for complex data

Local graph estimation: Interpretable network discovery for complex data

来源:Arxiv_logoArxiv
英文摘要

Large, complex datasets often include a small set of variables of primary interest, such as clinical outcomes or known biomarkers, whose relation to the broader system is the main focus of analysis. In these situations, exhaustively estimating the entire network may obscure insights into the scientific question at hand. To address this common scenario, we introduce local graph estimation, a statistical framework that focuses on inferring substructures around target variables rather than recovering the full network of inter-variable relationships. We show that traditional graph estimation methods often fail to recover local structure, and present pathwise feature selection (PFS) as an alternative approach. PFS estimates local subgraphs by iteratively applying feature selection and propagating uncertainty along network paths. We prove that PFS provides path discovery with finite-sample false discovery control and yields highly interpretable results, even in settings with mixed variable types and nonlinear dependencies. Applied to two cancer studies -- one analyzing county-level cancer incidence and mortality across the U.S., and another integrating gene, microRNA, protein, and clinical data from The Cancer Genome Atlas -- PFS uncovers biologically plausible networks that reveal both known and novel associations.

Omar Melikechi、David B. Dunson、Noureddine Melikechi、Jeffrey W. Miller

肿瘤学生物科学研究方法、生物科学研究技术

Omar Melikechi,David B. Dunson,Noureddine Melikechi,Jeffrey W. Miller.Local graph estimation: Interpretable network discovery for complex data[EB/OL].(2025-07-23)[2025-08-10].https://arxiv.org/abs/2507.17172.点此复制

评论