Gaussian copula correlation network analysis with application to multi-omics data
Reconstructing gene regulatory networks from large-scale heterogeneous data is a key challenge in biology. In multi-omics data analysis, networks based on pairwise statistical association measures remain popular because they are easy to build and understand. With mixed-type (discrete and continuous) data, however, choosing a suitable association measure remains an important issue. We propose a novel approach based on the Gaussian copula, whose parameters represent the links of the network. We obtain novel properties of the model to guide the interpretation of the network. To estimate the copula parameters, we calculate a semiparametric pairwise likelihood for mixed data. An extensive simulation study shows that the proposed estimation procedure accurately estimates the copula correlation matrix. The methodology is also applied to a real ICGC dataset on breast cancer and is implemented in heterocop, a freely available R package.
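To make the idea of a copula correlation concrete, the following is a minimal Python sketch for the simplest case of two continuous variables. It is not the heterocop API and not the semiparametric pairwise likelihood described above, which is what handles mixed discrete and continuous pairs. Instead it swaps in a plain normal-scores (rank-based) estimator. All function names, the margins and the edge threshold are assumptions chosen for illustration.

```python
import math
import random
from statistics import NormalDist

STD_NORMAL = NormalDist()  # standard normal, used for its quantile function


def sample_copula_pair(rho, n, seed=0):
    """Draw n pairs linked by a Gaussian copula with correlation rho.

    The margins are deliberately non-Gaussian (hypothetical choice):
    x is log-normal and y is a cubed Gaussian. Both transforms are
    strictly increasing, so the copula is unchanged.
    """
    rng = random.Random(seed)
    xs, ys = [], []
    for _ in range(n):
        z1 = rng.gauss(0.0, 1.0)
        z2 = rho * z1 + math.sqrt(1.0 - rho ** 2) * rng.gauss(0.0, 1.0)
        xs.append(math.exp(z1))
        ys.append(z2 ** 3)
    return xs, ys


def normal_scores(values):
    """Map each value to the normal quantile of its rescaled rank."""
    n = len(values)
    order = sorted(range(n), key=lambda i: values[i])
    scores = [0.0] * n
    for rank, i in enumerate(order, start=1):
        scores[i] = STD_NORMAL.inv_cdf(rank / (n + 1))
    return scores


def pearson(a, b):
    """Plain Pearson correlation of two equal-length sequences."""
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    return cov / math.sqrt(var_a * var_b)


def estimate_copula_correlation(xs, ys):
    """Normal-scores estimate of the copula correlation (continuous pair only)."""
    return pearson(normal_scores(xs), normal_scores(ys))


def has_edge(rho_hat, threshold=0.3):
    """Hypothetical network rule: keep a link when |rho_hat| exceeds a threshold."""
    return abs(rho_hat) > threshold


xs, ys = sample_copula_pair(rho=0.6, n=4000, seed=1)
rho_hat = estimate_copula_correlation(xs, ys)
print(f"estimated copula correlation: {rho_hat:.3f}, edge: {has_edge(rho_hat)}")
```

Because the estimator depends on the data only through ranks, it is unaffected by the shape of the margins. This rank-based shortcut no longer applies once one of the variables is discrete, which is the setting the pairwise likelihood above addresses.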
Ekaterina Tomilina, Florence Jaffrézic, Gildas Mazo
MaIAGE, GABI; GABI; MaIAGE
Biological science research methods, biological science research techniques
Ekaterina Tomilina, Florence Jaffrézic, Gildas Mazo. Gaussian copula correlation network analysis with application to multi-omics data [EB/OL]. (2025-06-10) [2025-06-18]. https://arxiv.org/abs/2506.08586.