Similarity bias from consensus perturbational signatures from the L1000 Connectivity Map
Similarity bias from consensus perturbational signatures from the L1000 Connectivity Map
The Next Generation L1000 Connectivity Map (L1000) is a massive, high-throughput dataset measuring transcriptional changes - or signatures - in cancer cell lines due to chemical and genetic perturbation. Zhu et al recently presented Deep Learning-Based Efficacy Prediction System (DLEPS), a method that models averages of many signatures (CTPs or meta-signatures) of the same compound across conditions. However, our analysis shows that averaging increasing numbers of signatures from L1000 results in a substantial positive increase in signature similarities even among entirely unrelated perturbations, drastically shifting the baseline threshold for similarity. Consequently, DLEPS overstates how informative the model is about each compound, highlighting the need to account for this similarity bias in any L1000 Connectivity Map analysis.
Haibe-Kains Benjamin、Scott Katy、Smith Ian
生物科学研究方法、生物科学研究技术基础医学药学
Haibe-Kains Benjamin,Scott Katy,Smith Ian.Similarity bias from consensus perturbational signatures from the L1000 Connectivity Map[EB/OL].(2025-03-28)[2025-08-02].https://www.biorxiv.org/content/10.1101/2022.01.24.477615.点此复制
评论