|国家预印本平台
首页|White-box Deep Neural Network Prediction of Genome-Wide Transcriptome Signatures

White-box Deep Neural Network Prediction of Genome-Wide Transcriptome Signatures

White-box Deep Neural Network Prediction of Genome-Wide Transcriptome Signatures

来源:bioRxiv_logobioRxiv
英文摘要

Abstract Prediction algorithms for protein or gene structures, including transcription factor binding from sequence information, have been transformative in understanding gene regulation. Here we ask whether human transcriptomic profiles can be predicted solely from the expression of transcription factors (TFs). To this end, we explore whether a neural network (NN) could predict the transcriptome from TFs. Using at least one hidden layer, we find that the expression of 1,600 TFs can explain >95% of variance in 25,000 genes. Using the light-up technique to inspect the trained NN, we find an overrepresentation of known TF-gene regulations. Furthermore, the learned prediction network has a hierarchical organization. A smaller set of around 125 core TFs could explain close to 80% of the variance. Interestingly, reducing the number of TFs below 500 induces a rapid decline in prediction performance. Next, we evaluated the prediction model using transcriptional data from 22 human diseases. The TFs were sufficient to predict the target genes’ dysregulation (rho=0.61, P < 10?216). By inspecting the model, key causative TFs could be extracted for subsequent validation using disease-associated genetic variants. In conclusion, we demonstrate the construction of an interpretable neural network predictor. Analysis of the predictors revealed key TFs that were inducing transcriptional changes during disease.

Magnusson Rasmus、Gustafsson Mika、Tegn¨|r Jesper N.

Bioinformatics, Department of Physics, Chemistry and Biology, Link?ping University||School of Bioscience, Systems Biology Research Center, University of Sk?vdeBioinformatics, Department of Physics, Chemistry and Biology, Link?ping UniversityBiological and Environmental Sciences and Engineering Division, Computer, Electrical and Mathematical Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST)||Unit of Computational Medicine, Department of Medicine, Solna, Center for Molecular Medicine, Karolinska Institutet||Science for Life Laboratory

10.1101/2021.02.11.430730

基础医学生物科学研究方法、生物科学研究技术分子生物学

Magnusson Rasmus,Gustafsson Mika,Tegn¨|r Jesper N..White-box Deep Neural Network Prediction of Genome-Wide Transcriptome Signatures[EB/OL].(2025-03-28)[2025-05-24].https://www.biorxiv.org/content/10.1101/2021.02.11.430730.点此复制

评论