|国家预印本平台
首页|Predicting the Impact of cis-Regulatory Variation on Alternative Polyadenylation

Predicting the Impact of cis-Regulatory Variation on Alternative Polyadenylation

Predicting the Impact of cis-Regulatory Variation on Alternative Polyadenylation

来源:bioRxiv_logobioRxiv
英文摘要

Alternative polyadenylation (APA) is a major driver of transcriptome diversity in human cells. Here, we use deep learning to predict APA from DNA sequence alone. We trained our model (APARENT, APA REgression NeT) on isoform expression data from over three million APA reporters, built by inserting random sequence into twelve distinct 3’UTR contexts. Predictions are highly accurate across both synthetic and genomic contexts; when tasked with inferring APA in human 3’UTRs, APARENT outperforms models trained exclusively on endogenous data. Visualizing features learned across all network layers reveals that APARENT recognizes sequence motifs known to recruit APA regulators, discovers previously unknown sequence determinants of cleavage site selection, and integrates these features into a comprehensive, interpretable cis-regulatory code. Finally, we use APARENT to quantify the impact of genetic variants on APA. Our approach detects pathogenic variants in a wide range of disease contexts, expanding our understanding of the genetic origins of disease.

Bogard Nicholas、Linder Johannes、Seelig Georg、Rosenberg Alexander B.

Department of Electrical Engineering, University of WashingtonPaul G. Allen School of Computer Science, University of WashingtonDepartment of Electrical Engineering, University of Washington||Paul G. Allen School of Computer Science, University of WashingtonDepartment of Electrical Engineering, University of Washington

10.1101/300061

基础医学生物科学研究方法、生物科学研究技术分子生物学

Bogard Nicholas,Linder Johannes,Seelig Georg,Rosenberg Alexander B..Predicting the Impact of cis-Regulatory Variation on Alternative Polyadenylation[EB/OL].(2025-03-28)[2025-05-28].https://www.biorxiv.org/content/10.1101/300061.点此复制

评论