Sitetack: A Deep Learning Model that Improves PTM Prediction by Using Known PTMs
Sitetack: A Deep Learning Model that Improves PTM Prediction by Using Known PTMs
Abstract Post-translational modifications (PTMs) increase the diversity of the proteome and are vital to organismal life and therapeutic strategies. Deep learning has been used to predict PTM locations. Still, limitations in datasets and their analyses compromise success. Here we evaluate the use of known PTM sites in prediction via sequence-based deep learning algorithms. Specifically, PTM locations were encoded as a separate amino acid before sequences were encoded via word embedding and passed into a convolutional neural network that predicts the probability of a modification at a given site. Without labeling known PTMs, our model is on par with others. With labeling, however, we improved significantly upon extant models. Moreover, knowing PTM locations can increase the predictability of a different PTM. Our findings highlight the importance of PTMs for the installation of additional PTMs. We anticipate that including known PTM locations will enhance the performance of other proteomic machine learning algorithms.
Kassim Alia A.、Gutierrez Benjamin D.、Gutierrez Clair S.、Raines Ronald T.
Department of Chemistry, Massachusetts Institute of TechnologyDepartment of Chemistry, Massachusetts Institute of Technology||Broad Institute of MIT and Harvard, CambridgeDepartment of Chemistry, Massachusetts Institute of Technology||Broad Institute of MIT and Harvard, Cambridge||Koch Institute for Integrated Cancer Research at MIT
生物科学研究方法、生物科学研究技术生物化学分子生物学
Kassim Alia A.,Gutierrez Benjamin D.,Gutierrez Clair S.,Raines Ronald T..Sitetack: A Deep Learning Model that Improves PTM Prediction by Using Known PTMs[EB/OL].(2025-03-28)[2025-04-27].https://www.biorxiv.org/content/10.1101/2024.06.03.596298.点此复制
评论