Deep Learning for Protein Peptide binding Prediction: Incorporating Sequence, Structural and Language Model Features
Deep Learning for Protein Peptide binding Prediction: Incorporating Sequence, Structural and Language Model Features
Protein-peptide interactions play a crucial role in various cellular processes and are implicated in abnormal cellular behaviors leading to diseases such as cancer. Therefore, understanding these interactions is vital for both functional genomics and drug discovery efforts. Despite a significant increase in the availability of protein-peptide complexes, experimental methods for studying these interactions remain laborious, time-consuming, and expensive. Computational methods offer a complementary approach but often fall short in terms of prediction accuracy. To address these challenges, we introduce PepCNN, a deep learning-based prediction model that incorporates structural and sequence-based information from primary protein sequences. By utilizing a combination of half-sphere exposure, position specific scoring matrices, and pre-trained transformer language model, PepCNN outperforms state-of-the-art methods in terms of specificity, precision, and AUC. The PepCNN software and datasets are publicly available at https://github.com/abelavit/PepCNN.git.
Tsunoda Tatsuhiko、Sattar Abdul、Sharma Alok、Chandra Abel、Dehzangi Iman
生物科学研究方法、生物科学研究技术基础医学药学
Tsunoda Tatsuhiko,Sattar Abdul,Sharma Alok,Chandra Abel,Dehzangi Iman.Deep Learning for Protein Peptide binding Prediction: Incorporating Sequence, Structural and Language Model Features[EB/OL].(2025-03-28)[2025-06-27].https://www.biorxiv.org/content/10.1101/2023.09.02.556055.点此复制
评论