|国家预印本平台
首页|Mask prior-guided denoising diffusion improves inverse protein folding

Mask prior-guided denoising diffusion improves inverse protein folding

Mask prior-guided denoising diffusion improves inverse protein folding

来源:Arxiv_logoArxiv
英文摘要

Inverse protein folding generates valid amino acid sequences that can fold into a desired protein structure, with recent deep-learning advances showing strong potential and competitive performance. However, challenges remain, such as predicting elements with high structural uncertainty, including disordered regions. To tackle such low-confidence residue prediction, we propose a Mask-prior-guided denoising Diffusion (MapDiff) framework that accurately captures both structural information and residue interactions for inverse protein folding. MapDiff is a discrete diffusion probabilistic model that iteratively generates amino acid sequences with reduced noise, conditioned on a given protein backbone. To incorporate structural information and residue interactions, we develop a graph-based denoising network with a mask-prior pre-training strategy. Moreover, in the generative process, we combine the denoising diffusion implicit model with Monte-Carlo dropout to reduce uncertainty. Evaluation on four challenging sequence design benchmarks shows that MapDiff substantially outperforms state-of-the-art methods. Furthermore, the in silico sequences generated by MapDiff closely resemble the physico-chemical and structural characteristics of native proteins across different protein families and architectures.

Peizhen Bai、Filip Miljković、Xianyuan Liu、Leonardo De Maria、Rebecca Croasdale-Wood、Owen Rackham、Haiping Lu

生物科学研究方法、生物科学研究技术分子生物学

Peizhen Bai,Filip Miljković,Xianyuan Liu,Leonardo De Maria,Rebecca Croasdale-Wood,Owen Rackham,Haiping Lu.Mask prior-guided denoising diffusion improves inverse protein folding[EB/OL].(2025-07-25)[2025-08-04].https://arxiv.org/abs/2412.07815.点此复制

评论