Guide your favorite protein sequence generative model
Guide your favorite protein sequence generative model
Generative machine learning models on sequences are transforming protein engineering. However, no principled framework exists for conditioning these models on auxiliary information, such as experimental data, in a plug-and-play manner. Herein, we present ProteinGuide -- a principled and general method for conditioning -- by unifying a broad class of protein generative models under a single framework. We demonstrate the applicability of ProteinGuide by guiding two protein generative models, ProteinMPNN and ESM3, to generate amino acid and structure token sequences, conditioned on several user-specified properties such as enhanced stability, enzyme classes, and CATH-labeled folds. We also used ProteinGuide with inverse folding models and our own experimental assay to design adenine base editor sequences for high activity.
Maria Lukarska、Luke M. Oltrogge、Jennifer Listgarten、Junhao Xiong、Ishan Gaur、David F. Savage、Hunter Nisonoff
生物科学研究方法、生物科学研究技术生物工程学
Maria Lukarska,Luke M. Oltrogge,Jennifer Listgarten,Junhao Xiong,Ishan Gaur,David F. Savage,Hunter Nisonoff.Guide your favorite protein sequence generative model[EB/OL].(2025-05-07)[2025-06-03].https://arxiv.org/abs/2505.04823.点此复制
评论