|国家预印本平台
首页|Foundation Molecular Grammar: Multi-Modal Foundation Models Induce Interpretable Molecular Graph Languages

Foundation Molecular Grammar: Multi-Modal Foundation Models Induce Interpretable Molecular Graph Languages

Foundation Molecular Grammar: Multi-Modal Foundation Models Induce Interpretable Molecular Graph Languages

来源:Arxiv_logoArxiv
英文摘要

Recent data-efficient molecular generation approaches exploit graph grammars to introduce interpretability into the generative models. However, grammar learning therein relies on expert annotation or unreliable heuristics for algorithmic inference. We propose Foundation Molecular Grammar (FMG), which leverages multi-modal foundation models (MMFMs) to induce an interpretable molecular language. By exploiting the chemical knowledge of an MMFM, FMG renders molecules as images, describes them as text, and aligns information across modalities using prompt learning. FMG can be used as a drop-in replacement for the prior grammar learning approaches in molecular generation and property prediction. We show that FMG not only excels in synthesizability, diversity, and data efficiency but also offers built-in chemical interpretability for automated molecular discovery workflows. Code is available at https://github.com/shiningsunnyday/induction.

Michael Sun、Weize Yuan、Gang Liu、Wojciech Matusik、Jie Chen

化学计算技术、计算机技术

Michael Sun,Weize Yuan,Gang Liu,Wojciech Matusik,Jie Chen.Foundation Molecular Grammar: Multi-Modal Foundation Models Induce Interpretable Molecular Graph Languages[EB/OL].(2025-05-28)[2025-06-14].https://arxiv.org/abs/2505.22948.点此复制

评论