Geometric Representation Condition Improves Equivariant Molecule Generation
Geometric Representation Condition Improves Equivariant Molecule Generation
Recent advances in molecular generative models have demonstrated great promise for accelerating scientific discovery, particularly in drug design. However, these models often struggle to generate high-quality molecules, especially in conditional scenarios where specific molecular properties must be satisfied. In this work, we introduce GeoRCG, a general framework to improve molecular generative models by integrating geometric representation conditions with provable theoretical guarantees. We decompose the generation process into two stages: first, generating an informative geometric representation; second, generating a molecule conditioned on the representation. Compared with single-stage generation, the easy-to-generate representation in the first stage guides the second stage generation toward a high-quality molecule in a goal-oriented way. Leveraging EDM and SemlaFlow as base generators, we observe significant quality improvements in unconditional molecule generation on the widely used QM9 and GEOM-DRUG datasets. More notably, in the challenging conditional molecular generation task, our framework achieves an average 50\% performance improvement over state-of-the-art approaches, highlighting the superiority of conditioning on semantically rich geometric representations. Furthermore, with such representation guidance, the number of diffusion steps can be reduced to as small as 100 while largely preserving the generation quality achieved with 1,000 steps, thereby significantly reducing the generation iterations needed. Code is available at https://github.com/GraphPKU/GeoRCG.
Cai Zhou、Zian Li、Xiyuan Wang、Xingang Peng、Muhan Zhang
计算技术、计算机技术分子生物学
Cai Zhou,Zian Li,Xiyuan Wang,Xingang Peng,Muhan Zhang.Geometric Representation Condition Improves Equivariant Molecule Generation[EB/OL].(2025-07-28)[2025-08-23].https://arxiv.org/abs/2410.03655.点此复制
评论