TABASCO: A Fast, Simplified Model for Molecular Generation with Improved Physical Quality
TABASCO: A Fast, Simplified Model for Molecular Generation with Improved Physical Quality
State-of-the-art models for 3D molecular generation are based on significant inductive biases, SE(3), permutation equivariance to respect symmetry and graph message-passing networks to capture local chemistry, yet the generated molecules still struggle with physical plausibility. We introduce TABASCO which relaxes these assumptions: The model has a standard non-equivariant transformer architecture, treats atoms in a molecule as sequences and reconstructs bonds deterministically after generation. The absence of equivariant layers and message passing allows us to significantly simplify the model architecture and scale data throughput. On the GEOM-Drugs benchmark TABASCO achieves state-of-the-art PoseBusters validity and delivers inference roughly 10x faster than the strongest baseline, while exhibiting emergent rotational equivariance despite symmetry not being hard-coded. Our work offers a blueprint for training minimalist, high-throughput generative models suited to specialised tasks such as structure- and pharmacophore-based drug design. We provide a link to our implementation at github.com/carlosinator/tabasco.
Carlos Vonessen、Charles Harris、Miruna Cretu、Pietro Liò
分子生物学药学
Carlos Vonessen,Charles Harris,Miruna Cretu,Pietro Liò.TABASCO: A Fast, Simplified Model for Molecular Generation with Improved Physical Quality[EB/OL].(2025-07-01)[2025-07-17].https://arxiv.org/abs/2507.00899.点此复制
评论