首页|DINGO: Constrained Inference for Diffusion LLMs

DINGO: Constrained Inference for Diffusion LLMs

来源：

英文摘要

Diffusion LLMs have emerged as a promising alternative to conventional autoregressive LLMs, offering significant potential for improved runtime efficiency. However, existing diffusion models lack the ability to provably enforce user-specified formal constraints, such as regular expressions, which makes them unreliable for tasks that require structured outputs, such as fixed-schema JSON generation. Unlike autoregressive models that generate tokens sequentially, diffusion LLMs predict a block of tokens in parallel. This parallelism makes traditional constrained decoding algorithms, which are designed for sequential token prediction, ineffective at preserving the true output distribution. To address this limitation, we propose DINGO, a dynamic programming-based constrained decoding strategy that is both efficient and provably distribution-preserving. DINGO enables sampling of output strings with the highest probability under the model's predicted distribution, while strictly satisfying any user-specified regular expression. On standard symbolic math and JSON generation benchmarks, DINGO achieves up to a 68 percentage point improvement over unconstrained inference

作者：Tarun Suresh、Debangshu Banerjee、Shubham Ugare、Sasa Misailovic、Gagandeep Singh

作者单位：

学科分类：计算技术、计算机技术

推荐引用：Tarun Suresh,Debangshu Banerjee,Shubham Ugare,Sasa Misailovic,Gagandeep Singh.DINGO: Constrained Inference for Diffusion LLMs[EB/OL].(2025-05-29)[2025-06-13].https://arxiv.org/abs/2505.23061.点此复制

DINGO: Constrained Inference for Diffusion LLMs

DINGO: Constrained Inference for Diffusion LLMs

评论