|国家预印本平台
首页|Planning with Diffusion Models for Target-Oriented Dialogue Systems

Planning with Diffusion Models for Target-Oriented Dialogue Systems

Planning with Diffusion Models for Target-Oriented Dialogue Systems

来源:Arxiv_logoArxiv
英文摘要

Target-Oriented Dialogue (TOD) remains a significant challenge in the LLM era, where strategic dialogue planning is crucial for directing conversations toward specific targets. However, existing dialogue planning methods generate dialogue plans in a step-by-step sequential manner, and may suffer from compounding errors and myopic actions. To address these limitations, we introduce a novel dialogue planning framework, DiffTOD, which leverages diffusion models to enable non-sequential dialogue planning. DiffTOD formulates dialogue planning as a trajectory generation problem with conditional guidance, and leverages a diffusion language model to estimate the likelihood of the dialogue trajectory. To optimize the dialogue action strategies, DiffTOD introduces three tailored guidance mechanisms for different target types, offering flexible guidance towards diverse TOD targets at test time. Extensive experiments across three diverse TOD settings show that DiffTOD can effectively perform non-myopic lookahead exploration and optimize action strategies over a long horizon through non-sequential dialogue planning, and demonstrates strong flexibility across complex and diverse dialogue scenarios. Our code and data are accessible through https://anonymous.4open.science/r/DiffTOD.

Hanwen Du、Bo Peng、Xia Ning

计算技术、计算机技术

Hanwen Du,Bo Peng,Xia Ning.Planning with Diffusion Models for Target-Oriented Dialogue Systems[EB/OL].(2025-04-23)[2025-06-04].https://arxiv.org/abs/2504.16858.点此复制

评论