|国家预印本平台
首页|Length Controlled Generation for Black-box LLMs

Length Controlled Generation for Black-box LLMs

Length Controlled Generation for Black-box LLMs

来源:Arxiv_logoArxiv
英文摘要

Large language models (LLMs) have demonstrated impressive instruction following capabilities, while still struggling to accurately manage the length of the generated text, which is a fundamental requirement in many real-world applications. Existing length control methods involve fine-tuning the parameters of LLMs, which is inefficient and suboptimal for practical use. In this paper, we propose a novel iterative sampling framework for text length control, integrating the Metropolis-Hastings algorithm with an importance sampling acceleration strategy. This framework efficiently and reliably regulates LLMs to generate length-constrained text without modifying the underlying parameters, thereby preserving the original capabilities of LLMs. Experimental results demonstrate that our framework achieves almost 100\% success rates of length control on Llama3.1 for tasks such as length-controlled abstractive summarization and length-constrained instruction following, with minimal additional computational overhead. This also highlights the significant potential of our method for precise length control across a broader range of applications, without compromising the versatility of LLMs.

Lei Huang、Yuxuan Gu、Kun Zhu、Xiaocheng Feng、Wenjie Wang、Weihong Zhong、Tat-Seng Chua、Bing Qin

计算技术、计算机技术

Lei Huang,Yuxuan Gu,Kun Zhu,Xiaocheng Feng,Wenjie Wang,Weihong Zhong,Tat-Seng Chua,Bing Qin.Length Controlled Generation for Black-box LLMs[EB/OL].(2024-12-19)[2025-04-27].https://arxiv.org/abs/2412.14656.点此复制

评论