|国家预印本平台
首页|Multi-use LLM Watermarking and the False Detection Problem

Multi-use LLM Watermarking and the False Detection Problem

Multi-use LLM Watermarking and the False Detection Problem

来源:Arxiv_logoArxiv
英文摘要

Digital watermarking is a promising solution for mitigating some of the risks arising from the misuse of automatically generated text. These approaches either embed non-specific watermarks to allow for the detection of any text generated by a particular sampler, or embed specific keys that allow the identification of the LLM user. However, simultaneously using the same embedding for both detection and user identification leads to a false detection problem, whereby, as user capacity grows, unwatermarked text is increasingly likely to be falsely detected as watermarked. Through theoretical analysis, we identify the underlying causes of this phenomenon. Building on these insights, we propose Dual Watermarking which jointly encodes detection and identification watermarks into generated text, significantly reducing false positives while maintaining high detection accuracy. Our experimental results validate our theoretical findings and demonstrate the effectiveness of our approach.

Zihao Fu、Chris Russell

计算技术、计算机技术自动化技术、自动化技术设备

Zihao Fu,Chris Russell.Multi-use LLM Watermarking and the False Detection Problem[EB/OL].(2025-06-19)[2025-07-16].https://arxiv.org/abs/2506.15975.点此复制

评论