|国家预印本平台
首页|CoheMark: A Novel Sentence-Level Watermark for Enhanced Text Quality

CoheMark: A Novel Sentence-Level Watermark for Enhanced Text Quality

CoheMark: A Novel Sentence-Level Watermark for Enhanced Text Quality

来源:Arxiv_logoArxiv
英文摘要

Watermarking technology is a method used to trace the usage of content generated by large language models. Sentence-level watermarking aids in preserving the semantic integrity within individual sentences while maintaining greater robustness. However, many existing sentence-level watermarking techniques depend on arbitrary segmentation or generation processes to embed watermarks, which can limit the availability of appropriate sentences. This limitation, in turn, compromises the quality of the generated response. To address the challenge of balancing high text quality with robust watermark detection, we propose CoheMark, an advanced sentence-level watermarking technique that exploits the cohesive relationships between sentences for better logical fluency. The core methodology of CoheMark involves selecting sentences through trained fuzzy c-means clustering and applying specific next sentence selection criteria. Experimental evaluations demonstrate that CoheMark achieves strong watermark strength while exerting minimal impact on text quality.

Yubo Gao、Jungang Li、Xiaojie Gu、Xuming Hu、Junyan Zhang、Shuliang Liu、Aiwei Liu

计算技术、计算机技术

Yubo Gao,Jungang Li,Xiaojie Gu,Xuming Hu,Junyan Zhang,Shuliang Liu,Aiwei Liu.CoheMark: A Novel Sentence-Level Watermark for Enhanced Text Quality[EB/OL].(2025-04-24)[2025-05-10].https://arxiv.org/abs/2504.17309.点此复制

评论