{"title":"PersonaMark: Personalized LLM watermarking for model protection and user attribution","authors":"Yuehan Zhang, Peizhuo Lv, Yinpeng Liu, Yongqiang Ma, Wei Lu, Xiaofeng Wang, Xiaozhong Liu, Jiawei Liu","doi":"arxiv-2409.09739","DOIUrl":null,"url":null,"abstract":"The rapid development of LLMs brings both convenience and potential threats.\nAs costumed and private LLMs are widely applied, model copyright protection has\nbecome important. Text watermarking is emerging as a promising solution to\nAI-generated text detection and model protection issues. However, current text\nwatermarks have largely ignored the critical need for injecting different\nwatermarks for different users, which could help attribute the watermark to a\nspecific individual. In this paper, we explore the personalized text\nwatermarking scheme for LLM copyright protection and other scenarios, ensuring\naccountability and traceability in content generation. Specifically, we propose\na novel text watermarking method PersonaMark that utilizes sentence structure\nas the hidden medium for the watermark information and optimizes the\nsentence-level generation algorithm to minimize disruption to the model's\nnatural generation process. By employing a personalized hashing function to\ninject unique watermark signals for different users, personalized watermarked\ntext can be obtained. Since our approach performs on sentence level instead of\ntoken probability, the text quality is highly preserved. The injection process\nof unique watermark signals for different users is time-efficient for a large\nnumber of users with the designed multi-user hashing function. As far as we\nknow, we achieved personalized text watermarking for the first time through\nthis. We conduct an extensive evaluation of four different LLMs in terms of\nperplexity, sentiment polarity, alignment, readability, etc. The results\ndemonstrate that our method maintains performance with minimal perturbation to\nthe model's behavior, allows for unbiased insertion of watermark information,\nand exhibits strong watermark recognition capabilities.","PeriodicalId":501332,"journal":{"name":"arXiv - CS - Cryptography and Security","volume":"49 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Cryptography and Security","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.09739","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The rapid development of LLMs brings both convenience and potential threats.
As costumed and private LLMs are widely applied, model copyright protection has
become important. Text watermarking is emerging as a promising solution to
AI-generated text detection and model protection issues. However, current text
watermarks have largely ignored the critical need for injecting different
watermarks for different users, which could help attribute the watermark to a
specific individual. In this paper, we explore the personalized text
watermarking scheme for LLM copyright protection and other scenarios, ensuring
accountability and traceability in content generation. Specifically, we propose
a novel text watermarking method PersonaMark that utilizes sentence structure
as the hidden medium for the watermark information and optimizes the
sentence-level generation algorithm to minimize disruption to the model's
natural generation process. By employing a personalized hashing function to
inject unique watermark signals for different users, personalized watermarked
text can be obtained. Since our approach performs on sentence level instead of
token probability, the text quality is highly preserved. The injection process
of unique watermark signals for different users is time-efficient for a large
number of users with the designed multi-user hashing function. As far as we
know, we achieved personalized text watermarking for the first time through
this. We conduct an extensive evaluation of four different LLMs in terms of
perplexity, sentiment polarity, alignment, readability, etc. The results
demonstrate that our method maintains performance with minimal perturbation to
the model's behavior, allows for unbiased insertion of watermark information,
and exhibits strong watermark recognition capabilities.