Relational Database Watermarking Based on Chinese Word Segmentation and Word Embedding
Wenling Li, Jianen Yan, Zhaoxin Zhang
2020 29th International Conference on Computer Communications and Networks (ICCCN), 2020-08-01
DOI: 10.1109/ICCCN49398.2020.9209600 (https://doi.org/10.1109/ICCCN49398.2020.9209600)
With the development of big data, relational databases play an important role in enterprise, military, medical, and other applications. At the same time, they are vulnerable to piracy, forgery, and tampering, so the copyright protection of relational databases has become an issue of increasing concern. Digital watermarking technology can address this problem effectively. In this paper, Chinese word segmentation and word embedding are used to embed a watermark in the non-numerical attributes of a relational database that contain Chinese natural-language text. Virtual splitting of the attribute column and a minimum-modification principle are proposed to guarantee the watermark capacity and reduce the data modification rate. Simulations show that the algorithm is strongly robust against malicious attacks.
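The abstract does not spell out the embedding procedure, so the following is only a minimal sketch of the general idea of hiding one watermark bit in a segmented Chinese text value while changing as little as possible. It is not the authors' algorithm. Everything named here is an assumption made for illustration: the `GROUPS` table is a hand-written stand-in for word-embedding nearest neighbours, the token list is assumed to come from some Chinese word segmenter, and the keyed-hash bit rule (`word_bit`) is a common generic choice rather than something stated in the source. Virtual splitting of the attribute column is not modelled.

```python
import hashlib
import hmac

# Hypothetical groups of interchangeable words. A real system would derive
# these from nearest neighbours in a trained word-embedding space.
GROUPS = [
    ["医院", "诊所", "卫生院"],
    ["患者", "病人", "病患"],
]
WORD_TO_GROUP = {word: group for group in GROUPS for word in group}


def word_bit(key: bytes, word: str) -> int:
    """Keyed pseudo-random bit for a word: lowest bit of its HMAC-SHA256."""
    digest = hmac.new(key, word.encode("utf-8"), hashlib.sha256).digest()
    return digest[0] & 1


def carrier_index(tokens):
    """Index of the first token that belongs to a group, or None."""
    for i, token in enumerate(tokens):
        if token in WORD_TO_GROUP:
            return i
    return None


def extract_bit(key: bytes, tokens):
    """Read the bit carried by the value, or None if it has no carrier word."""
    i = carrier_index(tokens)
    return None if i is None else word_bit(key, tokens[i])


def embed_bit(key: bytes, tokens, bit: int):
    """Return a copy of tokens that carries `bit`, changing at most one token.

    Returns None when the value has no carrier word or when no word in the
    carrier's group maps to the requested bit.
    """
    i = carrier_index(tokens)
    if i is None:
        return None
    if word_bit(key, tokens[i]) == bit:
        return list(tokens)  # already carries the bit: no modification
    for candidate in WORD_TO_GROUP[tokens[i]]:
        if word_bit(key, candidate) == bit:
            marked = list(tokens)
            marked[i] = candidate
            return marked
    return None


if __name__ == "__main__":
    # Assumed segmentation of "患者在医院接受治疗".
    tokens = ["患者", "在", "医院", "接受", "治疗"]
    key = b"demo-secret-key"  # hypothetical secret key
    for bit in (0, 1):
        marked = embed_bit(key, tokens, bit)
        print(bit, "".join(marked) if marked is not None else "cannot embed")
```

In this sketch the value is left untouched whenever its carrier word already maps to the desired bit, which is one simple way to read a minimum-modification rule. Because a replacement always comes from the same group, the carrier position stays the same and extraction needs only the key and the group table.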