{"title":"病毒株的同态多标记分类","authors":"Junwei Zhou, Botian Lei, Huile Lang","doi":"10.1109/ISSREW55968.2022.00082","DOIUrl":null,"url":null,"abstract":"Detecting the gene sequence of virus strains from patients and classifying them into specific strains are very important to provide effective treatment. However, there are significant barriers to sharing the virus strains' gene data in plaintext to the privacy concerns of the patients. Homomorphic encryption is a form of encryption that allows users to calculate encrypted data without decrypting it. Achieving highly accurate viral strain prediction while safeguarding user privacy is a challenge. We develop a secure multi-label virus strains classification method using the homomorphic encryption scheme. We first used the method of statistical genotype frequencies for preprocessing to reduce the gene dimension of viral strains. Second, we improved the TFHE library proposed by Chillotti et al. to accommodate the floating-point input of the neural network to make the homomorphic calculation result more accurate. Finally, we improve computational speed and reduce storage usage by a data packing method that packs multiple feature information into one ciphertext. We successfully calculated 2000 virus strains classification inference steps on 128-bit encrypted test data in 0.09 seconds, reaching an accuracy of 100 %.","PeriodicalId":178302,"journal":{"name":"2022 IEEE International Symposium on Software Reliability Engineering Workshops (ISSREW)","volume":"16 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Homomorphic multi-label classification of virus strains\",\"authors\":\"Junwei Zhou, Botian Lei, Huile Lang\",\"doi\":\"10.1109/ISSREW55968.2022.00082\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Detecting the gene sequence of virus strains from patients and classifying them into specific strains are very important to provide effective treatment. However, there are significant barriers to sharing the virus strains' gene data in plaintext to the privacy concerns of the patients. Homomorphic encryption is a form of encryption that allows users to calculate encrypted data without decrypting it. Achieving highly accurate viral strain prediction while safeguarding user privacy is a challenge. We develop a secure multi-label virus strains classification method using the homomorphic encryption scheme. We first used the method of statistical genotype frequencies for preprocessing to reduce the gene dimension of viral strains. Second, we improved the TFHE library proposed by Chillotti et al. to accommodate the floating-point input of the neural network to make the homomorphic calculation result more accurate. Finally, we improve computational speed and reduce storage usage by a data packing method that packs multiple feature information into one ciphertext. We successfully calculated 2000 virus strains classification inference steps on 128-bit encrypted test data in 0.09 seconds, reaching an accuracy of 100 %.\",\"PeriodicalId\":178302,\"journal\":{\"name\":\"2022 IEEE International Symposium on Software Reliability Engineering Workshops (ISSREW)\",\"volume\":\"16 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE International Symposium on Software Reliability Engineering Workshops (ISSREW)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISSREW55968.2022.00082\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE International Symposium on Software Reliability Engineering Workshops (ISSREW)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISSREW55968.2022.00082","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Homomorphic multi-label classification of virus strains
Detecting the gene sequence of virus strains from patients and classifying them into specific strains are very important to provide effective treatment. However, there are significant barriers to sharing the virus strains' gene data in plaintext to the privacy concerns of the patients. Homomorphic encryption is a form of encryption that allows users to calculate encrypted data without decrypting it. Achieving highly accurate viral strain prediction while safeguarding user privacy is a challenge. We develop a secure multi-label virus strains classification method using the homomorphic encryption scheme. We first used the method of statistical genotype frequencies for preprocessing to reduce the gene dimension of viral strains. Second, we improved the TFHE library proposed by Chillotti et al. to accommodate the floating-point input of the neural network to make the homomorphic calculation result more accurate. Finally, we improve computational speed and reduce storage usage by a data packing method that packs multiple feature information into one ciphertext. We successfully calculated 2000 virus strains classification inference steps on 128-bit encrypted test data in 0.09 seconds, reaching an accuracy of 100 %.