{"title":"基于紧凑遗传算法的疾病相关性研究方法","authors":"Jing Zhao, Bin Wei, Xuan She","doi":"10.1109/ICITBE54178.2021.00093","DOIUrl":null,"url":null,"abstract":"The central goal of genetic studies is to identify the mapping relationship between DNA variation and diseases. Single nucleotide polymorphisms (SNPs) are the most common form of DNA variation. The availability of high-throughput genotyping technologies enable researchers to investigate SNPs that contribute to disease risks. However, contrasting to the number of SNPs involved, the available datasets generally have a fairly small sample size, which is a challenge to traditional data analysis methods. In this paper, the compact genetic algorithm (cGA) is used to identify the disease-associated SNPs. The algorithm used in this paper was tested on three datasets: Crohn's disease, Lung cancer and Tick-borne encephalitis. The experimental results showed that the method used in this paper has better performance than the existing algorithms and can greatly improve the prediction accuracy. It is potentially interesting as an alternative tool in disease association studies.","PeriodicalId":207276,"journal":{"name":"2021 International Conference on Information Technology and Biomedical Engineering (ICITBE)","volume":"78 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"The compact genetic algorithm based method for disease associated study\",\"authors\":\"Jing Zhao, Bin Wei, Xuan She\",\"doi\":\"10.1109/ICITBE54178.2021.00093\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The central goal of genetic studies is to identify the mapping relationship between DNA variation and diseases. Single nucleotide polymorphisms (SNPs) are the most common form of DNA variation. The availability of high-throughput genotyping technologies enable researchers to investigate SNPs that contribute to disease risks. However, contrasting to the number of SNPs involved, the available datasets generally have a fairly small sample size, which is a challenge to traditional data analysis methods. In this paper, the compact genetic algorithm (cGA) is used to identify the disease-associated SNPs. The algorithm used in this paper was tested on three datasets: Crohn's disease, Lung cancer and Tick-borne encephalitis. The experimental results showed that the method used in this paper has better performance than the existing algorithms and can greatly improve the prediction accuracy. It is potentially interesting as an alternative tool in disease association studies.\",\"PeriodicalId\":207276,\"journal\":{\"name\":\"2021 International Conference on Information Technology and Biomedical Engineering (ICITBE)\",\"volume\":\"78 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 International Conference on Information Technology and Biomedical Engineering (ICITBE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICITBE54178.2021.00093\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Conference on Information Technology and Biomedical Engineering (ICITBE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICITBE54178.2021.00093","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
The compact genetic algorithm based method for disease associated study
The central goal of genetic studies is to identify the mapping relationship between DNA variation and diseases. Single nucleotide polymorphisms (SNPs) are the most common form of DNA variation. The availability of high-throughput genotyping technologies enable researchers to investigate SNPs that contribute to disease risks. However, contrasting to the number of SNPs involved, the available datasets generally have a fairly small sample size, which is a challenge to traditional data analysis methods. In this paper, the compact genetic algorithm (cGA) is used to identify the disease-associated SNPs. The algorithm used in this paper was tested on three datasets: Crohn's disease, Lung cancer and Tick-borne encephalitis. The experimental results showed that the method used in this paper has better performance than the existing algorithms and can greatly improve the prediction accuracy. It is potentially interesting as an alternative tool in disease association studies.