{"title":"影响深度学习中神经网络基因型输入准确性的因素","authors":"Tianfeng Shi, Jing Peng","doi":"10.1145/3457682.3457688","DOIUrl":null,"url":null,"abstract":"The genotype imputation is an important topic in the field of genomics. Many genome analyses require data without missing values, which requires to impute the missing data. In recent years, deep learning has become hot, and it is more suitable for text sequence type problems, which may fit with the genotype imputation problem. Based on the recurrent neural network and convolutional neural network in deep learning, our study proposes and constructs five model combinations, imputes and compares the results under different missing rate scenarios. And on the basis of the basic model, a higher imputation accuracy is obtained by tuning the model hyperparameters. The results indicated that on all the data sets with various levels of missing rates, the CNN1D-RNNM with tuned hyperparameters well has obtained the best results. The combination of a one-dimensional convolutional neural network and a recurrent neural network with tuned hyperparameters can beat a single convolutional network or a recurrent network at various levels of missing rates. This research provides new solutions for genotype imputation by using the deep learning to build complex neural networks.","PeriodicalId":142045,"journal":{"name":"2021 13th International Conference on Machine Learning and Computing","volume":"21 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-02-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Factors Affecting Accuracy of Genotype Imputation Using Neural Networks in Deep Learning\",\"authors\":\"Tianfeng Shi, Jing Peng\",\"doi\":\"10.1145/3457682.3457688\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The genotype imputation is an important topic in the field of genomics. Many genome analyses require data without missing values, which requires to impute the missing data. In recent years, deep learning has become hot, and it is more suitable for text sequence type problems, which may fit with the genotype imputation problem. Based on the recurrent neural network and convolutional neural network in deep learning, our study proposes and constructs five model combinations, imputes and compares the results under different missing rate scenarios. And on the basis of the basic model, a higher imputation accuracy is obtained by tuning the model hyperparameters. The results indicated that on all the data sets with various levels of missing rates, the CNN1D-RNNM with tuned hyperparameters well has obtained the best results. The combination of a one-dimensional convolutional neural network and a recurrent neural network with tuned hyperparameters can beat a single convolutional network or a recurrent network at various levels of missing rates. This research provides new solutions for genotype imputation by using the deep learning to build complex neural networks.\",\"PeriodicalId\":142045,\"journal\":{\"name\":\"2021 13th International Conference on Machine Learning and Computing\",\"volume\":\"21 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-02-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 13th International Conference on Machine Learning and Computing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3457682.3457688\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 13th International Conference on Machine Learning and Computing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3457682.3457688","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Factors Affecting Accuracy of Genotype Imputation Using Neural Networks in Deep Learning
The genotype imputation is an important topic in the field of genomics. Many genome analyses require data without missing values, which requires to impute the missing data. In recent years, deep learning has become hot, and it is more suitable for text sequence type problems, which may fit with the genotype imputation problem. Based on the recurrent neural network and convolutional neural network in deep learning, our study proposes and constructs five model combinations, imputes and compares the results under different missing rate scenarios. And on the basis of the basic model, a higher imputation accuracy is obtained by tuning the model hyperparameters. The results indicated that on all the data sets with various levels of missing rates, the CNN1D-RNNM with tuned hyperparameters well has obtained the best results. The combination of a one-dimensional convolutional neural network and a recurrent neural network with tuned hyperparameters can beat a single convolutional network or a recurrent network at various levels of missing rates. This research provides new solutions for genotype imputation by using the deep learning to build complex neural networks.