Jianan Wu, Chunguang Zhou, Zhangxu Li, Xuefei Xia, Seng Zhang, You Zhou
A novel algorithm for generating simulated genetic data based on K-medoids
2012 IEEE 2nd International Conference on Cloud Computing and Intelligence Systems, published 2012-10-01
DOI: https://doi.org/10.1109/CCIS.2012.6664360
Abstract
Genetic data are essential to biological research but are difficult to obtain experimentally. In this paper, we introduce an algorithm based on K-medoids for generating simulated genetic data. The algorithm proposes the concept of a Cluster Channel and uses it to generate the simulated data. The proposed method can also eliminate noise in the original data. Experimental results demonstrate the reliability of the simulated genetic data. We analyze both the simulated and the original data with SAM and conclude that the simulated data can effectively validate algorithms for detecting differentially expressed genes.
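For readers unfamiliar with the clustering step the abstract refers to, the following is a minimal sketch of generic K-medoids using alternating assignment and update steps. It is not the authors' algorithm: the abstract does not define the Cluster Channel, so that part is omitted entirely. The function name `k_medoids`, the Euclidean distance, the random initialization, and the toy `profiles` data are all assumptions made for illustration.

```python
import random


def k_medoids(points, k, max_iter=100, seed=0):
    """Generic K-medoids sketch (assumed variant: alternating assign/update)."""
    rng = random.Random(seed)

    def dist(a, b):
        # Euclidean distance between two profiles (an assumed metric)
        return sum((x - y) ** 2 for x, y in zip(a, b)) ** 0.5

    def assign(medoids):
        # Attach each point index to its nearest medoid; a medoid always
        # stays in its own cluster, so no cluster is ever empty
        clusters = {m: [] for m in medoids}
        for i, p in enumerate(points):
            if i in clusters:
                nearest = i
            else:
                nearest = min(medoids, key=lambda m: dist(p, points[m]))
            clusters[nearest].append(i)
        return clusters

    # Random initial medoids, stored as indices into `points`
    medoids = rng.sample(range(len(points)), k)
    for _ in range(max_iter):
        clusters = assign(medoids)
        # In each cluster, pick the member with the smallest total
        # distance to all other members as the new medoid
        new_medoids = [
            min(members,
                key=lambda c: sum(dist(points[c], points[j]) for j in members))
            for members in clusters.values()
        ]
        if set(new_medoids) == set(medoids):
            break
        medoids = new_medoids
    return medoids, assign(medoids)


# Toy "expression profiles" (hypothetical values, two visibly separate groups)
profiles = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10)]
medoids, clusters = k_medoids(profiles, k=2)
print(medoids, clusters)
```

Unlike K-means, each cluster center here is an actual data point, which is why the method is often preferred when a representative real profile is needed or when the data contain outliers.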