{"title":"利用w-shingling将相似基因分组到元桶中","authors":"Jorge Cedeno-Femández, F. Torres-Rojas","doi":"10.1109/CLEI47609.2019.235080","DOIUrl":null,"url":null,"abstract":"Abstract-The Pseudomonas aeruginosa AGI is an opportunistic superbug that affects patients with weakened immune systems due to infectious diseases in eyes, ears, burns or chronic diseases such as cystic flbrosis. This bacterium has genes without information and it is necessary to determine the function they fulfill in order to seek medical treatment that helps infected patients. Using w-shingling, metabuckets are generated that group genes by their similarity. Each metabucket contains genes with or without information, the latter can be associated with information about what they do possess. In addition, a factorial experiment is presented with the results of applying the algorithm to a set of genes obtained from public databases, where the average similarity of the genes grouped in metabuckets reaches 98%.","PeriodicalId":216193,"journal":{"name":"2019 XLV Latin American Computing Conference (CLEI)","volume":"196 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Using w-shingling to Group Similar Genes intoMetabuckets\",\"authors\":\"Jorge Cedeno-Femández, F. Torres-Rojas\",\"doi\":\"10.1109/CLEI47609.2019.235080\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Abstract-The Pseudomonas aeruginosa AGI is an opportunistic superbug that affects patients with weakened immune systems due to infectious diseases in eyes, ears, burns or chronic diseases such as cystic flbrosis. This bacterium has genes without information and it is necessary to determine the function they fulfill in order to seek medical treatment that helps infected patients. Using w-shingling, metabuckets are generated that group genes by their similarity. Each metabucket contains genes with or without information, the latter can be associated with information about what they do possess. In addition, a factorial experiment is presented with the results of applying the algorithm to a set of genes obtained from public databases, where the average similarity of the genes grouped in metabuckets reaches 98%.\",\"PeriodicalId\":216193,\"journal\":{\"name\":\"2019 XLV Latin American Computing Conference (CLEI)\",\"volume\":\"196 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 XLV Latin American Computing Conference (CLEI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CLEI47609.2019.235080\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 XLV Latin American Computing Conference (CLEI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CLEI47609.2019.235080","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Using w-shingling to Group Similar Genes intoMetabuckets
Abstract-The Pseudomonas aeruginosa AGI is an opportunistic superbug that affects patients with weakened immune systems due to infectious diseases in eyes, ears, burns or chronic diseases such as cystic flbrosis. This bacterium has genes without information and it is necessary to determine the function they fulfill in order to seek medical treatment that helps infected patients. Using w-shingling, metabuckets are generated that group genes by their similarity. Each metabucket contains genes with or without information, the latter can be associated with information about what they do possess. In addition, a factorial experiment is presented with the results of applying the algorithm to a set of genes obtained from public databases, where the average similarity of the genes grouped in metabuckets reaches 98%.