S. Bu, Lilu Guo, Rongyuan Li, Jianbo Lu, Xiaoshu Zhu
{"title":"GSE","authors":"S. Bu, Lilu Guo, Rongyuan Li, Jianbo Lu, Xiaoshu Zhu","doi":"10.1145/3378065.3378142","DOIUrl":null,"url":null,"abstract":"RNA-seq contains rich information about individual even single cell, implies certain biology pattern vary in special time or space two dimensions, e.g. different life stage or environment. Byusing clustering and other computing methods, we can efficient analysis and decode those data applying to cancer diagnosis and treat, biological evolution and so on. However, RNA-seq data has features of super-high dimensions, less labeled samples and strong noise, which bring large challenges for clustering analysis. Therefore, we proposed a new clustering method GSE, which can efficient enhance the signal-to-noise ratio of input similarity matrix using diffusion process in weighted connection network to improve clustering performance. Comparing with latest clustering methods, our method has advantages in external clustering criterions NMI and ARI indicators. Meanwhile inadequacy and improved idea are given. Code can be downloaded from Git-hub.","PeriodicalId":153062,"journal":{"name":"Proceedings of the 2019 4th International Conference on Intelligent Information Processing","volume":"37 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2019 4th International Conference on Intelligent Information Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3378065.3378142","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
RNA-seq contains rich information about individual even single cell, implies certain biology pattern vary in special time or space two dimensions, e.g. different life stage or environment. Byusing clustering and other computing methods, we can efficient analysis and decode those data applying to cancer diagnosis and treat, biological evolution and so on. However, RNA-seq data has features of super-high dimensions, less labeled samples and strong noise, which bring large challenges for clustering analysis. Therefore, we proposed a new clustering method GSE, which can efficient enhance the signal-to-noise ratio of input similarity matrix using diffusion process in weighted connection network to improve clustering performance. Comparing with latest clustering methods, our method has advantages in external clustering criterions NMI and ARI indicators. Meanwhile inadequacy and improved idea are given. Code can be downloaded from Git-hub.