{"title":"一种基于基因排序的高效投影分割算法","authors":"Zesheng Sun, Yuhai Zhao, D. Meng, He Pan","doi":"10.1109/FSKD.2013.6816286","DOIUrl":null,"url":null,"abstract":"Most existing methods perform the projected partition over gene expression data based on the untrue assumption of independence among genes. To address the problem, we propose two novel projected partition algorithms, PPA and PPA+. The basic idea of PPA is to take the order among genes as the criterion of phenotype structure discovery. Specially, in PPA, no any specific data distribution assumption is needed. By transforming the expression values into sequential data, PPA employs the branch and bound paradigm to conduct an efficient depth-first traverse over the sample enumeration space, where a user-specific favorite function is developed. Further, based on the hill-climbing strategy and the devised evaluation function, two different strategies are proposed to make the results satisfy different customer-requirements. Efficient pruning and optimization strategies are also devised to further improve the performance of the algorithms. We conducted the performance comparison of PPA and some related alternatives on five real Microarray datasets. The results show that PPA is more effective and efficient. More important, PPA may provide a fire-new insight into the pathogenesis problem.","PeriodicalId":368964,"journal":{"name":"2013 10th International Conference on Fuzzy Systems and Knowledge Discovery (FSKD)","volume":"29 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-07-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"An efficient projected partition algorithm based on the order among genes\",\"authors\":\"Zesheng Sun, Yuhai Zhao, D. Meng, He Pan\",\"doi\":\"10.1109/FSKD.2013.6816286\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Most existing methods perform the projected partition over gene expression data based on the untrue assumption of independence among genes. To address the problem, we propose two novel projected partition algorithms, PPA and PPA+. The basic idea of PPA is to take the order among genes as the criterion of phenotype structure discovery. Specially, in PPA, no any specific data distribution assumption is needed. By transforming the expression values into sequential data, PPA employs the branch and bound paradigm to conduct an efficient depth-first traverse over the sample enumeration space, where a user-specific favorite function is developed. Further, based on the hill-climbing strategy and the devised evaluation function, two different strategies are proposed to make the results satisfy different customer-requirements. Efficient pruning and optimization strategies are also devised to further improve the performance of the algorithms. We conducted the performance comparison of PPA and some related alternatives on five real Microarray datasets. The results show that PPA is more effective and efficient. More important, PPA may provide a fire-new insight into the pathogenesis problem.\",\"PeriodicalId\":368964,\"journal\":{\"name\":\"2013 10th International Conference on Fuzzy Systems and Knowledge Discovery (FSKD)\",\"volume\":\"29 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2013-07-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2013 10th International Conference on Fuzzy Systems and Knowledge Discovery (FSKD)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/FSKD.2013.6816286\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 10th International Conference on Fuzzy Systems and Knowledge Discovery (FSKD)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/FSKD.2013.6816286","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
An efficient projected partition algorithm based on the order among genes
Most existing methods perform the projected partition over gene expression data based on the untrue assumption of independence among genes. To address the problem, we propose two novel projected partition algorithms, PPA and PPA+. The basic idea of PPA is to take the order among genes as the criterion of phenotype structure discovery. Specially, in PPA, no any specific data distribution assumption is needed. By transforming the expression values into sequential data, PPA employs the branch and bound paradigm to conduct an efficient depth-first traverse over the sample enumeration space, where a user-specific favorite function is developed. Further, based on the hill-climbing strategy and the devised evaluation function, two different strategies are proposed to make the results satisfy different customer-requirements. Efficient pruning and optimization strategies are also devised to further improve the performance of the algorithms. We conducted the performance comparison of PPA and some related alternatives on five real Microarray datasets. The results show that PPA is more effective and efficient. More important, PPA may provide a fire-new insight into the pathogenesis problem.