{"title":"无监督学习技术在DNA微阵列数据库中的应用:一个比较案例研究","authors":"Judith E. Gómez-Cuervo, A. Chavoya","doi":"10.1109/ICEV56253.2022.9959173","DOIUrl":null,"url":null,"abstract":"We present the application of the unsupervised machine learning clustering techniques K-Means and Fuzzy C-Means to DNA microarray gene expression data from breast cancer tissues. The mathematical basis and the development of the algorithm of the techniques are shown, and the Expectation Maximization technique is presented as an alternative for the assignment of missing data. Results show that the best method for the clustering of the genetic expression data under study is the Fuzzy C-Means algorithm.","PeriodicalId":178334,"journal":{"name":"2022 IEEE International Conference on Engineering Veracruz (ICEV)","volume":"102 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Application of unsupervised learning techniques to DNA microarray databases: a comparative case study\",\"authors\":\"Judith E. Gómez-Cuervo, A. Chavoya\",\"doi\":\"10.1109/ICEV56253.2022.9959173\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We present the application of the unsupervised machine learning clustering techniques K-Means and Fuzzy C-Means to DNA microarray gene expression data from breast cancer tissues. The mathematical basis and the development of the algorithm of the techniques are shown, and the Expectation Maximization technique is presented as an alternative for the assignment of missing data. Results show that the best method for the clustering of the genetic expression data under study is the Fuzzy C-Means algorithm.\",\"PeriodicalId\":178334,\"journal\":{\"name\":\"2022 IEEE International Conference on Engineering Veracruz (ICEV)\",\"volume\":\"102 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-10-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE International Conference on Engineering Veracruz (ICEV)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICEV56253.2022.9959173\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE International Conference on Engineering Veracruz (ICEV)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICEV56253.2022.9959173","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Application of unsupervised learning techniques to DNA microarray databases: a comparative case study
We present the application of the unsupervised machine learning clustering techniques K-Means and Fuzzy C-Means to DNA microarray gene expression data from breast cancer tissues. The mathematical basis and the development of the algorithm of the techniques are shown, and the Expectation Maximization technique is presented as an alternative for the assignment of missing data. Results show that the best method for the clustering of the genetic expression data under study is the Fuzzy C-Means algorithm.