{"title":"Application of unsupervised learning techniques to DNA microarray databases: a comparative case study","authors":"Judith E. Gómez-Cuervo, A. Chavoya","doi":"10.1109/ICEV56253.2022.9959173","DOIUrl":null,"url":null,"abstract":"We present the application of the unsupervised machine learning clustering techniques K-Means and Fuzzy C-Means to DNA microarray gene expression data from breast cancer tissues. The mathematical basis and the development of the algorithm of the techniques are shown, and the Expectation Maximization technique is presented as an alternative for the assignment of missing data. Results show that the best method for the clustering of the genetic expression data under study is the Fuzzy C-Means algorithm.","PeriodicalId":178334,"journal":{"name":"2022 IEEE International Conference on Engineering Veracruz (ICEV)","volume":"102 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE International Conference on Engineering Veracruz (ICEV)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICEV56253.2022.9959173","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
We present the application of the unsupervised machine learning clustering techniques K-Means and Fuzzy C-Means to DNA microarray gene expression data from breast cancer tissues. The mathematical basis and the development of the algorithm of the techniques are shown, and the Expectation Maximization technique is presented as an alternative for the assignment of missing data. Results show that the best method for the clustering of the genetic expression data under study is the Fuzzy C-Means algorithm.