{"title":"Detection of Breast Cancer by the Identification of Circulating Tumor Cells Using Association Rule Mining","authors":"S. Jananee, R. Nedunchelian","doi":"10.4018/IJKDB.2016010102","DOIUrl":null,"url":null,"abstract":"Circulating Tumor Cells CTCs are cells that have shed into the vasculate from the primary tumor and circulate into the blood stream. In this proposed work, the major genes causing the breast cancer is identified by the principle of Association Rule. The trained set and training set is made to upload on the data store. By associating each row of a training set to all the rows of the trained data is done and the report is generated. The Baum welch process is called for the estimation of actual probabilities and emission probabilities by calculating its log likelihood factor which gives the high Priority gene values that are responsible for the cause of cancer. Based on this cell category is splitted into three clusters such as carcinoma level, metastasis level and Kaposi sarcoma. On each cluster it finds the highest priority value in it and classifies into high, low and medium values. On extraction of these higher gene values yields the major responsible genes causing breast cancer. Finally, the obtained results are validated through hierarchical clustering.","PeriodicalId":160270,"journal":{"name":"Int. J. Knowl. Discov. Bioinform.","volume":"4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Int. J. Knowl. Discov. Bioinform.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4018/IJKDB.2016010102","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Circulating Tumor Cells CTCs are cells that have shed into the vasculate from the primary tumor and circulate into the blood stream. In this proposed work, the major genes causing the breast cancer is identified by the principle of Association Rule. The trained set and training set is made to upload on the data store. By associating each row of a training set to all the rows of the trained data is done and the report is generated. The Baum welch process is called for the estimation of actual probabilities and emission probabilities by calculating its log likelihood factor which gives the high Priority gene values that are responsible for the cause of cancer. Based on this cell category is splitted into three clusters such as carcinoma level, metastasis level and Kaposi sarcoma. On each cluster it finds the highest priority value in it and classifies into high, low and medium values. On extraction of these higher gene values yields the major responsible genes causing breast cancer. Finally, the obtained results are validated through hierarchical clustering.