{"title":"Biclustering on gene expression data","authors":"M. Shruthi","doi":"10.1109/ICAMMAET.2017.8186750","DOIUrl":null,"url":null,"abstract":"Microarray technology is a tool which is essential to observe and monitor the genes in an living organism. Biclustering is a strategy to distinguish qualities that are co-directed under a subset of conditions, however are not really co-controlled crosswise over different conditions. The dataset is in the form of matrix, row matrix represents a set of genes and column matrix represents a set of conditions. Each row in a matrix corresponds to set of genes and each column represents a set of conditions. The goal of this project is to identify groups of genes sharing a common subset of regulatory units. A method is needed to select clusters of genes and conditions simultaneously, finding distinctive clusters with less number of rules generated. Feature selection might be assessed from both the proficiency and adequacy perspectives. While the productivity concerns the time needed to discover a set of components is that the adequacy is identified with the nature of the subset of elements. Hence fast clustering is used to cluster the data into two categories (relevant data, irrelevant data). K-means algorithm is employed to bicluster the data in order to divide the input data into four classes. From the calculation of the maximum and minimum specificity of each class, better accuracy is outperformed.","PeriodicalId":425974,"journal":{"name":"2017 International Conference on Algorithms, Methodology, Models and Applications in Emerging Technologies (ICAMMAET)","volume":"8 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 International Conference on Algorithms, Methodology, Models and Applications in Emerging Technologies (ICAMMAET)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICAMMAET.2017.8186750","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Microarray technology is a tool which is essential to observe and monitor the genes in an living organism. Biclustering is a strategy to distinguish qualities that are co-directed under a subset of conditions, however are not really co-controlled crosswise over different conditions. The dataset is in the form of matrix, row matrix represents a set of genes and column matrix represents a set of conditions. Each row in a matrix corresponds to set of genes and each column represents a set of conditions. The goal of this project is to identify groups of genes sharing a common subset of regulatory units. A method is needed to select clusters of genes and conditions simultaneously, finding distinctive clusters with less number of rules generated. Feature selection might be assessed from both the proficiency and adequacy perspectives. While the productivity concerns the time needed to discover a set of components is that the adequacy is identified with the nature of the subset of elements. Hence fast clustering is used to cluster the data into two categories (relevant data, irrelevant data). K-means algorithm is employed to bicluster the data in order to divide the input data into four classes. From the calculation of the maximum and minimum specificity of each class, better accuracy is outperformed.