{"title":"A joint framework for missing values estimation and biclusters detection in gene expression data.","authors":"Kin-On Cheng, Ngai-Fong Law, Yui-Lam Chan, Wan-Chi Siu","doi":"10.1504/IJBRA.2014.065243","DOIUrl":null,"url":null,"abstract":"<p><p>DNA microarray experiment unavoidably generates gene expression data with missing values. This hardens subsequent analysis such as biclusters detection which aims to find a set of co-expressed genes under some experimental conditions. Missing values are thus required to be estimated before biclusters detection. Existing missing values estimation algorithms rely on finding coherence among expression values throughout the data. In view that both missing values estimation and biclusters detection aim at exploiting coherence inside the expression data, we propose to integrate these two steps into a joint framework. The benefits are twofold; the missing values estimation can improve biclusters analysis and the coherence in detected biclusters can be exploited for accurate missing values estimation. Experimental results show that the bicluster information can significantly improve the accuracy in missing values estimation. Also, the joint framework enables the detection of biologically meaningful biclusters. </p>","PeriodicalId":35444,"journal":{"name":"International Journal of Bioinformatics Research and Applications","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2014-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1504/IJBRA.2014.065243","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Bioinformatics Research and Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1504/IJBRA.2014.065243","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"Health Professions","Score":null,"Total":0}
引用次数: 2
Abstract
DNA microarray experiment unavoidably generates gene expression data with missing values. This hardens subsequent analysis such as biclusters detection which aims to find a set of co-expressed genes under some experimental conditions. Missing values are thus required to be estimated before biclusters detection. Existing missing values estimation algorithms rely on finding coherence among expression values throughout the data. In view that both missing values estimation and biclusters detection aim at exploiting coherence inside the expression data, we propose to integrate these two steps into a joint framework. The benefits are twofold; the missing values estimation can improve biclusters analysis and the coherence in detected biclusters can be exploited for accurate missing values estimation. Experimental results show that the bicluster information can significantly improve the accuracy in missing values estimation. Also, the joint framework enables the detection of biologically meaningful biclusters.
期刊介绍:
Bioinformatics is an interdisciplinary research field that combines biology, computer science, mathematics and statistics into a broad-based field that will have profound impacts on all fields of biology. The emphasis of IJBRA is on basic bioinformatics research methods, tool development, performance evaluation and their applications in biology. IJBRA addresses the most innovative developments, research issues and solutions in bioinformatics and computational biology and their applications. Topics covered include Databases, bio-grid, system biology Biomedical image processing, modelling and simulation Bio-ontology and data mining, DNA assembly, clustering, mapping Computational genomics/proteomics Silico technology: computational intelligence, high performance computing E-health, telemedicine Gene expression, microarrays, identification, annotation Genetic algorithms, fuzzy logic, neural networks, data visualisation Hidden Markov models, machine learning, support vector machines Molecular evolution, phylogeny, modelling, simulation, sequence analysis Parallel algorithms/architectures, computational structural biology Phylogeny reconstruction algorithms, physiome, protein structure prediction Sequence assembly, search, alignment Signalling/computational biomedical data engineering Simulated annealing, statistical analysis, stochastic grammars.