A novel interpolation based missing value estimation method to predict missing values in microarray gene expression data

S. Bose, C. Das, S. Dutta, S. Chattopadhyay
{"title":"A novel interpolation based missing value estimation method to predict missing values in microarray gene expression data","authors":"S. Bose, C. Das, S. Dutta, S. Chattopadhyay","doi":"10.1109/CODIS.2012.6422202","DOIUrl":null,"url":null,"abstract":"Microarray experiments can generate data sets with multiple missing expression values, normally due to various experimental problems. Unfortunately, many algorithms for gene expression analysis require a complete matrix of gene array values as input. Thereore, effective missing value estimation methods are essential to minimize the effect of incomplete data sets on analysis, and to increase the range of data sets to which these algorithms can be applied. In this regard, a new interpolation based imputation method is proposed to predict missing values in microarray gene expression data. The proposed method selects a subset of similar genes and a subset of similar samples with respect to each missing position and then applies interpolation in a novel manner to predict that missing value. The performance of the proposed method is studied based on the normalized root mean square error with existing estimation techniques including K-nearest neighbor (KNN), Sequential K-nearest neighbor (SKNN) and Iterative K-nearest neighbor (IKNN). The effectiveness of the proposed method, along with a comparison with existing methods, is demonstrated on different microarray data sets.","PeriodicalId":274831,"journal":{"name":"2012 International Conference on Communications, Devices and Intelligent Systems (CODIS)","volume":"18 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 International Conference on Communications, Devices and Intelligent Systems (CODIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CODIS.2012.6422202","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 10

Abstract

Microarray experiments can generate data sets with multiple missing expression values, normally due to various experimental problems. Unfortunately, many algorithms for gene expression analysis require a complete matrix of gene array values as input. Thereore, effective missing value estimation methods are essential to minimize the effect of incomplete data sets on analysis, and to increase the range of data sets to which these algorithms can be applied. In this regard, a new interpolation based imputation method is proposed to predict missing values in microarray gene expression data. The proposed method selects a subset of similar genes and a subset of similar samples with respect to each missing position and then applies interpolation in a novel manner to predict that missing value. The performance of the proposed method is studied based on the normalized root mean square error with existing estimation techniques including K-nearest neighbor (KNN), Sequential K-nearest neighbor (SKNN) and Iterative K-nearest neighbor (IKNN). The effectiveness of the proposed method, along with a comparison with existing methods, is demonstrated on different microarray data sets.
基于插值的缺失值估计方法预测微阵列基因表达数据缺失值
通常由于各种实验问题,微阵列实验会产生多个缺失表达值的数据集。不幸的是,许多基因表达分析算法需要一个完整的基因阵列值矩阵作为输入。因此,有效的缺失值估计方法对于最小化不完整数据集对分析的影响以及增加这些算法可以应用的数据集范围至关重要。为此,提出了一种新的基于插值法的基因表达数据缺失值预测方法。该方法针对每个缺失位置选择一个相似基因子集和一个相似样本子集,然后以一种新颖的方式应用插值来预测缺失值。利用现有的k -近邻(KNN)、顺序k -近邻(SKNN)和迭代k -近邻(IKNN)估计技术,研究了基于归一化均方根误差的方法的性能。在不同的微阵列数据集上证明了所提出方法的有效性,并与现有方法进行了比较。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信