A Cloud Computing System to Quickly Implement New Microarray Data Pre-processing Methods

Q3 Computer Science
Dajie Luo, Prithish Banerjee, E. Harner, J. Mobley, Dongquan Chen
{"title":"A Cloud Computing System to Quickly Implement New Microarray Data Pre-processing Methods","authors":"Dajie Luo, Prithish Banerjee, E. Harner, J. Mobley, Dongquan Chen","doi":"10.2174/1875036201206010037","DOIUrl":null,"url":null,"abstract":"Background: Pre-processing, including normalization of raw microarray data is crucial to microarray-related data analysis. It takes time and effort to build newly-developed algorithms into commercial software or locally developed systems. While most new algorithms emerge in the form of sharable R packages, it can be difficult for many biologists to apply them as soon as they are available. Currently, we rely on statisticians and experienced programmers to develop and implement code to access those R packages. Therefore, we need a robust procedure to quickly implement pre-processing methods as they appear. The newly emerging cloud computing concept has directed us toward a new way for providing an easily accessible service to the biologists without requiring them to have any programming knowledge in R. Results: Based on our earlier Java-based software tool JavaStat, we developed an internet based application prototype to upload data and carry out pre-processing applications that include normalization, statistical analyses and plots. More im- portantly, R packages, e. g., for newly-developed normalization methods, and GC-robust multichip algorithm (RMA) for exon arrays, can be easily incorporated into the system with limited inputs from a biologist or a programmer. The data are stored in the cloud and the R code runs on server. Conclusion: The newly emerged cloud computing concept provides us a new way to provide an easily accessible and up- to-date service to biologists, as evidenced by our JavaStat system to incorporate new pre-processing package as they ap- pear. Users can access the application with a newly incorporated module through the Web. We expect this and other simi- lar systems greatly decrease turn-around time, improve accessibility of newly developed R model for pre-processing algo- rithms.","PeriodicalId":38956,"journal":{"name":"Open Bioinformatics Journal","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2012-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Open Bioinformatics Journal","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2174/1875036201206010037","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Computer Science","Score":null,"Total":0}
引用次数: 1

Abstract

Background: Pre-processing, including normalization of raw microarray data is crucial to microarray-related data analysis. It takes time and effort to build newly-developed algorithms into commercial software or locally developed systems. While most new algorithms emerge in the form of sharable R packages, it can be difficult for many biologists to apply them as soon as they are available. Currently, we rely on statisticians and experienced programmers to develop and implement code to access those R packages. Therefore, we need a robust procedure to quickly implement pre-processing methods as they appear. The newly emerging cloud computing concept has directed us toward a new way for providing an easily accessible service to the biologists without requiring them to have any programming knowledge in R. Results: Based on our earlier Java-based software tool JavaStat, we developed an internet based application prototype to upload data and carry out pre-processing applications that include normalization, statistical analyses and plots. More im- portantly, R packages, e. g., for newly-developed normalization methods, and GC-robust multichip algorithm (RMA) for exon arrays, can be easily incorporated into the system with limited inputs from a biologist or a programmer. The data are stored in the cloud and the R code runs on server. Conclusion: The newly emerged cloud computing concept provides us a new way to provide an easily accessible and up- to-date service to biologists, as evidenced by our JavaStat system to incorporate new pre-processing package as they ap- pear. Users can access the application with a newly incorporated module through the Web. We expect this and other simi- lar systems greatly decrease turn-around time, improve accessibility of newly developed R model for pre-processing algo- rithms.
云计算系统快速实现新的微阵列数据预处理方法
背景:预处理,包括原始微阵列数据的规范化是微阵列相关数据分析的关键。将新开发的算法构建到商业软件或本地开发的系统中需要时间和精力。虽然大多数新算法以可共享的R包的形式出现,但对于许多生物学家来说,一旦它们可用,就很难应用它们。目前,我们依靠统计学家和经验丰富的程序员来开发和实现访问这些R包的代码。因此,我们需要一个健壮的程序来快速实现预处理方法。新出现的云计算概念为我们提供了一种新的方式,为生物学家提供易于访问的服务,而不需要他们有任何r语言的编程知识。结果:基于我们早期基于java的软件工具JavaStat,我们开发了一个基于互联网的应用程序原型,用于上传数据并执行包括标准化,统计分析和绘图在内的预处理应用程序。更重要的是,R包,例如,用于新开发的归一化方法,以及用于外显子阵列的gc -鲁棒多芯片算法(RMA),可以很容易地与生物学家或程序员的有限输入合并到系统中。数据存储在云中,R代码运行在服务器上。结论:新出现的云计算概念为我们提供了一种新的方式来为生物学家提供易于访问和最新的服务,正如我们的JavaStat系统在出现时包含新的预处理包所证明的那样。用户可以通过Web使用新合并的模块访问应用程序。我们期望这个和其他类似的系统能大大减少周转时间,提高新开发的R模型对预处理算法的可及性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Open Bioinformatics Journal
Open Bioinformatics Journal Computer Science-Computer Science (miscellaneous)
CiteScore
2.40
自引率
0.00%
发文量
4
期刊介绍: The Open Bioinformatics Journal is an Open Access online journal, which publishes research articles, reviews/mini-reviews, letters, clinical trial studies and guest edited single topic issues in all areas of bioinformatics and computational biology. The coverage includes biomedicine, focusing on large data acquisition, analysis and curation, computational and statistical methods for the modeling and analysis of biological data, and descriptions of new algorithms and databases. The Open Bioinformatics Journal, a peer reviewed journal, is an important and reliable source of current information on the developments in the field. The emphasis will be on publishing quality articles rapidly and freely available worldwide.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信