Source Code for Biology and Medicine最新文献

筛选
英文 中文
The non-negative matrix factorization toolbox for biological data mining. 生物数据挖掘的非负矩阵分解工具箱。
Source Code for Biology and Medicine Pub Date : 2013-04-16 DOI: 10.1186/1751-0473-8-10
Yifeng Li, Alioune Ngom
{"title":"The non-negative matrix factorization toolbox for biological data mining.","authors":"Yifeng Li,&nbsp;Alioune Ngom","doi":"10.1186/1751-0473-8-10","DOIUrl":"https://doi.org/10.1186/1751-0473-8-10","url":null,"abstract":"<p><strong>Background: </strong>Non-negative matrix factorization (NMF) has been introduced as an important method for mining biological data. Though there currently exists packages implemented in R and other programming languages, they either provide only a few optimization algorithms or focus on a specific application field. There does not exist a complete NMF package for the bioinformatics community, and in order to perform various data mining tasks on biological data.</p><p><strong>Results: </strong>We provide a convenient MATLAB toolbox containing both the implementations of various NMF techniques and a variety of NMF-based data mining approaches for analyzing biological data. Data mining approaches implemented within the toolbox include data clustering and bi-clustering, feature extraction and selection, sample classification, missing values imputation, data visualization, and statistical comparison.</p><p><strong>Conclusions: </strong>A series of analysis such as molecular pattern discovery, biological process identification, dimension reduction, disease prediction, visualization, and statistical comparison can be performed using this toolbox.</p>","PeriodicalId":35052,"journal":{"name":"Source Code for Biology and Medicine","volume":"8 1","pages":"10"},"PeriodicalIF":0.0,"publicationDate":"2013-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/1751-0473-8-10","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"31455061","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Inmembrane, a bioinformatic workflow for annotation of bacterial cell-surface proteomes. 细菌细胞表面蛋白质组注释的生物信息学工作流。
Source Code for Biology and Medicine Pub Date : 2013-03-19 DOI: 10.1186/1751-0473-8-9
Andrew J Perry, Bosco K Ho
{"title":"Inmembrane, a bioinformatic workflow for annotation of bacterial cell-surface proteomes.","authors":"Andrew J Perry,&nbsp;Bosco K Ho","doi":"10.1186/1751-0473-8-9","DOIUrl":"https://doi.org/10.1186/1751-0473-8-9","url":null,"abstract":"<p><strong>Background: </strong>The annotation of surface exposed bacterial membrane proteins is an important step in interpretation and validation of proteomic experiments. In particular, proteins detected by cell surface protease shaving experiments can indicate exposed regions of membrane proteins that may contain antigenic determinants or constitute vaccine targets in pathogenic bacteria.</p><p><strong>Results: </strong>Inmembrane is a tool to predict the membrane proteins with surface-exposed regions of polypeptide in sets of bacterial protein sequences. We have re-implemented a protocol for Gram-positive bacterial proteomes, and developed a new protocol for Gram-negative bacteria, which interface with multiple predictors of subcellular localization and membrane protein topology. Through the use of a modern scripting language, inmembrane provides an accessible code-base and extensible architecture that is amenable to modification for related sequence annotation tasks.</p><p><strong>Conclusions: </strong>Inmembrane easily integrates predictions from both local binaries and web-based queries to help gain an overview of likely surface exposed protein in a bacterial proteome. The program is hosted on the Github repository http://github.com/boscoh/inmembrane.</p>","PeriodicalId":35052,"journal":{"name":"Source Code for Biology and Medicine","volume":"8 1","pages":"9"},"PeriodicalIF":0.0,"publicationDate":"2013-03-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/1751-0473-8-9","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"31316292","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 14
SVAw - a web-based application tool for automated surrogate variable analysis of gene expression studies. SVAw - 一种基于网络的应用工具,用于对基因表达研究进行自动代理变量分析。
Source Code for Biology and Medicine Pub Date : 2013-03-11 DOI: 10.1186/1751-0473-8-8
Mehdi Pirooznia, Fayaz Seifuddin, Fernando S Goes, Jeffrey T Leek, Peter P Zandi
{"title":"SVAw - a web-based application tool for automated surrogate variable analysis of gene expression studies.","authors":"Mehdi Pirooznia, Fayaz Seifuddin, Fernando S Goes, Jeffrey T Leek, Peter P Zandi","doi":"10.1186/1751-0473-8-8","DOIUrl":"10.1186/1751-0473-8-8","url":null,"abstract":"<p><strong>Background: </strong>Surrogate variable analysis (SVA) is a powerful method to identify, estimate, and utilize the components of gene expression heterogeneity due to unknown and/or unmeasured technical, genetic, environmental, or demographic factors. These sources of heterogeneity are common in gene expression studies, and failing to incorporate them into the analysis can obscure results. Using SVA increases the biological accuracy and reproducibility of gene expression studies by identifying these sources of heterogeneity and correctly accounting for them in the analysis.</p><p><strong>Results: </strong>Here we have developed a web application called SVAw (Surrogate variable analysis Web app) that provides a user friendly interface for SVA analyses of genome-wide expression studies. The software has been developed based on open source bioconductor SVA package. In our software, we have extended the SVA program functionality in three aspects: (i) the SVAw performs a fully automated and user friendly analysis workflow; (ii) It calculates probe/gene Statistics for both pre and post SVA analysis and provides a table of results for the regression of gene expression on the primary variable of interest before and after correcting for surrogate variables; and (iii) it generates a comprehensive report file, including graphical comparison of the outcome for the user.</p><p><strong>Conclusions: </strong>SVAw is a web server freely accessible solution for the surrogate variant analysis of high-throughput datasets and facilitates removing all unwanted and unknown sources of variation. It is freely available for use at http://psychiatry.igm.jhmi.edu/sva. The executable packages for both web and standalone application and the instruction for installation can be downloaded from our web site.</p>","PeriodicalId":35052,"journal":{"name":"Source Code for Biology and Medicine","volume":"8 1","pages":"8"},"PeriodicalIF":0.0,"publicationDate":"2013-03-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3614430/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"31312047","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Git can facilitate greater reproducibility and increased transparency in science. Git可以促进科学中更大的可重复性和更高的透明度。
Source Code for Biology and Medicine Pub Date : 2013-02-28 DOI: 10.1186/1751-0473-8-7
Karthik Ram
{"title":"Git can facilitate greater reproducibility and increased transparency in science.","authors":"Karthik Ram","doi":"10.1186/1751-0473-8-7","DOIUrl":"https://doi.org/10.1186/1751-0473-8-7","url":null,"abstract":"<p><strong>Background: </strong>Reproducibility is the hallmark of good science. Maintaining a high degree of transparency in scientific reporting is essential not just for gaining trust and credibility within the scientific community but also for facilitating the development of new ideas. Sharing data and computer code associated with publications is becoming increasingly common, motivated partly in response to data deposition requirements from journals and mandates from funders. Despite this increase in transparency, it is still difficult to reproduce or build upon the findings of most scientific publications without access to a more complete workflow.</p><p><strong>Findings: </strong>Version control systems (VCS), which have long been used to maintain code repositories in the software industry, are now finding new applications in science. One such open source VCS, Git, provides a lightweight yet robust framework that is ideal for managing the full suite of research outputs such as datasets, statistical code, figures, lab notes, and manuscripts. For individual researchers, Git provides a powerful way to track and compare versions, retrace errors, explore new approaches in a structured manner, while maintaining a full audit trail. For larger collaborative efforts, Git and Git hosting services make it possible for everyone to work asynchronously and merge their contributions at any time, all the while maintaining a complete authorship trail. In this paper I provide an overview of Git along with use-cases that highlight how this tool can be leveraged to make science more reproducible and transparent, foster new collaborations, and support novel uses.</p>","PeriodicalId":35052,"journal":{"name":"Source Code for Biology and Medicine","volume":"8 1","pages":"7"},"PeriodicalIF":0.0,"publicationDate":"2013-02-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/1751-0473-8-7","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"31270564","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 170
RECOT: a tool for the coordinate transformation of next-generation sequencing reads for comparative genomics and transcriptomics. RECOT:用于比较基因组学和转录组学的下一代测序读数坐标转换的工具。
Source Code for Biology and Medicine Pub Date : 2013-02-26 DOI: 10.1186/1751-0473-8-6
Akiko Izawa, Jun Sese
{"title":"RECOT: a tool for the coordinate transformation of next-generation sequencing reads for comparative genomics and transcriptomics.","authors":"Akiko Izawa,&nbsp;Jun Sese","doi":"10.1186/1751-0473-8-6","DOIUrl":"https://doi.org/10.1186/1751-0473-8-6","url":null,"abstract":"<p><strong>Background: </strong>The whole-genome sequences of many non-model organisms have recently been determined. Using these genome sequences, next-generation sequencing based experiments such as RNA-seq and ChIP-seq have been performed and comparisons of the experiments between related species have provided new knowledge about evolution and biological processes. Although these comparisons require transformation of the genome coordinates of the reads between the species, current software tools are not suitable to convert the massive numbers of reads to the corresponding coordinates of other species' genomes.</p><p><strong>Results: </strong>Here, we introduce a set of programs, called REad COordinate Transformer (RECOT), created to transform the coordinates of short reads obtained from the genome of a query species being studied to that of a comparison target species after aligning the query and target gene/genome sequences. RECOT generates output in SAM format that can be viewed using recent genome browsers capable of displaying next-generation sequencing data.</p><p><strong>Conclusions: </strong>We demonstrate the usefulness of RECOT in comparing ChIP-seq results between two closely-related fruit flies. The results indicate position changes of a transcription factor binding site caused sequence polymorphisms at the binding site.</p>","PeriodicalId":35052,"journal":{"name":"Source Code for Biology and Medicine","volume":"8 1","pages":"6"},"PeriodicalIF":0.0,"publicationDate":"2013-02-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/1751-0473-8-6","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"31266233","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
CrypticIBDcheck: an R package for checking cryptic relatedness in nominally unrelated individuals. CrypticIBDcheck:一个R包,用于检查名义上不相关的个体的隐式相关性。
Source Code for Biology and Medicine Pub Date : 2013-02-06 DOI: 10.1186/1751-0473-8-5
Annick Nembot-Simo, Jinko Graham, Brad McNeney
{"title":"CrypticIBDcheck: an R package for checking cryptic relatedness in nominally unrelated individuals.","authors":"Annick Nembot-Simo,&nbsp;Jinko Graham,&nbsp;Brad McNeney","doi":"10.1186/1751-0473-8-5","DOIUrl":"https://doi.org/10.1186/1751-0473-8-5","url":null,"abstract":"<p><strong>Background: </strong>In population association studies, standard methods of statistical inference assume that study subjects are independent samples. In genetic association studies, it is therefore of interest to diagnose undocumented close relationships in nominally unrelated study samples.</p><p><strong>Results: </strong>We describe the R package CrypticIBDcheck to identify pairs of closely-related subjects based on genetic marker data from single-nucleotide polymorphisms (SNPs). The package is able to accommodate SNPs in linkage disequibrium (LD), without the need to thin the markers so that they are approximately independent in the population. Sample pairs are identified by superposing their estimated identity-by-descent (IBD) coefficients on plots of IBD coefficients for pairs of simulated subjects from one of several common close relationships.</p><p><strong>Conclusions: </strong>The methods implemented in CrypticIBDcheck are particularly relevant to candidate-gene association studies, in which dependent SNPs cluster in a relatively small number of genes spread throughout the genome. The accommodation of LD allows the use of all available genetic data, a desirable property when working with a modest number of dependent SNPs within candidate genes. CrypticIBDcheck is available from the Comprehensive R Archive Network (CRAN).</p>","PeriodicalId":35052,"journal":{"name":"Source Code for Biology and Medicine","volume":"8 1","pages":"5"},"PeriodicalIF":0.0,"publicationDate":"2013-02-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/1751-0473-8-5","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"31217825","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
BlaSTorage: a fast package to parse, manage and store BLAST results. BlaSTorage:一个快速的包来解析,管理和存储BLAST结果。
Source Code for Biology and Medicine Pub Date : 2013-01-30 DOI: 10.1186/1751-0473-8-4
Massimiliano Orsini, Simone Carcangiu
{"title":"BlaSTorage: a fast package to parse, manage and store BLAST results.","authors":"Massimiliano Orsini,&nbsp;Simone Carcangiu","doi":"10.1186/1751-0473-8-4","DOIUrl":"https://doi.org/10.1186/1751-0473-8-4","url":null,"abstract":"<p><strong>Unlabelled: </strong></p><p><strong>Background: </strong>Large-scale sequence studies requiring BLAST-based analysis produce huge amounts of data to be parsed. BLAST parsers are available, but they are often missing some important features, such as keeping all information from the raw BLAST output, allowing direct access to single results, and performing logical operations over them.</p><p><strong>Findings: </strong>We implemented BlaSTorage, a Python package that parses multi BLAST results and returns them in a purpose-built object-database format. Unlike other BLAST parsers, BlaSTorage retains and stores all parts of BLAST results, including alignments, without loss of information; a complete API allows access to all the data components.</p><p><strong>Conclusions: </strong>BlaSTorage shows comparable speed of more basic parser written in compiled languages as C++ and can be easily integrated into web applications or software pipelines.</p>","PeriodicalId":35052,"journal":{"name":"Source Code for Biology and Medicine","volume":"8 1","pages":"4"},"PeriodicalIF":0.0,"publicationDate":"2013-01-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/1751-0473-8-4","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"31199035","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
FCC - An automated rule-based processing tool for life science data. FCC -一个自动的基于规则的生命科学数据处理工具。
Source Code for Biology and Medicine Pub Date : 2013-01-11 DOI: 10.1186/1751-0473-8-3
Simon Barkow-Oesterreicher, Can Türker, Christian Panse
{"title":"FCC - An automated rule-based processing tool for life science data.","authors":"Simon Barkow-Oesterreicher,&nbsp;Can Türker,&nbsp;Christian Panse","doi":"10.1186/1751-0473-8-3","DOIUrl":"https://doi.org/10.1186/1751-0473-8-3","url":null,"abstract":"<p><strong>Background: </strong>Data processing in the bioinformatics field often involves the handling of diverse software programs in one workflow. The field is lacking a set of standards for file formats so that files have to be processed in different ways in order to make them compatible to different analysis programs. The problem is that mass spectrometry vendors at most provide only closed-source Windows libraries to programmatically access their proprietary binary formats. This prohibits the creation of an efficient and unified tool that fits all processing needs of the users. Therefore, researchers are spending a significant amount of time using GUI-based conversion and processing programs. Besides the time needed for manual usage, such programs also can show long running times for processing, because most of them make use of only a single CPU. In particular, algorithms to enhance data quality, e.g. peak picking or deconvolution of spectra, add waiting time for the users.</p><p><strong>Results: </strong>To automate these processing tasks and let them run continuously without user interaction, we developed the FGCZ Converter Control (FCC) at the Functional Genomics Center Zurich (FGCZ) core facility. The FCC is a rule-based system for automated file processing that reduces the operation of diverse programs to a single configuration task. Using filtering rules for raw data files, the parameters for all tasks can be custom-tailored to the needs of every single researcher and processing can run automatically and efficiently on any number of servers in parallel using all available CPU resources.</p><p><strong>Conclusions: </strong>FCC has been used intensively at FGCZ for processing more than hundred thousand mass spectrometry raw files so far. Since we know that many other research facilities have similar problems, we would like to report on our tool and the accompanying ideas for an efficient set-up for potential reuse.</p>","PeriodicalId":35052,"journal":{"name":"Source Code for Biology and Medicine","volume":"8 1","pages":"3"},"PeriodicalIF":0.0,"publicationDate":"2013-01-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/1751-0473-8-3","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"31154644","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 17
Knowledge Driven Variable Selection (KDVS) - a new approach to enrichment analysis of gene signatures obtained from high-throughput data. 知识驱动变量选择(KDVS)——一种从高通量数据中获得的基因特征富集分析的新方法。
Source Code for Biology and Medicine Pub Date : 2013-01-09 DOI: 10.1186/1751-0473-8-2
Grzegorz Zycinski, Annalisa Barla, Margherita Squillario, Tiziana Sanavia, Barbara Di Camillo, Alessandro Verri
{"title":"Knowledge Driven Variable Selection (KDVS) - a new approach to enrichment analysis of gene signatures obtained from high-throughput data.","authors":"Grzegorz Zycinski,&nbsp;Annalisa Barla,&nbsp;Margherita Squillario,&nbsp;Tiziana Sanavia,&nbsp;Barbara Di Camillo,&nbsp;Alessandro Verri","doi":"10.1186/1751-0473-8-2","DOIUrl":"https://doi.org/10.1186/1751-0473-8-2","url":null,"abstract":"<p><strong>Background: </strong>High-throughput (HT) technologies provide huge amount of gene expression data that can be used to identify biomarkers useful in the clinical practice. The most frequently used approaches first select a set of genes (i.e. gene signature) able to characterize differences between two or more phenotypical conditions, and then provide a functional assessment of the selected genes with an a posteriori enrichment analysis, based on biological knowledge. However, this approach comes with some drawbacks. First, gene selection procedure often requires tunable parameters that affect the outcome, typically producing many false hits. Second, a posteriori enrichment analysis is based on mapping between biological concepts and gene expression measurements, which is hard to compute because of constant changes in biological knowledge and genome analysis. Third, such mapping is typically used in the assessment of the coverage of gene signature by biological concepts, that is either score-based or requires tunable parameters as well, limiting its power.</p><p><strong>Results: </strong>We present Knowledge Driven Variable Selection (KDVS), a framework that uses a priori biological knowledge in HT data analysis. The expression data matrix is transformed, according to prior knowledge, into smaller matrices, easier to analyze and to interpret from both computational and biological viewpoints. Therefore KDVS, unlike most approaches, does not exclude a priori any function or process potentially relevant for the biological question under investigation. Differently from the standard approach where gene selection and functional assessment are applied independently, KDVS embeds these two steps into a unified statistical framework, decreasing the variability derived from the threshold-dependent selection, the mapping to the biological concepts, and the signature coverage. We present three case studies to assess the usefulness of the method.</p><p><strong>Conclusions: </strong>We showed that KDVS not only enables the selection of known biological functionalities with accuracy, but also identification of new ones. An efficient implementation of KDVS was devised to obtain results in a fast and robust way. Computing time is drastically reduced by the effective use of distributed resources. Finally, integrated visualization techniques immediately increase the interpretability of results. Overall, KDVS approach can be considered as a viable alternative to enrichment-based approaches.</p>","PeriodicalId":35052,"journal":{"name":"Source Code for Biology and Medicine","volume":" ","pages":"2"},"PeriodicalIF":0.0,"publicationDate":"2013-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/1751-0473-8-2","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"40220923","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
MmPalateMiRNA, an R package compendium illustrating analysis of miRNA microarray data. mmpalatmirna,一个R包纲要,说明miRNA微阵列数据的分析。
Source Code for Biology and Medicine Pub Date : 2013-01-08 DOI: 10.1186/1751-0473-8-1
Guy N Brock, Partha Mukhopadhyay, Vasyl Pihur, Cynthia Webb, Robert M Greene, M Michele Pisano
{"title":"MmPalateMiRNA, an R package compendium illustrating analysis of miRNA microarray data.","authors":"Guy N Brock,&nbsp;Partha Mukhopadhyay,&nbsp;Vasyl Pihur,&nbsp;Cynthia Webb,&nbsp;Robert M Greene,&nbsp;M Michele Pisano","doi":"10.1186/1751-0473-8-1","DOIUrl":"https://doi.org/10.1186/1751-0473-8-1","url":null,"abstract":"<p><strong>Background: </strong>MicroRNAs (miRNAs) constitute the largest family of noncoding RNAs involved in gene silencing and represent critical regulators of cell and tissue differentiation. Microarray expression profiling of miRNAs is an effective means of acquiring genome-level information of miRNA activation and inhibition, as well as the potential regulatory role that these genes play within a biological system. As with mRNA expression profiling arrays, miRNA microarrays come in a variety of platforms from numerous manufacturers, and there are a multitude of techniques available for reducing and analyzing these data.</p><p><strong>Results: </strong>In this paper, we present an analysis of a typical two-color miRNA microarray experiment using publicly available packages from R and Bioconductor, the open-source software project for the analysis of genomic data. Covered topics include visualization, normalization, quality checking, differential expression, cluster analysis, miRNA target identification, and gene set enrichment analysis. Many of these tools carry-over from the analysis of mRNA microarrays, but with some notable differences that require special attention. The paper is presented as a \"compendium\" which, along with the accompanying R package MmPalateMiRNA, contains all of the experimental data and source code to reproduce the analyses contained in the paper.</p><p><strong>Conclusions: </strong>The compendium presented in this paper will provide investigators with an access point for applying the methods available in R and Bioconductor for analysis of their own miRNA array data.</p>","PeriodicalId":35052,"journal":{"name":"Source Code for Biology and Medicine","volume":" ","pages":"1"},"PeriodicalIF":0.0,"publicationDate":"2013-01-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/1751-0473-8-1","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"40220194","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 35
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信