A. Roy, J. Shao, J. Hartung, W. Schneider, R. Brlansky
{"title":"A Case Study on Discovery of Novel Citrus Leprosis Virus Cytoplasmic Type 2 Utilizing Small RNA Libraries by Next Generation Sequencing and Bioinformatic Analyses","authors":"A. Roy, J. Shao, J. Hartung, W. Schneider, R. Brlansky","doi":"10.4172/2153-0602.1000129","DOIUrl":"https://doi.org/10.4172/2153-0602.1000129","url":null,"abstract":"The advent of innovative sequencing technology referred to as “Next-Generation” Sequencing (NGS), provides a new approach to identify the ‘unknown known’ and ‘unknown unknown’ viral pathogens without a priori knowledge. The genomes of plant viruses can be rapidly determined even when occurring at extremely low titers in the infected host. The method is based on massively parallel sequencing of the population of small RNA molecules 18-35 nucleotides in length produced by RNA silencing host defense. Improvements in chemistries, bioinformatic tools and advances in engineering has reduced the costs of NGS, increased its accessibility, and enabled its application in the field of plant virology. In this review, we discuss the utilization of the Illumina GA IIX platform combined with the application of molecular biology and bioinformatic tools for the discovery of a novel cytoplasmic Citrus leprosis virus (CiLV). This new virus produced symptoms typical of CiLV but was not detected with either serological or PCR-based assays for the previously described virus. The new viral genome was also present in low titer in sweet orange (Citrus sinensis), an important horticultural crop with incomplete genomic resources. This is a common situation in horticultural research and provides an example of the broader utility of this approach. In addition to the discovery of novel viruses, the sequence data may be useful for studies of viral evolution and ecology and the interactions between viral and host transcriptomes.","PeriodicalId":15630,"journal":{"name":"Journal of Data Mining in Genomics & Proteomics","volume":"19 1","pages":"1-6"},"PeriodicalIF":0.0,"publicationDate":"2016-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76838281","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Qihua Tan, Martin Tepel, Hans Christian Beck, Lars Melholt Rasmussen, Jacob v. B. Hjelmborg
{"title":"Generalized Measure of Dependency for Analysis of Omics Data","authors":"Qihua Tan, Martin Tepel, Hans Christian Beck, Lars Melholt Rasmussen, Jacob v. B. Hjelmborg","doi":"10.4172/2153-0602.1000183","DOIUrl":"https://doi.org/10.4172/2153-0602.1000183","url":null,"abstract":"J Data Mining Genomics Proteomics ISSN: 2153-0602 JDMGP, an open access journal Volume 7 • Issue 1 • 1000183 As a popular measure of association, the Pearson’s correlation coefficient has been frequently used in omics data analysis e.g. in feature selection process during prediction model building using high dimensional gene expression data [1] and proteomics data [2]. However, Pearson’s correlation coefficient captures only linear relationships which greatly limit its use in situations of nonlinear association. Statistical modeling for dealing with nonlinear patterns can be complicated [3] and requires intensive computation in case of high dimensional data such as microarray data or genome sequence data. In the analysis of omics data, high dimension means that there can be diverse patterns of dependence not limited to linearity. In this situation, the generalized measures of association more adequate than the Pearson’s correlation and capable of capturing both linear and nonlinear correlations are needed. Recently, generalized correlation coefficients have been frequently discussed [4] and their application to large scale genomic data illustrated through microarray gene expression time-course analysis [5].","PeriodicalId":15630,"journal":{"name":"Journal of Data Mining in Genomics & Proteomics","volume":"11 1","pages":"1000183"},"PeriodicalIF":0.0,"publicationDate":"2016-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84274963","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Proteomics in Medicine","authors":"M. Simonian","doi":"10.4172/2153-0602.1000e126","DOIUrl":"https://doi.org/10.4172/2153-0602.1000e126","url":null,"abstract":"Proteomics is the identification of proteins in a tissue or cell, and the determination of their function, structure and modifications [1,2]. The term proteome was coined by Marc Wilkins to describe all the proteins expressed by a genome [1]. It is considered to be the next step in modern biology. Proteomics is dynamic compared to genomics because it changes constantly to reflect the cell’s environment. The main objectives in the field of proteomics are: (i) Identify all proteins; (ii) Analyse differential protein expression in different samples; (iii) Characterise proteins by identifying and studying their function and cellular localisation; and (iv) Understand protein interaction networks.","PeriodicalId":15630,"journal":{"name":"Journal of Data Mining in Genomics & Proteomics","volume":"20 1","pages":"1-1"},"PeriodicalIF":0.0,"publicationDate":"2016-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"83940121","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Comparative Genomics of Salmonella Could Reveal Key Features of Adaptation","authors":"G. Nava, Yajaira Esquivel-Hern, Ez","doi":"10.4172/2153-0602.1000E121","DOIUrl":"https://doi.org/10.4172/2153-0602.1000E121","url":null,"abstract":"Worldwide, Salmonella enterica remains an important health threat. A recent study by the World Health Organization estimated that nontyphoidal S. enterica causes ∼ 230,000 deaths annually [1]. The biology of this pathogen has been studied for almost a century [2]; until recently, however, we have started to elucidate genomic features of adaptation to its hosts. Now, it is know that S. enterica has evolved to establish sympatric (generalists) and allopatric (specialists) and associations with its host. For example, S. enterica subspecies enterica, serotypes Choleraesuis, Dublin and Gallinarum have established allopatric association with porcine, bovine and avian species, respectively. In contrast, serotype Enteritidis and Typhimurium have adapted a sympatric strategy to colonize the intestinal tract of a broad number of avian and mammalian species [3]. This progress in knowledge has been accomplished with the aid of molecular microbiology.","PeriodicalId":15630,"journal":{"name":"Journal of Data Mining in Genomics & Proteomics","volume":"37 1","pages":"1-2"},"PeriodicalIF":0.0,"publicationDate":"2016-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82616188","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Interpretable Boosted Decision Trees for Prediction of Mortality Following Allogeneic Hematopoietic Stem Cell Transplantation","authors":"R. Shouval, A. Nagler, M. Labopin, R. Unger","doi":"10.4172/2153-0602.1000184","DOIUrl":"https://doi.org/10.4172/2153-0602.1000184","url":null,"abstract":"J Data Mining Genomics Proteomics ISSN: 2153-0602 JDMGP, an open access journal Volume 7 • Issue 1 • 1000184 Allogeneic (allo) hematopoietic stem transplantation (HSCT) is a potentially curative procedure for selected patients with hematological disease. Despite a reduction in transplant risk in recent years, morbidity and mortality remains substantial, making the decision of whom, how and when to transplant, of great importance [1].","PeriodicalId":15630,"journal":{"name":"Journal of Data Mining in Genomics & Proteomics","volume":"30 1 1","pages":"1-2"},"PeriodicalIF":0.0,"publicationDate":"2016-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77359203","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Data Mining and Geo-Engineering","authors":"Luis Sousa","doi":"10.4172/2153-0602.C1.001","DOIUrl":"https://doi.org/10.4172/2153-0602.C1.001","url":null,"abstract":"","PeriodicalId":15630,"journal":{"name":"Journal of Data Mining in Genomics & Proteomics","volume":"69 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2015-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"73874438","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"In the Race towards a Better Diagnosis and Prognostication of Cancer Patients Long Non-Coding Intergenic RNA's (lincrna's) have found their Place","authors":"J. Mao, H. Weier","doi":"10.4172/2153-0602.1000E120","DOIUrl":"https://doi.org/10.4172/2153-0602.1000E120","url":null,"abstract":"","PeriodicalId":15630,"journal":{"name":"Journal of Data Mining in Genomics & Proteomics","volume":"16 1","pages":"1-2"},"PeriodicalIF":0.0,"publicationDate":"2015-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81077021","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Dengue Fever Prediction: A Data Mining Problem","authors":"K. Shaukat, N. Masood, S. Mehreen, Ulya Azmeen","doi":"10.4172/2153-0602.1000181","DOIUrl":"https://doi.org/10.4172/2153-0602.1000181","url":null,"abstract":"Dengue is a threatening disease caused by female mosquitos. It is typically found in widespread hot regions. From long periods of time, Experts are trying to find out some of features on Dengue disease so that they can rightly categorize patients because different patients require different types of treatment. Pakistan has been target of Dengue disease from last few years. Dengue fever is used in classification techniques to evaluate and compare their performance. The dataset was collected from District Headquarter Hospital (DHQ) Jhelum. For properly categorizing our dataset, different classification techniques are used. These techniques are Naive Bayesian, REP Tree, Random tree, J48 and SMO. WEKA was used as Data mining tool for classification of data. Firstly we will evaluate the performance of all the techniques separately with the help of tables and graphs depending upon dataset and secondly we will compare the performance of all the techniques.","PeriodicalId":15630,"journal":{"name":"Journal of Data Mining in Genomics & Proteomics","volume":"132 1","pages":"1-5"},"PeriodicalIF":0.0,"publicationDate":"2015-10-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76152379","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Genome Mining and Transcriptional Analysis of Bacteriocin Genes in Enterococcus faecium CRL1879","authors":"N. Suárez, J. Bonacina, E. Hebert, L. Saavedra","doi":"10.4172/2153-0602.1000180","DOIUrl":"https://doi.org/10.4172/2153-0602.1000180","url":null,"abstract":"Among 151 bacterial isolates from nine artisanal cheeses, Enterococcus faecium CRL 1879 showed antibacterial activity against the food-borne pathogen Listeria monocytogenes. The isolate produced a proteinase K-sensitive compound in the cell free supernatant. Genome analysis demonstrated the presence of enterocin A, enterocin B, enterocin P, enterocin SE-K4-like and enterocin X biosynthetic gene clusters. Nucleotide sequences encoding for a putative two-component bacteriocin were detected using bioinformatics tools, here named enterocin CRL1879αβ. A transcriptional analysis of all bacteriocin genes by quantitative real time PCR analysis (qRT-PCR) revealed the transcription of each enterocin gene at different levels. Finally, analysis of bacteriocin genes distribution in 251 E. faecium bioprojects was performed and compared to those identify in E. faecium CRL1879. The discriminative analysis demonstrated that bacteriocin genes are widely distributed among Enterococcus, independently of the origin of the strain. \u0000The results presented in this paper represent a unique finding since this is the first demonstration of an E. faecium strain isolated from an artisanal cheese with the complete genetic machinery to produce six classes II and one class III bacteriocins.","PeriodicalId":15630,"journal":{"name":"Journal of Data Mining in Genomics & Proteomics","volume":"32 1","pages":"1-8"},"PeriodicalIF":0.0,"publicationDate":"2015-10-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84794603","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Leveraging Lymphoblastoid Cell Lines for Drug Response Modeling","authors":"A. Motsinger-Reif, Daniel M. Rotroff","doi":"10.4172/2153-0602.1000179","DOIUrl":"https://doi.org/10.4172/2153-0602.1000179","url":null,"abstract":"Lymphoblastoid cell lines (LCL) are becoming popular tools for modeling drug response. LCLs, and other in vitro assays, offer the ability to test many drugs, doses, and biological samples relatively quickly and inexpensively. In addition, a unique advantage to LCLs is that they are available from a large cohort of individuals, providing the capability to test for genetic variability on a scale not readily available in other in vitro systems. Since oftentimes the genotype data is publically available, the experimental costs can be limited to the cost of the drug response phenotyping. Here we describe several advantages and limitations of LCLs. In addition we review several important aspects of LCL experimental design and statistical analysis. Lastly, we present an example of LCLs being successfully used to identify candidate single nucleotide polymorphisms and genes for variability in response to a chemotherapeutic used to treat chronic myeloid leukemia.","PeriodicalId":15630,"journal":{"name":"Journal of Data Mining in Genomics & Proteomics","volume":"62 1","pages":"1-4"},"PeriodicalIF":0.0,"publicationDate":"2015-09-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"90251801","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}