M. Sánchez-Castillo, I. M. Tienda-Luna, D. Blanco-Navarro, M. Perez
{"title":"Revision of the variational Bayesian method for uncovering genes regulatory network","authors":"M. Sánchez-Castillo, I. M. Tienda-Luna, D. Blanco-Navarro, M. Perez","doi":"10.1109/GENSiPS.2011.6169481","DOIUrl":"https://doi.org/10.1109/GENSiPS.2011.6169481","url":null,"abstract":"We have revised the Markov model used in the analysis of microarray time-series data to uncover the gene regulatory network. Previous linear models establishes genetic relations between the microarray data which are assumed to have noise. We propose a new model to distinguish between observed data and real expression levels. The new model does not overestimate the noise and fits better the nature of the problem. We have also studied how the variational Bayesian algorithm can be modified to solve this problem. Finally, we have performed a prior analysis to include objective knowledge into the Bayesian methodology.","PeriodicalId":181666,"journal":{"name":"2011 IEEE International Workshop on Genomic Signal Processing and Statistics (GENSIPS)","volume":"29 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124924915","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Nguyen T. Nguyen, Xiaolin Zhang, Yunji Wang, Hai-Chao Han, Yufang Jin, Galen Schmidt, R. Lange, R. Chilton, M. Lindsey
{"title":"Targeting myocardial infarction-specific protein interaction network using computational analyses","authors":"Nguyen T. Nguyen, Xiaolin Zhang, Yunji Wang, Hai-Chao Han, Yufang Jin, Galen Schmidt, R. Lange, R. Chilton, M. Lindsey","doi":"10.1109/GENSiPS.2011.6169479","DOIUrl":"https://doi.org/10.1109/GENSiPS.2011.6169479","url":null,"abstract":"Myocardial infarction (MI) is a leading cause of deaths in the United States. Currently, the high mortality rate in MI is partially due to the lacking of diagnostic and prognostic biomarkers. Therefore, the purpose of this study was to develop a framework to understand MI-specific protein interaction network and identify MI-specific biomarkers with public databases and literatures. We established an MI-specific protein interaction network, examined the statistical significance of the MI-specific network compared to random networks, and evaluated the importance of the MI-specified proteins with its network properties and research intensity. The established MI-specific protein interaction network had less sub-networks and more links in addition to higher measurements on closeness centrality, clustering coefficient and degree centrality, suggesting a strong connectivity of hub proteins, which confirmed the determination of key proteins based on structural evaluation. In summary, this study established a framework to integrate published data in literatures and provided a promising way to identify biomarkers post-myocardial infarction.","PeriodicalId":181666,"journal":{"name":"2011 IEEE International Workshop on Genomic Signal Processing and Statistics (GENSIPS)","volume":"47 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122216703","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Mattia Bosio, Pau Bellot, P. Salembier, Albert Oliveras-Vergés
{"title":"Feature set enhancement via hierarchical clustering for microarray classification","authors":"Mattia Bosio, Pau Bellot, P. Salembier, Albert Oliveras-Vergés","doi":"10.1109/GENSiPS.2011.6169486","DOIUrl":"https://doi.org/10.1109/GENSiPS.2011.6169486","url":null,"abstract":"A new method for gene expression classification is proposed in this paper. In a first step, the original feature set is enriched by including new features, called metagenes, produced via hierarchical clustering. In a second step, a reliable classifier is built from a wrapper feature selection process. The selection relies on two criteria: the classical classification error rate and a new reliability measure. As a result, a classifier with good predictive ability using as few features as possible to reduce the risk of overfitting is obtained. This method has been tested on three public cancer datasets: leukemia, lymphoma and colon. The proposed method has obtained interesting classification results and the experiments have confirmed the utility of both metagenes and feature ranking criterion to improve the final classifier.","PeriodicalId":181666,"journal":{"name":"2011 IEEE International Workshop on Genomic Signal Processing and Statistics (GENSIPS)","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128220701","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Joint corresponding feature identification and alignment for multiple LC/MS replicates","authors":"Jian Cui, Xuepo Ma, Jianqiu Zhang","doi":"10.1109/GENSiPS.2011.6169456","DOIUrl":"https://doi.org/10.1109/GENSiPS.2011.6169456","url":null,"abstract":"In Liquid Chromatography/Mass Spectrometry (LC-MS), identifying corresponding peptide features (LC peaks) in multiple replicate datasets plays a crucial role in the differential analysis of complex peptide or protein samples for biomarker discovery. Given a peptide sequence, we aim at identifying its LC peak intervals in all datasets simultaneously. Generally, features are first identified in each replicate dataset, and then the features are aligned using warping functions. In such a procedure, the error in feature identification will propagate to alignment. Instead, we consider the problem of joint feature identification and alignment in multiple datasets. Since accurate feature identification improves the accuracy of corresponding feature alignment and vice versa, joint processing provides better performance than separate processing. We propose an algorithm which combines peak identification quality scores, time shifts and the similarity of LC peak shapes between candidate corresponding features for accurate alignment. In addition, we also incorporate the approximate elution time interval of a peptide stored in an Accurate Time and Mass (ATM) database when available. We test our algorithm on publicly available datasets, and we compare its with that of separate feature identification and alignment. Results show that the number of accurately identified corresponding features is improved significantly by using the proposed method.","PeriodicalId":181666,"journal":{"name":"2011 IEEE International Workshop on Genomic Signal Processing and Statistics (GENSIPS)","volume":"40 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131427417","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Zhan Zhou, Qi Li, Julie Tudyk, Yong-Quan Li, Yufeng Wang
{"title":"ECF sigma factor-associated regulatory networks in Streptomyces colicolor A3(2)","authors":"Zhan Zhou, Qi Li, Julie Tudyk, Yong-Quan Li, Yufeng Wang","doi":"10.1109/GENSiPS.2011.6169475","DOIUrl":"https://doi.org/10.1109/GENSiPS.2011.6169475","url":null,"abstract":"Sigma factors play important roles in transcriptional regulation in bacteria. Streptomyces coelicolor, a soil bacterium which is best known for its production of a variety of antibiotics, possesses a sophisticated transcriptional machinery. Among 58 sigma factors identified in the genome, 44 are extra-cytoplasmic function (ECF) sigma factors, which coordinate cellular responses to external signals. In this paper, we report a comprehensive analysis of the protein-protein association networks involving these sigma factors. The discovery of new network components and interactions has shed lights on the mechanisms underlying stress responses and morphological differentiation.","PeriodicalId":181666,"journal":{"name":"2011 IEEE International Workshop on Genomic Signal Processing and Statistics (GENSIPS)","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132403449","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Gene network inference via sparse structural equation modeling with genetic perturbations","authors":"Xiaodong Cai, J. Bazerque, G. Giannakis","doi":"10.1109/GENSiPS.2011.6169445","DOIUrl":"https://doi.org/10.1109/GENSiPS.2011.6169445","url":null,"abstract":"Structural equation models (SEMs) have been recently proposed to infer gene regulatory network using gene expression data and genetic perturbations. However, lack of efficient inference method for SEMs prevents practical use of SEMs in the inference of relatively large gene networks. In this paper, relying on the sparsity of gene networks, we develop an efficient SEM-based method for inferring gene networks using both gene expression and expression quantitative trait locus (eQTL) data. Simulated tests demonstrate that the novel method significantly outperform state-of-the-art methods in the field.","PeriodicalId":181666,"journal":{"name":"2011 IEEE International Workshop on Genomic Signal Processing and Statistics (GENSIPS)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117320703","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"The effect of certain Boolean functions in stability of networks with varying topology","authors":"V. H. Louzada, Fabricio M. Lopes, R. F. Hashimoto","doi":"10.1109/GENSiPS.2011.6169431","DOIUrl":"https://doi.org/10.1109/GENSiPS.2011.6169431","url":null,"abstract":"The stability of biological organisms is a major feature which contributes to their survival in the environment. However, the study of the stability in vivo is a very hard challenge. An objective way for the stability analysis is to adopt the Boolean network model, which can qualitatively describe the behavior of biological networks as well as allows the analysis of the results in a comprehensive and global way. Besides, certain Boolean function classes play an important role in Boolean network stability. In addition to this relationship, it is expected that many classes of network topology assigns greater or lesser resistance to damage. In this work, we define “local stability” as the stability resulted from the presence of a certain Boolean function class, such as the canalyzing Boolean functions, and “global stability” as the result of a certain network topology, such as the scale-free topology. Next, we investigate the interaction between these two factors using the size of the largest basin of attraction and generalized Derrida curves as measures for network stability. Our results show that there is a “topology order” for certain Boolean function classes, and that these two factors should be jointly addressed in future analysis of network stability.","PeriodicalId":181666,"journal":{"name":"2011 IEEE International Workshop on Genomic Signal Processing and Statistics (GENSIPS)","volume":"6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129410430","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Inference of a genetic regulatory network model from limited time series data","authors":"Saad Haider, R. Pal","doi":"10.1109/GENSiPS.2011.6169470","DOIUrl":"https://doi.org/10.1109/GENSiPS.2011.6169470","url":null,"abstract":"Numerous approaches exist for modeling of genetic regulatory networks (GRNs) but the low sampling rates often employed in biological studies prevents the inference of detailed models from experimental data. In this paper, we analyze the issues involved in estimating a model of a GRN from single cell line time series data with limited time points. We present an inference approach for a Boolean Network (BN) model of a GRN from limited transcriptomic or proteomic time series data based on prior biological knowledge of connectivity, constraints on attractor structure and robust design. Through theoretical analysis and simulations, we showed the rarity of arriving at a BN from limited time series data with plausible biological structure using random connectivity and absence of structure in data. We applied our inference approach to 6 time point transcriptomic data on HMEC cell lines after application of EGF and generated a BN with a plausible biological structure satisfying the data.","PeriodicalId":181666,"journal":{"name":"2011 IEEE International Workshop on Genomic Signal Processing and Statistics (GENSIPS)","volume":"18 12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132280026","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Carmen D. Tekwe, Alan R. Dabney, Raymond J. Carroll
{"title":"Application of survival analysis methodology to the quantitative analysis of LC-MS proteomics data","authors":"Carmen D. Tekwe, Alan R. Dabney, Raymond J. Carroll","doi":"10.1109/GENSiPS.2011.6169453","DOIUrl":"https://doi.org/10.1109/GENSiPS.2011.6169453","url":null,"abstract":"Protein abundance in quantitative proteomics is often based on observed spectral features derived from LC-MS experiments. Peak intensities are largely non-Normal in distribution. Furthermore, LC-MS data frequently have large proportions of missing peak intensities due to censoring mechanisms on low-abundance spectral features. Recognizing that the observed peak intensities detected with the LC-MS method are all positive, skewed and often left-censored, we propose using survival methodology to carry out differential expression analysis of proteins. Various standard statistical techniques including non-parametric tests such as the Kolmogorov-Smirnov and Wilcoxon-Mann-Whitney rank sum tests, and the parametric survival model, accelerated failure time model with the Weibull distribution were used to detect any differentially expressed proteins. The statistical operating characteristics of each method are explored using both real and simulated data set.","PeriodicalId":181666,"journal":{"name":"2011 IEEE International Workshop on Genomic Signal Processing and Statistics (GENSIPS)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134513621","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Geometrical modification of wavelet SVM kernels and its application in microarray analysis","authors":"Hong Cai, Yufeng Wang","doi":"10.1109/GENSiPS.2011.6169467","DOIUrl":"https://doi.org/10.1109/GENSiPS.2011.6169467","url":null,"abstract":"The selection and design of appropriate kernel functions play a key role in effective support vector machine (SVM) leaning. A general strategy is to customize the existent kernel functions to fit into the data property and structure. Wavelet kernels have been developed to approximate arbitrary nonlinear functions for signal processing. In this paper, we propose novel wavelet kernels based on the Riemannian geometrical structure theory, by constructing a hyperplane with better spatial resolution. This wavelet kernel SVM approach was applied to the yeast time course microarray dataset and outperformed the traditional Gaussian kernel and polynomial kernel.","PeriodicalId":181666,"journal":{"name":"2011 IEEE International Workshop on Genomic Signal Processing and Statistics (GENSIPS)","volume":"119 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133040760","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}