Computational Systems Bioinformatics. CSB2003. Proceedings of the 2003 IEEE Bioinformatics Conference. CSB2003最新文献

筛选
英文 中文
Iterative rank based methods for clustering 基于迭代秩的聚类方法
S. Perrey, H. Brinck, A. Zielesny
{"title":"Iterative rank based methods for clustering","authors":"S. Perrey, H. Brinck, A. Zielesny","doi":"10.1109/CSB.2003.1227379","DOIUrl":"https://doi.org/10.1109/CSB.2003.1227379","url":null,"abstract":"Recently a new clustering algorithm was developed, useful in phylogenetic systematics and taxonomy. It derives a hierarchy from (dis)similarity data on a simple and rather natural way. It transforms a given dissimilarity by an iterative approach. Each iteration step consists of ranking the objects under consideration according to their pairwise dissimilarity and calculating the Euclidian distance of the resulting rank vectors. We investigate alterations of this order of steps as well as substitute the Euclidian distance by standard statistical measures for series of estimates. We evaluate the resulting different procedures on biological and other data sets of different structure regarding their underlying cluster systems. Thereby, potentials and limits of this kind of iterative approach become obvious.","PeriodicalId":147883,"journal":{"name":"Computational Systems Bioinformatics. CSB2003. Proceedings of the 2003 IEEE Bioinformatics Conference. CSB2003","volume":"54 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130850226","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Eulerian path methods for multiple sequence alignment 多序列比对的欧拉路径方法
M. Waterman, Yu Zhang
{"title":"Eulerian path methods for multiple sequence alignment","authors":"M. Waterman, Yu Zhang","doi":"10.1109/CSB.2003.1227296","DOIUrl":"https://doi.org/10.1109/CSB.2003.1227296","url":null,"abstract":"With the rapid increase in the size of genome sequence databases, the multiple sequence alignment problem is increasingly important and often requires the alignment of a large number of sequences. Beginning in 1975, many heuristic algorithms have been created to improve the speed of computation and the quality of alignment. We introduce a novel approach that is fundamentally distinct from all currently available methods. Our motivation comes from the Eulerian method for fragment assembly in DNA sequence determination, that transforms all the DNA sequencing fragments into a de Bruijn graph and then reduces sequence assembly to a Eulerian path problem.","PeriodicalId":147883,"journal":{"name":"Computational Systems Bioinformatics. CSB2003. Proceedings of the 2003 IEEE Bioinformatics Conference. CSB2003","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115837462","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Probability profiles-novel approach in tandem mass spectrometry De Novo sequencing 概率剖面-串联质谱新方法De Novo测序
T. Fridman, Robert M. Day, Jane Razumovsbya, Dong Xu, A. Gorin
{"title":"Probability profiles-novel approach in tandem mass spectrometry De Novo sequencing","authors":"T. Fridman, Robert M. Day, Jane Razumovsbya, Dong Xu, A. Gorin","doi":"10.1109/CSB.2003.1227351","DOIUrl":"https://doi.org/10.1109/CSB.2003.1227351","url":null,"abstract":"A novel method is proposed for deciphering experimental tandem mass spectra. A large database of previously resolved peptide spectra was used to determine \"neighborhood patterns\" for each peak category: C- or N-terminus ions, their dehydrated fragments, etc. The established patterns are applied to assign probabilities for new spectra peaks to fit into these categories. A few peaks often could be identified with a fair confidence creating strong \"anchor points\" for De Novo algorithm assembling sequence subgraphs. Our approach is utilizing all informational content of a given MS experimental data set, including peak intensities, weak and noisy peaks, and unusual fragments. We also discuss ways to provide learning features in our method: adjustments for a specific MS device and user initiated changes in the list of considered peak identities.","PeriodicalId":147883,"journal":{"name":"Computational Systems Bioinformatics. CSB2003. Proceedings of the 2003 IEEE Bioinformatics Conference. CSB2003","volume":"28 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115718058","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
PTC: an interactive tool for phylogenetic tree construction 系统发育树构建的交互式工具
Chen Yang, S. Khuri
{"title":"PTC: an interactive tool for phylogenetic tree construction","authors":"Chen Yang, S. Khuri","doi":"10.1109/CSB.2003.1227378","DOIUrl":"https://doi.org/10.1109/CSB.2003.1227378","url":null,"abstract":"A phylogenetic tree represents the evolutionary history of a group of organisms. In this work, we introduce a novel interactive tool for constructing phylogenetic trees, phylogenetic tree construction package. The package supports four well-known algorithms, unweighted pair group method using arithmetic average, neighbor joining, Fitch Margoliash, and maximum parsimony.","PeriodicalId":147883,"journal":{"name":"Computational Systems Bioinformatics. CSB2003. Proceedings of the 2003 IEEE Bioinformatics Conference. CSB2003","volume":"42 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115812619","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Statistical inference for well-ordered structures in nucleotide sequences 核苷酸序列中有序结构的统计推断
S. Le, Jih-Hsiang Chen, J. Maizel
{"title":"Statistical inference for well-ordered structures in nucleotide sequences","authors":"S. Le, Jih-Hsiang Chen, J. Maizel","doi":"10.1109/CSB.2003.1227318","DOIUrl":"https://doi.org/10.1109/CSB.2003.1227318","url":null,"abstract":"Distinct, local structures are frequently correlated with functional RNA elements involved in post-transcriptional regulation of gene expression. Discovery of microRNAs (miRNAs) suggests that there are a large class of small noncoding RNAs in eukaryotic genomes. These miRNAs have the potential to form distinct fold-back stem-loop structures. The prediction of those well-ordered folding sequences (WFS) in genomic sequences is very helpful for our understanding of RNA-based gene regulation and the determination of local RNA elements with structure-dependent functions. In this study, we describe a novel method for discovering the local WFS in a nucleotide sequence by Monte Carlo simulation and RNA folding. In the approach the quality of a local WFS is assessed by the energy difference (E/sub diff/) between the optimal structure folded in the local segment and its corresponding optimal, restrained structure where all the previous base pairings formed in the optimal structure are prohibited. Distinct WFS can be discovered by scanning successive segments along a sequence for evaluating the difference between Ediff of the natural sequence and those computed from randomly shuffled sequences. Our results indicate that the statistically significant WFS detected in the genomic sequences of Caenorhabditis elegans (C.elegans) F49E12, T07C5, T07D1, T10H9, Y56A3A and Y71G12B are coincident with known fold-back stem-loops found in miRNA precursors. The potential and implications of our method in searching for miRNAs in genomes is discussed.","PeriodicalId":147883,"journal":{"name":"Computational Systems Bioinformatics. CSB2003. Proceedings of the 2003 IEEE Bioinformatics Conference. CSB2003","volume":"214 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114593185","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
A block coding method that leads to significantly lower entropy values for the proteins and coding sections of Haemophilus influenzae 一种块编码方法,可显著降低流感嗜血杆菌蛋白质和编码部分的熵值
G. Sampath
{"title":"A block coding method that leads to significantly lower entropy values for the proteins and coding sections of Haemophilus influenzae","authors":"G. Sampath","doi":"10.1109/CSB.2003.1227329","DOIUrl":"https://doi.org/10.1109/CSB.2003.1227329","url":null,"abstract":"A simple statistical block code in combination with the LZW-based compression utilities gzip and compress has been found to increase by a significant amount the level of compression possible for the proteins encoded in Haemophilus influenzae, the first fully sequenced genome. The method yields an entropy value of 3.665 bits per symbol (bps), which is 0.657 bps below the maximum of 4.322 bps and an improvement of 0.452 bps over the best known to date of 4.118 bps using Matsumoto, Sadakane, and Imai's Iza-CTW algorithm. Calculations based on a compact inverse genetic code show that the genome has a maximum entropy of 1.757 bps for the coding regions, with a possibly lower actual entropy. These results hint at the existence of hitherto unexplored redundancies that do not show up in Markov models and are indicative of more internal structure than suspected in both the protein and the genome.","PeriodicalId":147883,"journal":{"name":"Computational Systems Bioinformatics. CSB2003. Proceedings of the 2003 IEEE Bioinformatics Conference. CSB2003","volume":"65 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130011353","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Genetic algorithm approach for the closest string problem 最近弦问题的遗传算法
Holger Mauch, M. Melzer, John S. Hu
{"title":"Genetic algorithm approach for the closest string problem","authors":"Holger Mauch, M. Melzer, John S. Hu","doi":"10.1109/CSB.2003.1227407","DOIUrl":"https://doi.org/10.1109/CSB.2003.1227407","url":null,"abstract":"A fundamental aspect of post-transcriptional gene silencing (PTGS) or RNA interference (RNAi) is the requirement of sequence homology between the transgene and viral or messenger RNAs being targeted. For example, virus-resistant transgenic plants are resistant only to viruses that are closely related (i.e. high sequence homology) to the virus from which the transgene was derived. One idea for broadening this resistance is to devise an artificial sequence that incorporates the sequence variation found in a viral population. This requires an algorithm which can determine an artificial sequence with an optimal (or at least a 90-95% ) homology to all of the viral sequences in a population. The genetic algorithm (GA) presented in this paper serves this purpose. It should be of great value to all researchers who utilize PTGS or RNAi. In the context of coding theory, the task is to find the radius of a code S /spl sub/ {A, C, G, T} /sup n/. In computational biology this problem is commonly referred to as the closest string problem. Experimental results suggest that this NP-complete optimization problem can be approached well with a custom-built GA.","PeriodicalId":147883,"journal":{"name":"Computational Systems Bioinformatics. CSB2003. Proceedings of the 2003 IEEE Bioinformatics Conference. CSB2003","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124005400","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 25
Molecular evaluation using comparative molecular interaction profile analysis system 利用比较分子相互作用谱分析系统进行分子评价
Yoshiharu Hayashi, Katsuyoshi Sakaguchi, Nao Iwata, Masaki Kobayashi
{"title":"Molecular evaluation using comparative molecular interaction profile analysis system","authors":"Yoshiharu Hayashi, Katsuyoshi Sakaguchi, Nao Iwata, Masaki Kobayashi","doi":"10.1109/CSB.2003.1227387","DOIUrl":"https://doi.org/10.1109/CSB.2003.1227387","url":null,"abstract":"Creating a new molecular description factor based on the results of computational docking study will add new dimensions in molecular evaluation. We propose a new molecular description factor analysis system named comparative molecular interaction profile analysis system (CoMIPA) in which the AutoDock program is used for docking evaluation of small molecule compound-protein complexes. Interaction energies are calculated, and the data sets obtained are named interaction profiles (IPFs). Using IPF as a scoring indicator, the system could be a powerful tool to cluster the interacting properties between small molecules and bio macromolecules such as ligand-receptor bindings. The system can use computational molecular docking results to explain biological events such as adverse drug reactions and possibly other unforeseen interactions caused by environmental hormones. We believe that the system has the potential to be a major stepping-stone for bridging computational science and biology.","PeriodicalId":147883,"journal":{"name":"Computational Systems Bioinformatics. CSB2003. Proceedings of the 2003 IEEE Bioinformatics Conference. CSB2003","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124071101","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Gene selection for multiclass prediction of microarray data 微阵列数据多类别预测的基因选择
Dechang Chen, D. Hua, J. Reifman, Xiuzhen Cheng
{"title":"Gene selection for multiclass prediction of microarray data","authors":"Dechang Chen, D. Hua, J. Reifman, Xiuzhen Cheng","doi":"10.1109/CSB.2003.1227385","DOIUrl":"https://doi.org/10.1109/CSB.2003.1227385","url":null,"abstract":"Gene expression data from microarrays have been successfully applied to class prediction, where the purpose is to classify and predict the diagnostic category of a sample by its gene expression profile. A typical microarray dataset consists of expression levels for a large number of genes on a relatively small number of samples. As a consequence, one basic and important question associated with class prediction is: how do we identify a small subset of informative genes contributing the most to the classification task? Many methods have been proposed but most focus on two-class problems, such as discrimination between normal and disease samples. This paper addresses selecting informative genes for multiclass prediction problems by jointly considering all the classes simultaneously. Our approach is based on the power of the genes in discriminating among the different classes (e.g., tumor types) and the existing correlation between genes. We formulate the expression levels of a given gene by a one-way analysis of variance model with heterogeneity of variances, and determine the discriminatory power of the gene by a test statistic designed to test the equality of the class means. In other words, the discriminatory power of a gene is associated with a Behrens-Fisher problem. Informative genes are chosen such that each selected gene has a high discriminatory power and the correlation between any pair of selected genes is low. Test statistics considered in this paper include the ANOVA F test statistic, the Brown-Forsythe test statistic, the Cochran test statistic, and the Welch test statistic. Their performances are evaluated over several classification methods applied to two publicly available microarray datasets. The results show that Brown-Forsythe test statistic achieves the best performance.","PeriodicalId":147883,"journal":{"name":"Computational Systems Bioinformatics. CSB2003. Proceedings of the 2003 IEEE Bioinformatics Conference. CSB2003","volume":"24 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124288880","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 22
A computational approach to reconstructing gene regulatory networks 重构基因调控网络的计算方法
Xutao Deng, H. Ali
{"title":"A computational approach to reconstructing gene regulatory networks","authors":"Xutao Deng, H. Ali","doi":"10.1109/CSB.2003.1227350","DOIUrl":"https://doi.org/10.1109/CSB.2003.1227350","url":null,"abstract":"With the rapid accumulation of gene expression data in publicly accessible databases, computational study of gene regulation has become an obtainable goal Intrinsic to this task will be data mining tools for inferring knowledge from biological data. In this project, we have developed a new data mining technique in which we adapt the connectivity of a recurrent neural network model by indexing regulatory elements and including nonlinear interaction terms. The new technique reduces the number of parameters by O(n), therefore increasing the chance of recovering the underlying regulatory network. In order to fit the model from data, we have developed a genetic fitting algorithm with O(n) time complexity and that adapts the connectivity during the fitting process until a satisfactory fit is obtained. We have implemented this fitting algorithm and applied it to two data sets: rat central nervous system development (CNS) data with 112 genes, and yeast whole genome data with 2467 genes. With multiple runs of the fitting algorithm, we were able to efficiently generate a statistical pattern of the model parameters from the data. Because of its adaptive features, this method will be especially useful for reconstructing coarse-grained gene regulatory network from large scale or genome scale gene expression data sets.","PeriodicalId":147883,"journal":{"name":"Computational Systems Bioinformatics. CSB2003. Proceedings of the 2003 IEEE Bioinformatics Conference. CSB2003","volume":"35 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125012563","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信