Proceedings. IEEE Computer Society Bioinformatics Conference: Latest Articles, Page 9

Simultaneous classification and feature clustering using discriminant vector quantization with applications to microarray data analysis

Proceedings. IEEE Computer Society Bioinformatics Conference Pub Date : 2002-08-14 DOI: 10.1109/CSB.2002.1039347

Jia Li, H. Zha

Abstract: In many applications of supervised learning, automatic feature clustering is often desirable for a better understanding of the interaction among the various features as well as the interplay between the features and the class labels. In addition, for high dimensional data sets, feature clustering has the potential for improvement in classification accuracy and reduction in computational complexity. In this paper, a method is developed for simultaneous classification and feature clustering by extending discriminant vector quantization (DVQ), a prototype classification method derived from the principle of minimum description length using source coding techniques. The method incorporates feature clustering with classification performed by fusing features in the same clusters. To illustrate its effectiveness, the method has been applied to microarray gene expression data for human lymphoma classification. It is demonstrated that incorporating feature clustering improves classification accuracy, and the clusters generated match well with biologically meaningful gene expression signature groups.

Pages: 246-255

Citations: 17
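The idea in the abstract above of classifying on features fused within clusters can be illustrated with a small sketch. This is not the DVQ method itself: the paper learns the feature clusters jointly with the classifier, whereas here the clusters are assumed to be given, fusion is assumed to be a plain average, and classification is a simple nearest-prototype rule. The function names `fuse_features`, `class_prototypes`, and `classify` are hypothetical.

```python
def fuse_features(sample, clusters):
    """Replace each feature cluster by the mean of its member features.

    `clusters` is a list of index lists (assumed given, not learned here).
    """
    return [sum(sample[i] for i in idx) / len(idx) for idx in clusters]


def class_prototypes(samples, labels, clusters):
    """Build one prototype per class: the mean fused vector of its samples."""
    sums, counts = {}, {}
    for sample, label in zip(samples, labels):
        fused = fuse_features(sample, clusters)
        if label not in sums:
            sums[label] = [0.0] * len(fused)
            counts[label] = 0
        sums[label] = [s + f for s, f in zip(sums[label], fused)]
        counts[label] += 1
    return {label: [s / counts[label] for s in vec] for label, vec in sums.items()}


def classify(sample, prototypes, clusters):
    """Assign the label of the nearest prototype in the fused feature space."""
    fused = fuse_features(sample, clusters)

    def squared_distance(prototype):
        return sum((a - b) ** 2 for a, b in zip(fused, prototype))

    return min(prototypes, key=lambda label: squared_distance(prototypes[label]))


# Toy data: four "genes" grouped into two clusters, two classes.
clusters = [[0, 1], [2, 3]]
samples = [
    [1.0, 1.2, 5.0, 5.1],
    [0.9, 1.1, 4.8, 5.2],
    [5.0, 5.2, 1.0, 0.9],
    [4.9, 5.1, 1.1, 1.0],
]
labels = ["A", "A", "B", "B"]
prototypes = class_prototypes(samples, labels, clusters)
```

Fusing by cluster shrinks the dimension from the number of features to the number of clusters, which is the source of the reduced computational cost the abstract mentions.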

Constrained multiple sequence alignment tool development and its application to RNase family alignment

Proceedings. IEEE Computer Society Bioinformatics Conference Pub Date : 2002-08-14 DOI: 10.1109/CSB.2002.1039336

C. Tang, C. Lu, M. Chang, Yin-Te Tsai, Yuh-Ju Sun, K. Chao, Jia-Ming Chang, Yu-Han Chiou, Chia-Mao Wu, Hao-Teng Chang, Wei-I Chou

Citations: 57

A security system for personal genome information at DNA level

Proceedings. IEEE Computer Society Bioinformatics Conference Pub Date : 2002-08-14 DOI: 10.1109/CSB.2002.1039353

Y. Kawazoe, Toshikazu Shiba, Masahito Yamamoto, A. Ohuchi

Citations: 0

An efficient branch-and-bound algorithm for the assignment of protein backbone NMR peaks

Proceedings. IEEE Computer Society Bioinformatics Conference Pub Date : 2002-08-14 DOI: 10.1109/CSB.2002.1039339

Guohui Lin, Dong Xu, Zhi-Zhong Chen, Tao Jiang, Jianjun Wen, Ying Xu

Abstract: NMR resonance assignment is one of the key steps in solving an NMR protein structure. The assignment process links resonance peaks to individual residues of the target protein sequence, providing the prerequisite for establishing intra- and inter-residue spatial relationships between atoms. The assignment process is tedious and time-consuming and can take many weeks. Although a number of computer programs exist to assist the assignment process, many NMR labs still do the assignments manually to ensure quality. This paper presents a new computational method based on our recent work towards automating the assignment process, particularly the process of backbone resonance peak assignment. We formulate the assignment problem as a constrained weighted bipartite matching problem. While the problem, in the most general situation, is NP-hard, we present an efficient solution based on a branch-and-bound algorithm with effective bounding techniques and a greedy filtering algorithm for reducing the search space. Our experimental results on 70 instances of (pseudo) real NMR data derived from 14 proteins demonstrate that the new solution runs much faster than a recently introduced (exhaustive) two-layer algorithm and recovers more correct peak assignments than the two-layer algorithm.

Pages: 165-174

Citations: 22
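The branch-and-bound idea in the abstract above can be sketched on a simplified problem. The sketch below is not the authors' algorithm: it solves a plain (unconstrained) maximum-weight bipartite assignment, omits the NMR-specific constraints, and uses a greedy matching only to seed the initial bound rather than as the paper's filtering step. The names `greedy_assignment` and `branch_and_bound` and the bound (sum of per-row maxima) are assumptions chosen for illustration.

```python
def greedy_assignment(weights):
    """Seed solution: repeatedly take the heaviest edge whose row and column are free."""
    n = len(weights)
    edges = sorted(
        ((weights[i][j], i, j) for i in range(n) for j in range(n)),
        reverse=True,
    )
    assignment = [None] * n
    used_cols = set()
    total = 0.0
    for w, i, j in edges:
        if assignment[i] is None and j not in used_cols:
            assignment[i] = j
            used_cols.add(j)
            total += w
    return total, assignment


def branch_and_bound(weights):
    """Maximum-weight assignment of rows to columns for a square weight matrix."""
    n = len(weights)
    best_score, best_assignment = greedy_assignment(weights)

    # Optimistic bound: rows i..n-1 can never score more than their row maxima.
    suffix_bound = [0.0] * (n + 1)
    for i in range(n - 1, -1, -1):
        suffix_bound[i] = suffix_bound[i + 1] + max(weights[i])

    current = [None] * n
    used = [False] * n

    def search(i, score):
        nonlocal best_score, best_assignment
        if i == n:
            if score > best_score:
                best_score, best_assignment = score, current[:]
            return
        if score + suffix_bound[i] <= best_score:
            return  # prune: this branch cannot beat the incumbent
        # Try heavier columns first so good solutions are found early.
        for j in sorted(range(n), key=lambda col: -weights[i][col]):
            if not used[j]:
                used[j] = True
                current[i] = j
                search(i + 1, score + weights[i][j])
                used[j] = False
                current[i] = None

    search(0, 0.0)
    return best_score, best_assignment
```

For the matrix `[[10, 9], [9, 1]]`, the greedy seed scores 11 while the search finds the assignment `[1, 0]` with score 18, which shows why the exhaustive-with-pruning step is needed on top of a greedy start.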

Selective tree growing: a deterministic constant-space linear-time algorithm for pattern discovery and for computing multiple sequence alignment

Proceedings. IEEE Computer Society Bioinformatics Conference Pub Date : 2002-08-14 DOI: 10.1109/CSB.2002.1039367

Mashilamani Sambasivam

Abstract: Summary form only given. Given a set of n sequences, the multiple sequence alignment problem is to align these n sequences, with gaps or otherwise, such that the commonality of the sequences is projected appropriately. If m is the total sum of the lengths of the input sequences, A is the alphabet size of the input sequences, and P is the final number of unique patterns, fixed by the user, that cause an alignment between sequences, then the algorithm runs in O(m(A + P)) time, which is linear in the worst case. Our algorithm handles sequences with both small and large alphabet sizes A. Our algorithm forms the alignment by first discovering patterns, and thus is also a pattern discovery solution. We support our theoretical conclusions with experimental results obtained from running our algorithm on GenPept sequences and human genome sequences from the GenBank public domain database. Our algorithm uses direct n-wise alignment and constant memory space irrespective of the value of m. What differentiates this algorithm from most others is that it is deterministic; it is guaranteed and theoretically proved that all patterns of any arbitrary length that occur in at least k sequences and that are responsible for multiple sequence alignment are found by the algorithm, where k is specified by the user.

Pages: 344-

Citations: 0
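The pattern-discovery criterion in the abstract above (patterns that occur in at least k of the input sequences) can be made concrete with a naive sketch. This is not selective tree growing: it only handles patterns of one fixed length, and it uses a hash table whose size grows with the input, so it does not have the constant-space property the abstract claims for the real algorithm. The function name `shared_patterns` and its parameters are hypothetical.

```python
from collections import defaultdict


def shared_patterns(sequences, length, k):
    """Return fixed-length substrings that occur in at least k distinct sequences.

    The result maps each qualifying pattern to the sorted indices of the
    sequences that contain it.
    """
    seen_in = defaultdict(set)
    for index, seq in enumerate(sequences):
        for start in range(len(seq) - length + 1):
            seen_in[seq[start:start + length]].add(index)
    return {
        pattern: sorted(indices)
        for pattern, indices in seen_in.items()
        if len(indices) >= k
    }


sequences = ["ACGTAC", "TTACGA", "GGACGT"]
in_all_three = shared_patterns(sequences, length=3, k=3)
in_at_least_two = shared_patterns(sequences, length=3, k=2)
```

Patterns found this way could serve as anchors around which the sequences are lined up, which is the sense in which the abstract describes alignment as following from pattern discovery.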

Protein-based analysis of alternative splicing in the human genome

Proceedings. IEEE Computer Society Bioinformatics Conference Pub Date : 2002-08-14 DOI: 10.1109/CSB.2002.1039335

A. Loraine, G. Helt, M. Cline, Michael A. Siani-Rose

Citations: 8

Designing oscillators in synthetic gene networks based on multi-scale dynamics

Proceedings. IEEE Computer Society Bioinformatics Conference Pub Date : 2002-08-14 DOI: 10.1109/CSB.2002.1039359

Luonan Chen, Tetsuya J. Kobayashi, K. Aihara

Abstract: Multistability, oscillations, and switching exist at various levels of biological processes and organizations and have been investigated on the basis of many theoretical models, such as circadian oscillations with the period protein (PER) and the timeless protein (TIM) in Drosophila, and multistable dynamics regulated by transcriptional factors. Considerable experimental evidence suggests that cellular processes are intrinsically rhythmic or periodic. Various periodic oscillations with different time scales ranging from less than a second to more than a year, which may allow living organisms to adapt their behaviors to a periodically varying environment, have also been observed experimentally. On the other hand, in synthetic gene networks, both the toggle switch and the repressilator have been theoretically proposed and further confirmed by experiments. All of these works stress the importance of feedback regulation of transcriptional factors, which is key in giving rise to the oscillatory or multistable dynamical behaviors exhibited by biological genetic systems. In addition, it should be noted that many periodic behaviors do not simply oscillate smoothly; rather, they change rapidly or jump at certain states. In gene expression systems, many different time scales characterize the gene regulatory processes. For instance, the transcription and translation processes generally evolve on a time scale that is much slower than that of phosphorylation, dimerization or binding reactions of transcription factors. In genetic networks, the time scale for expression of some genes is much slower than that of others, depending on the length of the genes.

We aim to design robust periodic oscillators in synthetic gene-protein systems by simple nonlinear models and to analyze the basic mechanism of limit cycles with jumping behaviors or relaxation oscillations by exploiting multiple time-scale properties [1, 2]. We show that periodic oscillations are mainly generated by nonlinear feedback loops in gene regulatory systems and the jumping dynamics caused by time scale differences among biochemical reactions. Moreover, effects of time delay are also examined. We show that time delay generally enlarges the stability region of oscillations, thereby making the oscillations more sustainable despite parameter changes or noise [1, 2]. The dynamics of the proposed models are robust in terms of stability and period length to parameter perturbations or environment variations. Although we mainly analyze some specific models, the mechanisms identified in this work are likely to apply to a variety of genetic regulatory systems. These simple models may actually act as basic building blocks in synthetic gene-protein networks, such as genetic oscillators or switches, because the dynamics are robust to parameter perturbations or environment variations. Several examples are also provided to demonstrate implementation of synthetic oscillators by using genes of the λ phage bact…

Pages: 336-

Citations: 0
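The relaxation oscillations with jumps described in the abstract above arise whenever a fast variable is coupled to a slow one through nonlinear feedback. The sketch below illustrates that mechanism with a generic fast-slow system (the van der Pol oscillator in Liénard form), not the gene-protein models of the paper. The equations, the time-scale ratio `eps`, the step size, and the function names are all assumptions chosen for illustration; no time delay is modeled.

```python
def simulate_relaxation_oscillator(eps=0.05, dt=0.001, steps=20000, x0=0.5, y0=0.0):
    """Integrate a generic fast-slow oscillator with classical RK4.

        eps * dx/dt = x - x**3 / 3 - y   (fast variable)
              dy/dt = x                  (slow variable)

    A small eps makes x jump quickly between the two outer branches of the
    cubic curve while y drifts slowly along them.
    Returns the list of x values after each step.
    """
    def deriv(x, y):
        return (x - x ** 3 / 3.0 - y) / eps, x

    xs = []
    x, y = x0, y0
    for _ in range(steps):
        k1x, k1y = deriv(x, y)
        k2x, k2y = deriv(x + 0.5 * dt * k1x, y + 0.5 * dt * k1y)
        k3x, k3y = deriv(x + 0.5 * dt * k2x, y + 0.5 * dt * k2y)
        k4x, k4y = deriv(x + dt * k3x, y + dt * k3y)
        x += dt * (k1x + 2 * k2x + 2 * k3x + k4x) / 6.0
        y += dt * (k1y + 2 * k2y + 2 * k3y + k4y) / 6.0
        xs.append(x)
    return xs


def count_sign_changes(values):
    """Count zero crossings, a crude proxy for the number of jumps."""
    return sum(1 for a, b in zip(values, values[1:]) if a * b < 0)


trajectory = simulate_relaxation_oscillator()
jumps = count_sign_changes(trajectory)
```

Shrinking `eps` sharpens the jumps without changing the overall shape of the cycle much, which is the kind of time-scale separation the abstract attributes to slow transcription and translation versus fast binding reactions.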

Electronic polymerase chain reaction (EPCR) search algorithm

Proceedings. IEEE Computer Society Bioinformatics Conference Pub Date : 2002-08-14 DOI: 10.1109/CSB.2002.1039361

Conrad Shyu, J. Foster, L. Forney

Citations: 3

Automated identification of single nucleotide polymorphisms from sequencing data

Proceedings. IEEE Computer Society Bioinformatics Conference Pub Date : 2002-08-14 DOI: 10.1109/CSB.2002.1039332

Masazumi Takahashi, F. Matsuda, N. Margetic, M. Lathrop

Citations: 77

AxML: a fast program for sequential and parallel phylogenetic tree calculations based on the maximum likelihood method

Proceedings. IEEE Computer Society Bioinformatics Conference Pub Date : 2002-08-14 DOI: 10.1109/CSB.2002.1039325

A. Stamatakis, T. Ludwig, H. Meier, Marty J. Wolf

Citations: 41