Computational Systems Bioinformatics. CSB2003. Proceedings of the 2003 IEEE Bioinformatics Conference. CSB2003最新文献

筛选
英文 中文
Towards index-based similarity search for protein structure databases 基于索引的蛋白质结构数据库相似性搜索研究
Orhan Çamoglu, Tamer Kahveci, Ambuj K. Singh
{"title":"Towards index-based similarity search for protein structure databases","authors":"Orhan Çamoglu, Tamer Kahveci, Ambuj K. Singh","doi":"10.1109/CSB.2003.1227314","DOIUrl":"https://doi.org/10.1109/CSB.2003.1227314","url":null,"abstract":"We propose two methods for finding similarities in protein structure databases. Our techniques extract feature vectors on triplets of SSEs (secondary structure elements) of proteins. These feature vectors are then indexed using a multidimensional index structure. Our first technique considers the problem of finding proteins similar to a given query protein in a protein dataset. This technique quickly finds promising proteins using the index structure. These proteins are then aligned to the query protein using a popular pairwise alignment tool such as VAST. We also develop a novel statistical model to estimate the goodness of a match using the SSEs. Our second technique considers the problem of joining two protein datasets to find an all-to-all similarity. Experimental results show that our techniques improve the pruning time of VAST3 to 3.5 times while keeping the sensitivity similar.","PeriodicalId":147883,"journal":{"name":"Computational Systems Bioinformatics. CSB2003. Proceedings of the 2003 IEEE Bioinformatics Conference. CSB2003","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125096050","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 38
The cybertory sequence file system for managing large DNA sequences 用于管理大型DNA序列的cybertory序列文件系统
Carl E. McMillin, R. Horton
{"title":"The cybertory sequence file system for managing large DNA sequences","authors":"Carl E. McMillin, R. Horton","doi":"10.1109/CSB.2003.1227403","DOIUrl":"https://doi.org/10.1109/CSB.2003.1227403","url":null,"abstract":"The cybertory sequence file system (SFS) is a hybrid file-manager/database for storing and indexing large DNA sequences. It is accessed through custom functions from PostgreSQL to support queries combining database tables with custom DNA indexes and sequence access.","PeriodicalId":147883,"journal":{"name":"Computational Systems Bioinformatics. CSB2003. Proceedings of the 2003 IEEE Bioinformatics Conference. CSB2003","volume":"24 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122527890","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Refining the extraction of relevant documents from biomedical literature to create a corpus for pathway text mining 对生物医学文献中相关文档的提取进行细化,建立路径文本挖掘的语料库
R. Harte, Yan Lu, Stephen Osborn, David Dehoney, Daniel Chin
{"title":"Refining the extraction of relevant documents from biomedical literature to create a corpus for pathway text mining","authors":"R. Harte, Yan Lu, Stephen Osborn, David Dehoney, Daniel Chin","doi":"10.1109/CSB.2003.1227432","DOIUrl":"https://doi.org/10.1109/CSB.2003.1227432","url":null,"abstract":"For biologists to keep up with developments in their field or related fields, automation is desirable to more efficiently read and interpret a rapidly growing literature. Identification of proteins or genes and their interactions can facilitate the mapping of canonical or evolving pathways from the literature. In order to mine such data, we developed procedures and tools to pre-qualify documents for further analysis. Initially, a corpus of documents for proteins of interest was built using alternate symbols from Locuslink and the Stanford SOURCE as MEDLINE search terms. The query was refined using the optimum keywords together with MeSH terms combined in a Boolean query to minimize false positives. The document space was examined using a strategy employing; latent semantic indexing (LSI), which uses Entrez's \"related papers\" utility for MEDLINE. Documents' relationships were visualized using an undirected graph and scored by their relatedness. Distinct document clusters, formed by the most highly connected related papers, are mostly composed of abstracts relating to one aspect of research. This feature was used to filter irrelevant abstracts, which resulted in a reduction in corpus size of 10% to 30% depending on the domain. The excluded documents were examined to confirm their lack of relevance. Corpora consisted of the most relevant documents thus reducing the number of false positives and irrelevant examples in the training set for pathway mapping. Documents were tagged, using a modified version of GATE2, with terms based on GO for rule induction using RAPIER.","PeriodicalId":147883,"journal":{"name":"Computational Systems Bioinformatics. CSB2003. Proceedings of the 2003 IEEE Bioinformatics Conference. CSB2003","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131173933","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Computational simulation of lipid bilayer reorientation at gaps 间隙处脂质双分子层重定向的计算模拟
P. Kasson, V. Pande
{"title":"Computational simulation of lipid bilayer reorientation at gaps","authors":"P. Kasson, V. Pande","doi":"10.1109/CSB.2003.1227374","DOIUrl":"https://doi.org/10.1109/CSB.2003.1227374","url":null,"abstract":"Understanding cellular membrane processes is critical for the study of events such as viral entry, neurotransmitter exocytosis, and immune activation. Supported lipid bilayers serve as a model system for many membrane processes. Despite the relative simplicity of this system, many important structural and dynamic parameters are not experimentally observable with current techniques. Computational approaches allow the development of a high-resolution model of bilayer processes. We have performed molecular dynamics simulations of phospholipid bilayers to model the creation of bilayer gaps and to analyze their structure and dynamics. Our simulations show rapid reorientation and movement of phospholipids near simulated bilayer edges. These data suggest that lipids may undergo rapid local rearrangements during membrane fusion, facilitating formation of fusion intermediates thought key to the infection cycle of viruses such as influenza and HIV.","PeriodicalId":147883,"journal":{"name":"Computational Systems Bioinformatics. CSB2003. Proceedings of the 2003 IEEE Bioinformatics Conference. CSB2003","volume":"127 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131366551","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MedfoLink: bridging the gap between IT and the medical community MedfoLink:弥合IT和医学界之间的差距
J. Gerrein, A. Kherlopian, Vezen Wu, Mitchell Berman, A. Wald
{"title":"MedfoLink: bridging the gap between IT and the medical community","authors":"J. Gerrein, A. Kherlopian, Vezen Wu, Mitchell Berman, A. Wald","doi":"10.1109/CSB.2003.1227436","DOIUrl":"https://doi.org/10.1109/CSB.2003.1227436","url":null,"abstract":"MedfoLink is a new software technology that applies novel design features to help solve issues regarding medical records processing that are part of the national agenda. This software overcomes the vocabulary and performance limitations of existing medical language processing technologies by employing high speed databases, Java, and the UMLS (unified medical language source). MedfoLink is a Java technology that enables a computer to accurately record and interpret data from patient records. MedfoLink's design incorporates two novel approaches: (a) intelligent algorithms that with practice improve comprehension of medical records and (b) the ALI (adaptive learning interface) architecture that enhances performance by integrating existing technologies with an object-oriented approach. Future applications of MedfoLink range from individual patient care, to clinical drug trials, to public health monitoring.","PeriodicalId":147883,"journal":{"name":"Computational Systems Bioinformatics. CSB2003. Proceedings of the 2003 IEEE Bioinformatics Conference. CSB2003","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115049000","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Genome on demand: interactive substring searching 基因组按需:交互式子串搜索
Tamer Kahveci, Ambuj K. Singh
{"title":"Genome on demand: interactive substring searching","authors":"Tamer Kahveci, Ambuj K. Singh","doi":"10.1109/CSB.2003.1227405","DOIUrl":"https://doi.org/10.1109/CSB.2003.1227405","url":null,"abstract":"We consider the problem of interactive string searching, and propose two k-NN (k-nearest neighbor) search algorithms. For a given query, our techniques start reporting the initial results quickly. Later, these results are periodically refined depending on the user satisfaction. We briefly discuss the two techniques. The first technique, called LIS (Local statistics-based Interactive Search),and the second technique, called GIS (Global Statistics-based Interactive Search).","PeriodicalId":147883,"journal":{"name":"Computational Systems Bioinformatics. CSB2003. Proceedings of the 2003 IEEE Bioinformatics Conference. CSB2003","volume":"18 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125552030","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
SMASHing regulatory sites in DNA by human-mouse sequence comparisons 通过人鼠序列比较粉碎DNA中的调控位点
M. Zavolan, N. Socci, N. Rajewsky, T. Gaasterland
{"title":"SMASHing regulatory sites in DNA by human-mouse sequence comparisons","authors":"M. Zavolan, N. Socci, N. Rajewsky, T. Gaasterland","doi":"10.1109/CSB.2003.1227328","DOIUrl":"https://doi.org/10.1109/CSB.2003.1227328","url":null,"abstract":"Regulatory sequence elements provide important clues to understanding and predicting gene expression. Although the binding sites for hundreds of transcription factors are known, there has been no systematic attempt to incorporate this information in the annotation of the human genome. Cross species sequence comparisons are critical to a meaningful annotation of regulatory elements since they generally reside in conserved noncoding regions. To take advantage of the recently completed drafts of the mouse and human genomes for annotating transcription factor binding sites, we developed SMASH, a computational pipeline that identifies thousands of orthologous human/mouse proteins, maps them to genomic sequences, extracts and compares upstream regions and annotates putative regulatory elements in conserved, noncoding, upstream regions. Our current dataset consists of approximately 2500 human/mouse gene pairs. Transcription start sites were estimated by mapping quasifull length cDNA sequences. SMASH uses a novel probabilistic method to identify putative conserved binding sites that takes into account the competition between transcription factors for binding DNA. SMASH presents the results via a genome browser web interface which displays the predicted regulatory information together with the current annotations for the human genome. Our results are validated by comparison to previously published experimental data. SMASH results compare favorably to other existing computational approaches.","PeriodicalId":147883,"journal":{"name":"Computational Systems Bioinformatics. CSB2003. Proceedings of the 2003 IEEE Bioinformatics Conference. CSB2003","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130258277","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 19
Initial large-scale exploration of protein-protein interactions in human brain 对人脑中蛋白质-蛋白质相互作用的初步大规模探索
J. Chen, A. Sivachenko, R. Bell, C. Kurschner, I. Ota, S. Sahasrabudhe
{"title":"Initial large-scale exploration of protein-protein interactions in human brain","authors":"J. Chen, A. Sivachenko, R. Bell, C. Kurschner, I. Ota, S. Sahasrabudhe","doi":"10.1109/CSB.2003.1227322","DOIUrl":"https://doi.org/10.1109/CSB.2003.1227322","url":null,"abstract":"Study of protein interaction networks is crucial to post-genomic systems biology. Aided by high-throughput screening technologies, biologists are rapidly accumulating protein-protein interaction data. Using a random yeast two-hybrid (R2H) process, we have performed large-scale yeast two-hybrid searches with approximately fifty thousand random human brain cDNA bait fragments against a human brain cDNA prey fragment library. From these searches, we have identified 13,656 unique protein-protein interaction pairs involving 4,473 distinct known human loci. In this paper, we have performed our initial characterization of the protein interaction network in human brain tissue. We have classified and characterized all identified interactions based on gene ontology (GO) annotation of interacting loci. We have also described the \"scale-free\" topological structure of the network.","PeriodicalId":147883,"journal":{"name":"Computational Systems Bioinformatics. CSB2003. Proceedings of the 2003 IEEE Bioinformatics Conference. CSB2003","volume":"98 4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116367123","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
Local similarity in RNA secondary structures RNA二级结构的局部相似性
M. Höchsmann, Thomas Töller, R. Giegerich, S. Kurtz
{"title":"Local similarity in RNA secondary structures","authors":"M. Höchsmann, Thomas Töller, R. Giegerich, S. Kurtz","doi":"10.1109/CSB.2003.1227315","DOIUrl":"https://doi.org/10.1109/CSB.2003.1227315","url":null,"abstract":"We present a systematic treatment of alignment distance and local similarity algorithms on trees and forests. We build upon the tree alignment algorithm for ordered trees given by Jiang et. al (1995) and extend it to calculate local forest alignments, which is essential for finding local similar regions in RNA secondary structures. The time complexity of our algorithm is O(/F/sub 1///spl middot//F/sub 2//)/spl middot/deg(F/sub 1/)/spl middot/deg(F/sub 2/)/spl middot/(deg(F/sub 1/)+deg(F/sub 2/)) where /Fi/ is the number of nodes in forest Fi and deg(Fi) is the degree of Fi. We provide carefully engineered dynamic programming implementations using dense, two-dimensional tables which considerably reduces the space requirement. We suggest a new representation of RNA secondary structures as forests that allow reasonable scoring of edit operations on RNA secondary structures. The comparison of RNA secondary structures is facilitated by a new visualization technique for RNA secondary structure alignments. Finally, we show how potential regulatory motifs can be discovered solely by their structural preservation, and independent of their sequence conservation and position.","PeriodicalId":147883,"journal":{"name":"Computational Systems Bioinformatics. CSB2003. Proceedings of the 2003 IEEE Bioinformatics Conference. CSB2003","volume":"375 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129087835","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 271
Representing and reasoning about signal networks: an illustration using NF/spl kappa/B dependent signaling pathways 信号网络的表示和推理:使用NF/spl kappa/B依赖的信号通路的说明
Chitta Baral, K. Chancellor, Tran Hoai Nam, Nhan Tran
{"title":"Representing and reasoning about signal networks: an illustration using NF/spl kappa/B dependent signaling pathways","authors":"Chitta Baral, K. Chancellor, Tran Hoai Nam, Nhan Tran","doi":"10.1109/CSB.2003.1227427","DOIUrl":"https://doi.org/10.1109/CSB.2003.1227427","url":null,"abstract":"We propose a formal language to represent and reason about signal transduction networks. The existing approaches such as ones based on Petri nets, and /spl pi/-calculus fall short in many ways and our work suggests that an artificial intelligence (AI) based approach may be well suited for many aspects. We apply a form of action language to represent and reason about NF/spl kappa/B dependent signaling pathways. Our language supports several essential features of reasoning with signal transduction knowledge, such as: reasoning with partial (or incomplete) knowledge, and reasoning about triggered evolutions of the world and elaboration tolerance. Because of its growing important role in cellular functions, we select NF/spl kappa/B dependent signaling to be our test bed. NF/spl kappa/B is a central mediator of the immune response, and it can regulate stress responses, as well as cell death/survival in several cell types. While many extracellular signals may lead to the activation of NF/spl kappa/B, few related pathways are elucidated. We study the tasks of representation of pathways, reasoning with pathways, explaining observations, and planning to alter the outcomes; and show that all of them can be well formulated in our framework. Thus our work shows that our AI based approach is a good candidate for feasible and practical representation of and reasoning about signal networks.","PeriodicalId":147883,"journal":{"name":"Computational Systems Bioinformatics. CSB2003. Proceedings of the 2003 IEEE Bioinformatics Conference. CSB2003","volume":"28 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130622260","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信