2005 IEEE Computational Systems Bioinformatics Conference - Workshops (CSBW'05)最新文献

筛选
英文 中文
Patterns of gene deletion following genome duplication in yeast 酵母基因组复制后基因缺失的模式
J. Byrnes, Wen-Hsiung Li
{"title":"Patterns of gene deletion following genome duplication in yeast","authors":"J. Byrnes, Wen-Hsiung Li","doi":"10.1109/CSBW.2005.104","DOIUrl":"https://doi.org/10.1109/CSBW.2005.104","url":null,"abstract":"Whole genome duplication (WGD) is followed by massive duplicate deletion that reorganizes gene adjacencies. We compare the deletion patterns and adjacency reorganization following WGD in yeast with simulations. We find that deletion events alternate between paralogous chromosomes more often than expected under a random duplicate deletion model.","PeriodicalId":123531,"journal":{"name":"2005 IEEE Computational Systems Bioinformatics Conference - Workshops (CSBW'05)","volume":"41 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133006375","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Current challenges in bioimage database design 生物图像数据库设计面临的挑战
Ambuj K. Singh, Arnab Bhattacharya, Vebjorn Ljosa
{"title":"Current challenges in bioimage database design","authors":"Ambuj K. Singh, Arnab Bhattacharya, Vebjorn Ljosa","doi":"10.1109/CSBW.2005.47","DOIUrl":"https://doi.org/10.1109/CSBW.2005.47","url":null,"abstract":"Information technology research has played a significant role in the high-throughput acquisition and analysis of biological information. The tremendous amount of information gathered from genomics in the past decade is being complemented by knowledge from comprehensive, systematic studies of the properties and behaviors of all proteins and other biomolecules. Understanding complex systems such as the nervous system requires the high-resolution imaging of molecules and cells and the analysis of these images in order to understand how distribution patterns (e.g., the localization of specific neuron types within a region of the central nervous system, or the localization of molecules at the subcellular level) change in response to stress, injury, aging, and disease. We discuss two kinds of bioimage data: retinal images and microtubule images. We argue that supporting effective access to them requires new database techniques for description of probabilistic and interpreted data, and analysis of spatial and temporal information. The developed techniques are being implemented in a publicly available bioimage database.","PeriodicalId":123531,"journal":{"name":"2005 IEEE Computational Systems Bioinformatics Conference - Workshops (CSBW'05)","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131349244","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Which normalization method is best? A platform-independent biologically inspired quantitative comparison of normalization methods 哪种归一化方法是最好的?一个独立于平台的生物学启发的标准化方法的定量比较
E. Someren, M. Reinders
{"title":"Which normalization method is best? A platform-independent biologically inspired quantitative comparison of normalization methods","authors":"E. Someren, M. Reinders","doi":"10.1109/CSBW.2005.142","DOIUrl":"https://doi.org/10.1109/CSBW.2005.142","url":null,"abstract":"Since the introduction of microarray technology, several different normalization techniques have been introduced, but it is still unclear which normalization method is best. We present the first comparative study of normalization methods for both cDNA as well as oligonucleotide arrays that is based on their overall performance on five complementary performance measures. The presented comparison is unique in that it 1) compares normalization methods with very different outcomes, 2) is applied to two different array platforms, 3) introduces several different (biologically inspired) performance measures and 4) can be applied to any data set. The results show amongst others that, for cDNA arrays, the well-established lowest-compensation of logratio is not biologically beneficial and that a novel ratio-based normalization (without logarithm) performs best overall. For Affymetrix arrays, we found that Rosetta's Experiment Builder is generally to be preferred.","PeriodicalId":123531,"journal":{"name":"2005 IEEE Computational Systems Bioinformatics Conference - Workshops (CSBW'05)","volume":"51 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124958979","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Structural genomics analysis of alternative splicing and its application in modeling structures of alternatively spliced variants 交替剪接的结构基因组学分析及其在交替剪接变异体结构建模中的应用
Peng Wang, Bo Yan, Jun-tao Guo, C. Hicks, Ying Xu
{"title":"Structural genomics analysis of alternative splicing and its application in modeling structures of alternatively spliced variants","authors":"Peng Wang, Bo Yan, Jun-tao Guo, C. Hicks, Ying Xu","doi":"10.1109/CSBW.2005.129","DOIUrl":"https://doi.org/10.1109/CSBW.2005.129","url":null,"abstract":"Ii this paper we carry out a structural genomics analysis of known alternative splicing events and show that threading is a valid approach to model structures of alternatively spliced variants. We collect 3-D structures of proteins with know translated splicing products from PDB and further expand the dataset with high quality models generated with threading approach. Our analysis shows that splicing events have a strong preference for non-regular secondary structure elements and tend to avoid buried residues. Those observations suggest evolutionary constrains exist for locations of splicing in the context of 3-D environment. We then show that majority of substitutions in splicing events share high structural similarity and splicing events also tend to remove entire domain and avoid exposing hydrophobic cores when part of a domain was removed. Those observations support the notion that majority of splicing isoforms adopt same fold as full-length protein despite sequence substitutions and deletions. This principle was then utilized to generate high quality structures of splicing variants that could be a valuable resource for studying their structures and functions and may provide new insights into pathogenesis of related diseases.","PeriodicalId":123531,"journal":{"name":"2005 IEEE Computational Systems Bioinformatics Conference - Workshops (CSBW'05)","volume":"150 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122279921","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
cis-Regulatory element prediction in mammalian genomes 哺乳动物基因组的顺式调控元件预测
A. Siddiqui, Gordon Robertson, M. Bilenky, T. Astakhova, O. Griffith, M. Hassel, Keven Lin, S. Montgomery, M. Oveisi, E. Pleasance, Neil Robertson, M. Sleumer, Kevin Teague, R. Varhol, Maggie Zhang, Steven J. M. Jones
{"title":"cis-Regulatory element prediction in mammalian genomes","authors":"A. Siddiqui, Gordon Robertson, M. Bilenky, T. Astakhova, O. Griffith, M. Hassel, Keven Lin, S. Montgomery, M. Oveisi, E. Pleasance, Neil Robertson, M. Sleumer, Kevin Teague, R. Varhol, Maggie Zhang, Steven J. M. Jones","doi":"10.1109/CSBW.2005.35","DOIUrl":"https://doi.org/10.1109/CSBW.2005.35","url":null,"abstract":"The identification of cis-regulatory elements and modules is an important step in understanding the regulation of genes. We have developed a pipeline capable of running multiple motif prediction methods on a whole genome scale. Using gene expression datasets to identify co-expressed genes and the Ensemhl Compara database orthologues, we assemble input sequence sets comprised of the upstream regions of a target gene, its orthologues and co-expressed genes on the premise that such genes will share promoters by evolution (orthologues) or share regulatory control mechanisms (co-expressed genes). Co-expressed genes are identified by an approach that combines Pearson distances from multiple gene expression datasets derived from multiple experimental approaches and calibrated against the GO database. Our pipeline runs a number of established motif detection algorithms with a range of parameter settings on the input dataset. We integrate the diverse result sets by scoring motifs with a method-independent function. For each target gene, we assign p-values to the motif score by running the discovery pipeline on multiple sets of input sequence containing the target gene, non-coexpressed genes and \"Jake\" orthologues generated by neutral numerical evolution. We have predicted 30,636 motif binding sites in human for 4,182 genes and an initial set of 472 motif binding sites in mouse for 92 genes with p<0.001. The positive predictive value against a library of biologically confirmed regulatory sites approaches 0.4 at the highest p-value threshold. Predicted regulatory elements and other resources from the project are available at www.cisred.org.","PeriodicalId":123531,"journal":{"name":"2005 IEEE Computational Systems Bioinformatics Conference - Workshops (CSBW'05)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126629240","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Adapting support vector machines to predict translation initiation sites in the human genome 应用支持向量机预测人类基因组翻译起始位点
R. Akbani, Stephen Kwek
{"title":"Adapting support vector machines to predict translation initiation sites in the human genome","authors":"R. Akbani, Stephen Kwek","doi":"10.1109/CSBW.2005.18","DOIUrl":"https://doi.org/10.1109/CSBW.2005.18","url":null,"abstract":"This study is concerned with predicting translation initiation sites (TIS) in the human genome that start with the nucleotide sequence ATG. This sequence occurs 104 million times in the entire genome. However, current estimates predict that there are only about 30,000 or so TIS in the human genome, giving an imbalance ratio of about 1:3500 for TIS ATG vs. non-TIS ATG sites. Algorithms that are designed using datasets that have low imbalance ratio may not be well suited to predict TIS at the genomic level. In this paper, we modified the SVM algorithm that can handle moderately high imbalance ratio. The F-measures for other approaches were: linear discriminant 0%, SVM with under-sampling 4.1%, SVM with over-sampling 8.2%, neural network 13.3%, decision tree 20%, our approach 44%. This shows how poorly standard approaches perform at the genomic level due to the high imbalance ratio. Our approach improves the performance significantly.","PeriodicalId":123531,"journal":{"name":"2005 IEEE Computational Systems Bioinformatics Conference - Workshops (CSBW'05)","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126725203","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Multivariate gene selection: does it help? 多变量基因选择:有帮助吗?
Carmen Lai, M. Reinders
{"title":"Multivariate gene selection: does it help?","authors":"Carmen Lai, M. Reinders","doi":"10.1109/CSBW.2005.95","DOIUrl":"https://doi.org/10.1109/CSBW.2005.95","url":null,"abstract":"When building predictors of disease state based on gene expression data, gene selection is performed in order to achieve a good performance and to identify a relevant subset of genes. Although several gene selection algorithms have been proposed, a fair comparison of the available results is very problematic. This mainly stems from two factors. First, the results are often biased, since the test set is in one way or another involved in training the predictor, resulting in optimistically biased performance estimates. Second, the published results are often based on a small number of relatively simple datasets. Therefore, no general applicative conclusions can be drawn. We therefore adopted an unbiased protocol to perform a fair comparison of state of the art multivariate and univariate gene selection techniques, in combination with a range of classifiers. Our conclusions are based on seven gene expression datasets, across many cancer types. Surprisingly, we could not detect any significant improvement of multivariate feature selection techniques over univariate approaches. We speculate on the possible causes of this finding, ranging from the small sample size problem to the particular nature of the multivariate gene dependencies.","PeriodicalId":123531,"journal":{"name":"2005 IEEE Computational Systems Bioinformatics Conference - Workshops (CSBW'05)","volume":"78 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128310727","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Large-scale drug function prediction by integrating QIS D/sup 2/ and biospice 整合QIS D/sup 2/和生物香料的大规模药物功能预测
Ying Zhao, Charles C. Zhou, I. Oglesby, Cliff Zhou
{"title":"Large-scale drug function prediction by integrating QIS D/sup 2/ and biospice","authors":"Ying Zhao, Charles C. Zhou, I. Oglesby, Cliff Zhou","doi":"10.1109/CSBW.2005.84","DOIUrl":"https://doi.org/10.1109/CSBW.2005.84","url":null,"abstract":"Quantum Intelligence System for Drug Discovery (QIS D/sup 2/) is a unique adaptive learning system designed to predict potential large-scale drug characteristics such as toxicity and efficacy. BioSpice is a set of software tools designed to represent and simulate cellular processes funded by DARPA. We show a QIS D/sup 2/ model is successfully trained, tested and validated on experimental data sets for predicting the potential in vivo effects of drug molecules in biological systems. QIS D/sup 2/ is interoperable with BioSpice. The workflow and visualization are built-in capabilities for easy-of-use. The integration of QIS D/sup 2/ and BioSpice draw on diversified technologies to deliver unique benefits for simulation and screening of potential drugs and their targets. We show that our approach leverages both structured and unstructured bioinformatics databases such as BioWarehouse and GeneWays in BioSpice to greatly enhance a QIS D/sup 2/ model. We show QIS D/sup 2/ models data from seven sources for 37,330 chemicals, performs an automatic sequence clustering using 1234 structure fragments, and accurately predict 1829 targets simultaneously.","PeriodicalId":123531,"journal":{"name":"2005 IEEE Computational Systems Bioinformatics Conference - Workshops (CSBW'05)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121742908","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Automatic protein function annotation through candidate ortholog clusters from incomplete genomes 通过来自不完整基因组的候选同源簇自动注释蛋白质功能
A. Vashist, C. Kulikowski, I. Muchnik
{"title":"Automatic protein function annotation through candidate ortholog clusters from incomplete genomes","authors":"A. Vashist, C. Kulikowski, I. Muchnik","doi":"10.1109/CSBW.2005.27","DOIUrl":"https://doi.org/10.1109/CSBW.2005.27","url":null,"abstract":"Annotation of protein function often arises in the context of partially complete genomes but is not adequately addressed. We present an annotation method by extracting ortholog clusters from incomplete genomes that are evolutionary closely related to the genome of interest. To construct clusters, our method focuses on sequence similarities across genomes rather than similarities between sequences within a genome. We use the quasi-concave set function optimization for extracting the ortholog clusters as extreme groups of sequences such that similarity of the least similar sequence in this group is maximum. A protein sequence is annotated with the ortholog cluster whose average similarity is highest. We have applied this method for annotating the Rice proteome based on clusters constructed on four partially complete cereal proteomes and the complete proteome from Arabidopsis.","PeriodicalId":123531,"journal":{"name":"2005 IEEE Computational Systems Bioinformatics Conference - Workshops (CSBW'05)","volume":"87 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123026662","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Multispectral multidimensional multiplexed data: the more, the merrier 多光谱多维复用数据:越多越好
R. Levenson, C. Hoyt, J. Mansfield, K. Gossage
{"title":"Multispectral multidimensional multiplexed data: the more, the merrier","authors":"R. Levenson, C. Hoyt, J. Mansfield, K. Gossage","doi":"10.1109/CSBW.2005.94","DOIUrl":"https://doi.org/10.1109/CSBW.2005.94","url":null,"abstract":"The ability to detect multiple molecular species at once is becoming increasingly important. Multispectral imaging systems can be used to capture multiplexed molecular signals, and can be applied to the analysis of chromogenically stained slides in brightfield mode and of samples stained with a variety of light-emitting dyes (from the visible to the NIR range) in fluorescence mode. Quantum dots make a particularly good match with this imaging technology, which is also extremely helpful for the identification and elimination of interfering autofluorescence. The ability to accurately determine the spectral qualities of dyes in-situ is also valuable. Multispectral imaging has proven to be useful for multicolor FISH, for resolving multiple species of GFP with overlapping emission spectra and for resolving red/brown double-labeled histopathology stains. The uses of spectral imaging in clinical pathology are still being explored and need to be matched to appropriate software tools. Appropriately constrained linear unmixing algorithms and novel automated tools have recently been developed to provide simple, accurate analysis procedures. Conventional hematoxylin-and-eosin-or Papanicolaou-stained pathology sections can have sufficient spectral content to allow the classification of cells of different lineage or to separate normal from neoplastic cells. Analysis of such specimens may succeed using spectral \"signatures\" and simple segmentation algorithms. The rich data sets also reward the use of more advanced analysis techniques. These can include a number of approaches pioneered for remote sensing purposes, such as spectral similarity mapping, automated clustering algorithms in n dimensions, principal component analysis, as well as other more sophisticated techniques.","PeriodicalId":123531,"journal":{"name":"2005 IEEE Computational Systems Bioinformatics Conference - Workshops (CSBW'05)","volume":"51 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115040190","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信