Applied bioinformatics最新文献_第9页

Phenotype characterisation using integrated gene transcript, protein and metabolite profiling. 使用综合基因转录，蛋白质和代谢物分析表型特征。

Applied bioinformatics Pub Date : 2004-01-01 DOI: 10.2165/00822942-200403040-00002

Matej Oresic, Clary B Clish, Eugene J Davidov, Elwin Verheij, Jack Vogels, Louis M Havekes, Eric Neumann, Aram Adourian, Stephen Naylor, Jan van der Greef, Thomas Plasterer

{"title":"Phenotype characterisation using integrated gene transcript, protein and metabolite profiling.","authors":"Matej Oresic, Clary B Clish, Eugene J Davidov, Elwin Verheij, Jack Vogels, Louis M Havekes, Eric Neumann, Aram Adourian, Stephen Naylor, Jan van der Greef, Thomas Plasterer","doi":"10.2165/00822942-200403040-00002","DOIUrl":"https://doi.org/10.2165/00822942-200403040-00002","url":null,"abstract":"Multifactorial diseases present a significant challenge for functional genomics. Owing to their multiple compartmental effects and complex biomolecular activities, such diseases cannot be adequately characterised by changes in single components, nor can pathophysiological changes be understood by observing gene transcripts alone. Instead, a pattern of subtle changes is observed in multifactorial diseases across multiple tissues and organs with complex associations between corresponding gene, protein and metabolite levels. This article presents methods for exploratory and integrative analysis of pathophysiological changes at the biomolecular level. In particular, novel approaches are introduced for the following challenges: (i) data processing and analysis methods for proteomic and metabolomic data obtained by electrospray ionisation (ESI) liquid chromatography-tandem mass spectrometry (LC/MS); (ii) association analysis of integrated gene, protein and metabolite patterns that are most descriptive of pathophysiological changes; and (iii) interpretation of results obtained from association analyses in the context of known biological processes. These novel approaches are illustrated with the apolipoprotein E3-Leiden transgenic mouse model, a commonly used model of atherosclerosis. We seek to gain insight into the early responses of disease onset and progression by determining and identifying--well in advance of pathogenic manifestations of disease--the sets of gene transcripts, proteins and metabolites, along with their putative relationships in the transgenic model and associated wild-type cohort. Our results corroborate previous findings and extend predictions for three processes in atherosclerosis: aberrant lipid metabolism, inflammation, and tissue development and maintenance.","PeriodicalId":87049,"journal":{"name":"Applied bioinformatics","volume":"3 4","pages":"205-17"},"PeriodicalIF":0.0,"publicationDate":"2004-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.2165/00822942-200403040-00002","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"25118637","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 69

Representation of mutation pressure and selection pressure by PAM matrices. 用PAM矩阵表示突变压力和选择压力。

Applied bioinformatics Pub Date : 2004-01-01 DOI: 10.2165/00822942-200403010-00005

Aleksandra Nowicka, Pawel Mackiewicz, Malgorzata Dudkiewicz, Dorota Mackiewicz, Maria Kowalczuk, Joanna Banaszak, Stanislaw Cebrat, Miroslaw R Dudek

{"title":"Representation of mutation pressure and selection pressure by PAM matrices.","authors":"Aleksandra Nowicka, Pawel Mackiewicz, Malgorzata Dudkiewicz, Dorota Mackiewicz, Maria Kowalczuk, Joanna Banaszak, Stanislaw Cebrat, Miroslaw R Dudek","doi":"10.2165/00822942-200403010-00005","DOIUrl":"https://doi.org/10.2165/00822942-200403010-00005","url":null,"abstract":"This paper analyses the relationship between the mutation data matrix 1PAM/PET91, representing the effect of both mutation and selection pressures exerted on 16130 homologous proteins of different organisms, and a mutation probability matrix (1PAM/MPM) representing the effect of pure mutation pressure on protein coding of the Borrelia burgdorferi genome. The 1PAM/PMP matrix was derived with the help of computer simulations, which used empirical nucleotide substitution rates found for the B. burgdorferi genome. Here, it is shown that the frequency of amino acid occurrence is strongly related to their effective survival time. We found that the shorter the turnover time of an amino acid under pure mutation pressure, the lower its fraction in the proteins coded by the genome and the more protected by selection pressure is its position in proteins. Results of analyses suggest that during evolution the mutational pressure has been optimised to some extent to the selection requirements.","PeriodicalId":87049,"journal":{"name":"Applied bioinformatics","volume":"3 1","pages":"31-9"},"PeriodicalIF":0.0,"publicationDate":"2004-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.2165/00822942-200403010-00005","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"25739565","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

Genomic conflict settled in favour of the species rather than the gene at extreme GC percentage values. 在极端GC百分比值下，基因组冲突倾向于物种而不是基因。

Applied bioinformatics Pub Date : 2004-01-01 DOI: 10.2165/00822942-200403040-00003

Shang-Jung Lee, James R Mortimer, Donald R Forsdyke

{"title":"Genomic conflict settled in favour of the species rather than the gene at extreme GC percentage values.","authors":"Shang-Jung Lee, James R Mortimer, Donald R Forsdyke","doi":"10.2165/00822942-200403040-00003","DOIUrl":"https://doi.org/10.2165/00822942-200403040-00003","url":null,"abstract":"Wada and colleagues have shown that, whether prokaryotic or eukaryotic, each gene has a \"homostabilising propensity\" to adopt a relatively uniform GC percentage (GC%). Accordingly, each gene can be viewed as a \"microisochore\" occupying a discrete GC% niche of relatively uniform base composition amongst its fellow genes. Although first, second and third codon positions usually differ in GC%, each position tends to maintain a uniform, gene-specific GC% value. Thus, within a genome, genic GC% values can cover a wide range. This is most evident at third codon positions, which are least constrained by amino acid encoding needs. In 1991, Wada and colleagues further noted that, within a phylogenetic group, genomic GC% values can also cover a wide range. This is again most evident at third codon positions. Thus, the dispersion of GC% values among genes within a genome matches the dispersion of GC% values among genomes within a phylogenetic group. Wada described the context-independence of plots of different codon position GC% values against total GC% as a \"universal\" characteristic. Several studies relate this to recombination. We have confirmed that third codon positions usually relate more to the genes that contain them than to the species. However, in genomes with extreme GC% values (low or high), third codon positions tend to maintain a constant GC%, thus relating more to the species than to the genes that contain them. Genes in an extreme-GC% genome collectively span a smaller GC% range, and mainly rely on first and second codon positions for differentiation as \"microisochores\". Our results are consistent with the view that differences in GC% serve to recombinationally isolate both genome sectors (facilitating gene duplication) and genomes (facilitating genome duplication, e.g. speciation). In intermediate-GC% genomes, conflict between the needs of the species and the needs of individual genes within that species is minimal. However, in extreme-GC% genomes there is a conflict, which is settled in favour of the species (i.e. group selection) rather than in favour of the gene (genic selection).","PeriodicalId":87049,"journal":{"name":"Applied bioinformatics","volume":"3 4","pages":"219-28"},"PeriodicalIF":0.0,"publicationDate":"2004-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.2165/00822942-200403040-00003","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"25118638","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 12

Microarray data analysis: a hierarchical T-test to handle heteroscedasticity. 微阵列数据分析:处理异方差的分层t检验。

Applied bioinformatics Pub Date : 2004-01-01

Renée X de Menezes, Judith M Boer, Hans C van Houwelingen

引用次数: 0

MSAT: a multiple sequence alignment tool based on TOPS. MSAT:基于TOPS的多序列比对工具。

Applied bioinformatics Pub Date : 2004-01-01 DOI: 10.2165/00822942-200403020-00009

Te Ren, Mallika Veeramalai, Aik Choon Tan, David Gilbert

{"title":"MSAT: a multiple sequence alignment tool based on TOPS.","authors":"Te Ren, Mallika Veeramalai, Aik Choon Tan, David Gilbert","doi":"10.2165/00822942-200403020-00009","DOIUrl":"https://doi.org/10.2165/00822942-200403020-00009","url":null,"abstract":"This article describes the development of a new method for multiple sequence alignment based on fold-level protein structure alignments, which provides an improvement in accuracy compared with the most commonly used sequence-only-based techniques. This method integrates the widely used, progressive multiple sequence alignment approach ClustalW with the Topology of Protein Structure (TOPS) topology-based alignment algorithm. The TOPS approach produces a structural alignment for the input protein set by using a topology-based pattern discovery program, providing a set of matched sequence regions that can be used to guide a sequence alignment using ClustalW. The resulting alignments are more reliable than a sequence-only alignment, as determined by 20-fold cross-validation with a set of 106 protein examples from the CATH database, distributed in seven superfold families. The method is particularly effective for sets of proteins that have similar structures at the fold level but low sequence identity. The aim of this research is to contribute towards bridging the gap between protein sequence and structure analysis, in the hope that this can be used to assist the understanding of the relationship between sequence, structure and function. The tool is available at http://balabio.dcs.gla.ac.uk/msat/.","PeriodicalId":87049,"journal":{"name":"Applied bioinformatics","volume":"3 2-3","pages":"149-58"},"PeriodicalIF":0.0,"publicationDate":"2004-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.2165/00822942-200403020-00009","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"24941802","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 4

Challenges and opportunities for biological language modelling in biomedical high-throughput genomic and proteomic informatics. 生物医学高通量基因组学和蛋白质组学中生物语言建模的挑战和机遇。

Applied bioinformatics Pub Date : 2004-01-01 DOI: 10.2165/00822942-200403020-00001

James Lyons-Weiler

引用次数: 0

Case study: data management strategies in an integrated pathway tool. 案例研究:集成路径工具中的数据管理策略。

Applied bioinformatics Pub Date : 2004-01-01 DOI: 10.2165/00822942-200403010-00008

Sabine Bernauer, David Croft, Paul Gardina, Eric Minch, Manuel de Rinaldis, Ivayla Vatcheva

引用次数: 2

A probabilistic method to correlate ion pairs with protein thermostability. 一种将离子对与蛋白质热稳定性联系起来的概率方法。

Applied bioinformatics Pub Date : 2004-01-01 DOI: 10.2165/00822942-200403010-00004

Shir-Ly Huang, Li-Cheng Wu, Hsien-Da Huang, Han-Kuen Liang, Ming-Tat Ko, Jorng-Tzong Horng

引用次数: 5

caGEDA: a web application for the integrated analysis of global gene expression patterns in cancer. caGEDA:用于综合分析癌症中全球基因表达模式的web应用程序。

Applied bioinformatics Pub Date : 2004-01-01 DOI: 10.2165/00822942-200403010-00007

Satish Patel, James Lyons-Weiler

{"title":"caGEDA: a web application for the integrated analysis of global gene expression patterns in cancer.","authors":"Satish Patel, James Lyons-Weiler","doi":"10.2165/00822942-200403010-00007","DOIUrl":"https://doi.org/10.2165/00822942-200403010-00007","url":null,"abstract":"The explosion of microarray data from pilot studies, basic research and large-scale clinical trials requires the development of integrative computational tools that can not only analyse gene expression patterns but that can also evaluate the methods of analysis adopted and then provide a boost to post-analysis translational interpretation of those patterns. We have developed a web application called caGEDA (cancer gene expression data analyzer) that can: (1) upload gene expression profiles from cDNA or oligonucleotide microarrays; (2) conduct a diverse range of serial linear normalisations; (3) identify differentially expressed genes using a variety of tests - either threshold or permutation tests; (4) produce tables of literature references to papers reporting that specific genes (identified by accession numbers) are up- or down-regulated in specific cancers; (5) estimate the error of sample class prediction using the significant gene set for features; (6) perform low-bias and accurate validated learning using three computational validation techniques (leave-one out validation, k-fold validation, random re-sampling validation); and (7) validate a classifier with a randomly selected or user-defined validation set. Significant genes are reported in a table of links to entries in the following databases: Locus Link, Genome View, UCSC, Ensembl, UniGene, dbSNP, AmiGO and OMIM. caGEDA is seamlessly integrated via embedded forms with UCSD's (University of California at San Diego) 2HAPI server (for medical subject heading (MeSH) term exploration) and EZ-Retrieve (to identify common transcription factors located upstream of sets of genes that exhibit similar modes of differential expression). caGEDA offers a variety of previously described and novel tests for differentially expressed genes, most notably the permutation percentile separability test, which is most appropriate for identifying genes that are significantly differentially expressed in a subset of patients. caGEDA, which is open source and free to academic users, will soon be greatly enhanced by operating with the components of the National Cancer Institute's new cancer bioinformatics grid (caBIG).","PeriodicalId":87049,"journal":{"name":"Applied bioinformatics","volume":"3 1","pages":"49-62"},"PeriodicalIF":0.0,"publicationDate":"2004-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.2165/00822942-200403010-00007","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"25732257","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 89

Neural networks for protein classification. 蛋白质分类的神经网络。

Applied bioinformatics Pub Date : 2004-01-01 DOI: 10.2165/00822942-200403010-00006

Wagner Rodrigo Weinert, Heitor Silvério Lopes

引用次数: 38