C. Congdon, Charles Fizer, N. W. Smith, H. Gaskins, Joseph C. Aman, G. Nava, C. Mattingly
{"title":"Preliminary Results for GAMI: A Genetic Algorithms Approach to Motif Inference","authors":"C. Congdon, Charles Fizer, N. W. Smith, H. Gaskins, Joseph C. Aman, G. Nava, C. Mattingly","doi":"10.1109/CIBCB.2005.1594904","DOIUrl":"https://doi.org/10.1109/CIBCB.2005.1594904","url":null,"abstract":"We have developed GAMI, an approach to motif inference that uses a genetic algorithms search and is designed specifically to work with divergent species and possibly long nucleotide sequences. The system design reduces the size of the search space as compared to typical window-location approaches for motif inference. This paper describes the motivation and system design for GAMI, discusses how we have designed the search space and compares this to the search space of other approaches, and presents initial results with data from the literature and from novel tasks. GAMI is able to find a host of putative conserved patterns; possible approaches for validating the utility of the conserved regions are discussed.","PeriodicalId":330810,"journal":{"name":"2005 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131115295","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
H. Iyatomi, H. Oka, M. Hashimoto, Masaru Tanaka, K. Ogawa
{"title":"An Internet-based Melanoma Diagnostic System - Toward the Practical Application -","authors":"H. Iyatomi, H. Oka, M. Hashimoto, Masaru Tanaka, K. Ogawa","doi":"10.1109/CIBCB.2005.1594952","DOIUrl":"https://doi.org/10.1109/CIBCB.2005.1594952","url":null,"abstract":"In this paper, we report a practical application of world’s first internet-based melanoma diagnostic system. The system is now available from all over the world, 24 hours 365 days. As key components of this system, we developed a new dermatologist-like tumor area extraction algorithm and an artificial neural network (ANN) classifier. Our dermatologist-like tumor area extraction algorithm achieved superior extraction performance and the ANN classifier achieved classification accuracy of 97.3% in sensitivity and 86.1% in specificity with leave-one-out cross-validation test of 319 dermoscopy images. Our system supported SSL encrypted transaction and required only several seconds to complete a procedure. On the other hand, we developed portable skin camera as the alternative of dermoscopy and started field-application tests by distributing them for hospitals or medical universities at first setout for making the system into practical use.","PeriodicalId":330810,"journal":{"name":"2005 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115481464","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
J. Oh, Jean X. Gao, A. Nandi, Prem Gurnani, Lynne Knowles, J. Schorge, K. Rosenblatt
{"title":"Multicategory Classification using Extended SVM-RFE and Markov Blanket on SELDI-TOF Mass Spectrometry Data","authors":"J. Oh, Jean X. Gao, A. Nandi, Prem Gurnani, Lynne Knowles, J. Schorge, K. Rosenblatt","doi":"10.1109/CIBCB.2005.1594938","DOIUrl":"https://doi.org/10.1109/CIBCB.2005.1594938","url":null,"abstract":"Surface-enhanced laser desorption/ionization time-of-flight (SELDI-TOF) mass spectrometry data has been increasingly analyzed for identifying biomarkers for disease to help early detection of the disease. Recently, support vector machine (SVM) algorithm based on recursive feature elimination (RFE) was proposed to find a set of genes for cancer classification. In our study, we extend the SVM-RFE such that it can be used in the multicategory classification work using SELDI-TOF mass spectrometry data and propose a new feature selection algorithm (SVM-MB/RFE : SVM-Markov Blanket/Recursive Feature Elimination). In the preprocessing task of SVM-MB/RFE, ANOVA (Analysis of Variance) and binning methods are used for feature filtering. We demonstrate that the performance is improved through the preprocessing work. Compared with other methods such as not only SVM-RFE and Markov blanket but also PCA (Principle Components Analysis)+LDA (Linear Discriminant Analysis) and other feature selection algorithms, SVM-MB/RFE performs better than them.","PeriodicalId":330810,"journal":{"name":"2005 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124141208","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Improving Protein Secondary-Structure Prediction by Predicting Ends of Secondary-Structure Segments","authors":"U. Midic, Dunker Ak, Z. Obradovic","doi":"10.1109/CIBCB.2005.1594959","DOIUrl":"https://doi.org/10.1109/CIBCB.2005.1594959","url":null,"abstract":"Motivated by known preferences for certain amino acids in positions around a-helices, we developed neural network-based predictors of both N and C a-helix ends, which achieved about 88% accuracy. We applied a similar approach for predicting the ends of three types of secondary structure segments. The predictors for the ends of H, E and C segments were then used to create input for protein secondary-structure prediction. By incorporating this new type of input, we significantly improved the basic one-stage predictor of protein secondary structure in terms of both per-residue (Q3) accuracy (+0.8%) and segment overlap (SOV3) measure (+1.4).","PeriodicalId":330810,"journal":{"name":"2005 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127078004","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Sung-Kyu Kim, Jin-Wu Nam, Wha-Jin Lee, Byoung-Tak Zhang
{"title":"A Kernel Method for MicroRNA Target Prediction Using Sensible Data and Position-Based Features","authors":"Sung-Kyu Kim, Jin-Wu Nam, Wha-Jin Lee, Byoung-Tak Zhang","doi":"10.1109/CIBCB.2005.1594897","DOIUrl":"https://doi.org/10.1109/CIBCB.2005.1594897","url":null,"abstract":"MicroRNAs (miRNAs) are small endogenous RNAs of ~ 22nt that act as direct post-transcriptional regulators in animals and plants. MicroRNAs generally perform a function by binding to the complementary site on the 3’ untranslated region of its target gene and especially the 8mers on the 5’ part of miRNA seems important as a seed. Computational methods for miRNA target prediction have been focusing on this seed region, but recent researches revealed that the specificity of the seed region may be sharply decreased even by a point mutation. In this paper, we present a kernel method for miRNA target prediction in animals, which improves the prediction performance with biologically sensible data and position-based features reflecting the way of miRNA: mRNA pairing mechanism. In building a training dataset, we choose experimentally verified data only to improve the quality of dataset by excluding randomly synthesized one and consequently to make the result of learning valid. We use sensitivity, specificity, and area under ROC curve as performance measures of our algorithm and compare the results of various dataset configurations. The overall results were 92.1% in sensitivity, 83.3% in specificity, and 0.931 in area under ROC curve. With position-based features, an increase of 3.3% in sensitivity and 1.6% in specificity were observed.","PeriodicalId":330810,"journal":{"name":"2005 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology","volume":"105 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134211227","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Leif E. Peterson, M. Ozen, Halime Erdem, Andrew Amini, L. Gomez, C. Nelson, M. Ittmann
{"title":"Artificial Neural Network Analysis of DNA Microarray-based Prostate Cancer Recurrence","authors":"Leif E. Peterson, M. Ozen, Halime Erdem, Andrew Amini, L. Gomez, C. Nelson, M. Ittmann","doi":"10.1109/CIBCB.2005.1594929","DOIUrl":"https://doi.org/10.1109/CIBCB.2005.1594929","url":null,"abstract":"DNA microarray-based gene expression profiles have been established for a variety of adult cancers. This paper addresses application of an artificial neural network (ANN) with leave-one-out testsing and 8-fold cross-validation for analyzing DNA microarray data to identify genes predictive of recurrence after prostatectomy. Among 725 genes screened for ANN input, a 16-gene model resulted in 99-100% diagnostic sensitivity and specificity: DGCR5, FLJ10618, RIS1, PRO1855, ABCB9, AK057203, GOLGA5, HARS, AK024152, HEP27, PPIA, SNRPF, SULT1A3, SECTM1, EIF4EBP1, and S71435. Genes identified with ANN that are prognostic of prostate cancer recurrence may be either causal for prostate cancer or secondary to the disease. Nevertheless, the genes identified may be confirmed in the future to be markers of early detection and/or therapy.","PeriodicalId":330810,"journal":{"name":"2005 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology","volume":"31 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132602776","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Network Motifs, Feedback Loops and the Dynamics of Genetic Regulatory Networks","authors":"J. Hallinan, P. Jackway","doi":"10.1109/CIBCB.2005.1594903","DOIUrl":"https://doi.org/10.1109/CIBCB.2005.1594903","url":null,"abstract":"We analyse a suite of Boolean networks which have been evolved to exhibit limit cycle-type dynamics in terms of the distribution of small network motifs and feedback loops. We find that asynchronously updated Boolean networks can be evolved to exhibit fuzzy limit cycle dynamics without significant changes to the number of nodes and links in the network. Analysis of all possible triads of nodes in the networks and all feedback loops of length one to eight reveal no significant differences between the evolved and unevolved networks. We conclude that the reductionist, motif-based approach to network analysis may be inadequate to full understanding of network dynamics, and that some dynamic behaviour is an emergent property of complex networks as a whole.","PeriodicalId":330810,"journal":{"name":"2005 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology","volume":"100 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123784658","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Predicting Structural Disruption of Proteins Caused by Crossover","authors":"Denis C. Bauer, M. Bodén, R. Thier, Zheng Yuan","doi":"10.1109/CIBCB.2005.1594962","DOIUrl":"https://doi.org/10.1109/CIBCB.2005.1594962","url":null,"abstract":"We present a machine learning model that predicts a structural disruption score from a protein’s primary structure. SCHEMA was introduced by Frances Arnold and colleagues as a method for determining putative recombination sites of a protein on the basis of the full (PDB) description of its structure. The present method provides an alternative to SCHEMA that is able to determine the same score from sequence data only. Circumventing the need for resolving the full structure enables the exploration of yet unresolved and even hypothetical sequences for protein design efforts. Deriving the SCHEMA score from a primary structure is achieved using a two step approach: first predicting a secondary structure from the sequence and then predicting the SCHEMA score from the predicted secondary structure. The correlation coefficient for the prediction is 0.88 and indicates the feasibility of replacing SCHEMA with little loss of precision.","PeriodicalId":330810,"journal":{"name":"2005 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology","volume":"21 8","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121009726","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Truncated Profile Hidden Markov Models","authors":"Scott F. Smith","doi":"10.1109/CIBCB.2005.1594926","DOIUrl":"https://doi.org/10.1109/CIBCB.2005.1594926","url":null,"abstract":"The profile hidden Markov model (HMM) is a powerful method for remote homolog database search. However, evaluating the score of each database sequence against a profile HMM is computationally demanding. The computation time required for score evaluation is proportional to the number of states in the profile HMM. This paper examines whether the number of states can be truncated without reducing the ability of the HMM to find proteins containing members of a protein domain family. A genetic algorithm (GA) is presented which finds a good truncation of the HMM states. The results of using truncation on searches of the yeast, E. coli, and pig genomes for several different protein domain families is shown.","PeriodicalId":330810,"journal":{"name":"2005 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology","volume":"26 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125866253","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"System Identification and Nonlinear Factor Analysis for Discovery and Visualization of Dynamic Gene Regulatory Pathways","authors":"A. Darvish, K. Najarian, D. Jeong, W. Ribarsky","doi":"10.1109/CIBCB.2005.1594901","DOIUrl":"https://doi.org/10.1109/CIBCB.2005.1594901","url":null,"abstract":"DNA microarray time-series provide the information vital to estimate the dynamic regulatory pathways and therefore predict the dynamic interaction among genes in time. While dynamic system identification theory has been applied to many fields of study, due to some practical limitations, this theory has been widely used to analyze DNA microarray time series. In this paper, we describe some of these limitations and propose a hierarchical model utilizing nonlinear factor analysis methods to analyze time-series DNA microarray data and identify the dynamic regulatory pathways. The proposed model is applied to model the eukaryotic cell cycle process using a popular dataset of cell cycle time-series. The results indicate that the proposed method can successfully predict the dynamic pathway involved in the process.","PeriodicalId":330810,"journal":{"name":"2005 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123418942","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}