{"title":"基因家族识别网络设计","authors":"C. Wu, S. Shivakumar","doi":"10.1109/IJSIS.1998.685426","DOIUrl":null,"url":null,"abstract":"The exponential accumulation of molecular data will facilitate the discovery of new knowledge by using information embedded within families of homologous sequences. As an approach to the management and analysis of sequence data, we have developed an integrated system, termed GeneFIND (Gene Family Identification Network Design), for database searching against gene families. It provides rapid and accurate protein family identification by combining global and motif sequence similarities and incorporating ProClass family information. Multilevel filters are used, starting with the MOTIFIND neural network and BLAST search, followed by SSEARCH alignment motif pattern match, hidden Markov modeling of motifs and ClustalW motif alignment. GeneFIND has been implemented as a full-scale system for the classification of more than 1000 ProSite and 3000 PIR families. It is used to identify thousands of new family members and is well suited for genomic sequence analysis.","PeriodicalId":289764,"journal":{"name":"Proceedings. IEEE International Joint Symposia on Intelligence and Systems (Cat. No.98EX174)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1998-03-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Gene family identification network design\",\"authors\":\"C. Wu, S. Shivakumar\",\"doi\":\"10.1109/IJSIS.1998.685426\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The exponential accumulation of molecular data will facilitate the discovery of new knowledge by using information embedded within families of homologous sequences. As an approach to the management and analysis of sequence data, we have developed an integrated system, termed GeneFIND (Gene Family Identification Network Design), for database searching against gene families. It provides rapid and accurate protein family identification by combining global and motif sequence similarities and incorporating ProClass family information. Multilevel filters are used, starting with the MOTIFIND neural network and BLAST search, followed by SSEARCH alignment motif pattern match, hidden Markov modeling of motifs and ClustalW motif alignment. GeneFIND has been implemented as a full-scale system for the classification of more than 1000 ProSite and 3000 PIR families. It is used to identify thousands of new family members and is well suited for genomic sequence analysis.\",\"PeriodicalId\":289764,\"journal\":{\"name\":\"Proceedings. IEEE International Joint Symposia on Intelligence and Systems (Cat. No.98EX174)\",\"volume\":\"4 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1998-03-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings. IEEE International Joint Symposia on Intelligence and Systems (Cat. No.98EX174)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IJSIS.1998.685426\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings. IEEE International Joint Symposia on Intelligence and Systems (Cat. No.98EX174)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IJSIS.1998.685426","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
The exponential accumulation of molecular data will facilitate the discovery of new knowledge by using information embedded within families of homologous sequences. As an approach to the management and analysis of sequence data, we have developed an integrated system, termed GeneFIND (Gene Family Identification Network Design), for database searching against gene families. It provides rapid and accurate protein family identification by combining global and motif sequence similarities and incorporating ProClass family information. Multilevel filters are used, starting with the MOTIFIND neural network and BLAST search, followed by SSEARCH alignment motif pattern match, hidden Markov modeling of motifs and ClustalW motif alignment. GeneFIND has been implemented as a full-scale system for the classification of more than 1000 ProSite and 3000 PIR families. It is used to identify thousands of new family members and is well suited for genomic sequence analysis.