基因家族识别网络设计

C. Wu, S. Shivakumar
{"title":"基因家族识别网络设计","authors":"C. Wu, S. Shivakumar","doi":"10.1109/IJSIS.1998.685426","DOIUrl":null,"url":null,"abstract":"The exponential accumulation of molecular data will facilitate the discovery of new knowledge by using information embedded within families of homologous sequences. As an approach to the management and analysis of sequence data, we have developed an integrated system, termed GeneFIND (Gene Family Identification Network Design), for database searching against gene families. It provides rapid and accurate protein family identification by combining global and motif sequence similarities and incorporating ProClass family information. Multilevel filters are used, starting with the MOTIFIND neural network and BLAST search, followed by SSEARCH alignment motif pattern match, hidden Markov modeling of motifs and ClustalW motif alignment. GeneFIND has been implemented as a full-scale system for the classification of more than 1000 ProSite and 3000 PIR families. It is used to identify thousands of new family members and is well suited for genomic sequence analysis.","PeriodicalId":289764,"journal":{"name":"Proceedings. IEEE International Joint Symposia on Intelligence and Systems (Cat. No.98EX174)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1998-03-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Gene family identification network design\",\"authors\":\"C. Wu, S. Shivakumar\",\"doi\":\"10.1109/IJSIS.1998.685426\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The exponential accumulation of molecular data will facilitate the discovery of new knowledge by using information embedded within families of homologous sequences. As an approach to the management and analysis of sequence data, we have developed an integrated system, termed GeneFIND (Gene Family Identification Network Design), for database searching against gene families. It provides rapid and accurate protein family identification by combining global and motif sequence similarities and incorporating ProClass family information. Multilevel filters are used, starting with the MOTIFIND neural network and BLAST search, followed by SSEARCH alignment motif pattern match, hidden Markov modeling of motifs and ClustalW motif alignment. GeneFIND has been implemented as a full-scale system for the classification of more than 1000 ProSite and 3000 PIR families. It is used to identify thousands of new family members and is well suited for genomic sequence analysis.\",\"PeriodicalId\":289764,\"journal\":{\"name\":\"Proceedings. IEEE International Joint Symposia on Intelligence and Systems (Cat. No.98EX174)\",\"volume\":\"4 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1998-03-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings. IEEE International Joint Symposia on Intelligence and Systems (Cat. No.98EX174)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IJSIS.1998.685426\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings. IEEE International Joint Symposia on Intelligence and Systems (Cat. No.98EX174)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IJSIS.1998.685426","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

分子数据的指数积累将有助于利用嵌入在同源序列家族中的信息发现新知识。作为一种管理和分析序列数据的方法,我们开发了一个集成的系统,称为GeneFIND(基因家族识别网络设计),用于基因家族的数据库搜索。它通过结合全局和基序序列相似性以及结合ProClass家族信息,提供快速准确的蛋白质家族鉴定。采用多级过滤,从MOTIFIND神经网络和BLAST搜索开始,然后是SSEARCH对齐基序模式匹配,基序的隐马尔可夫建模和ClustalW基序对齐。GeneFIND已经作为一个全面的系统实施,用于分类超过1000个ProSite和3000个PIR家族。它被用来识别成千上万的新家庭成员,非常适合基因组序列分析。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Gene family identification network design
The exponential accumulation of molecular data will facilitate the discovery of new knowledge by using information embedded within families of homologous sequences. As an approach to the management and analysis of sequence data, we have developed an integrated system, termed GeneFIND (Gene Family Identification Network Design), for database searching against gene families. It provides rapid and accurate protein family identification by combining global and motif sequence similarities and incorporating ProClass family information. Multilevel filters are used, starting with the MOTIFIND neural network and BLAST search, followed by SSEARCH alignment motif pattern match, hidden Markov modeling of motifs and ClustalW motif alignment. GeneFIND has been implemented as a full-scale system for the classification of more than 1000 ProSite and 3000 PIR families. It is used to identify thousands of new family members and is well suited for genomic sequence analysis.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信