{"title":"一种新的基于混合概率的蛋白质和蛋白质修饰识别方法","authors":"Penghao Wang, Susan R. Wilson","doi":"10.1109/CIBCB.2013.6595381","DOIUrl":null,"url":null,"abstract":"Tandem mass spectrometry is a powerful tool for studying proteins and protein post-translational modifications. However, typically less than half of the proteins in a complex sample can be successfully identified. The low identification coverage is largely due to the presence of various protein modifications, which usually lead to incorrect protein identifications by existing methods. Therefore, how to effectively detect protein modifications simultaneously with protein identification is crucial for improving the identification coverage and accuracy. We have developed a new hybrid probability-based protein identification method to address this issue. Our method applies a new two-stage algorithmic framework that incorporates (i) spectra library searching and (ii) a more sophisticated scoring model. In the first stage, fast spectra library searching and simplified database searching are utilised to determine a reduced search space, which in the second stage is comprehensively explored to find the most likely protein and its modifications. Evaluated on large public datasets, our method is shown to identify more proteins and protein modifications than other popular protein identification engines.","PeriodicalId":350407,"journal":{"name":"2013 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology (CIBCB)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"A new hybrid probability-based method for identifying proteins and protein modifications\",\"authors\":\"Penghao Wang, Susan R. Wilson\",\"doi\":\"10.1109/CIBCB.2013.6595381\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Tandem mass spectrometry is a powerful tool for studying proteins and protein post-translational modifications. However, typically less than half of the proteins in a complex sample can be successfully identified. The low identification coverage is largely due to the presence of various protein modifications, which usually lead to incorrect protein identifications by existing methods. Therefore, how to effectively detect protein modifications simultaneously with protein identification is crucial for improving the identification coverage and accuracy. We have developed a new hybrid probability-based protein identification method to address this issue. Our method applies a new two-stage algorithmic framework that incorporates (i) spectra library searching and (ii) a more sophisticated scoring model. In the first stage, fast spectra library searching and simplified database searching are utilised to determine a reduced search space, which in the second stage is comprehensively explored to find the most likely protein and its modifications. Evaluated on large public datasets, our method is shown to identify more proteins and protein modifications than other popular protein identification engines.\",\"PeriodicalId\":350407,\"journal\":{\"name\":\"2013 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology (CIBCB)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2013-04-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2013 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology (CIBCB)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CIBCB.2013.6595381\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology (CIBCB)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CIBCB.2013.6595381","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A new hybrid probability-based method for identifying proteins and protein modifications
Tandem mass spectrometry is a powerful tool for studying proteins and protein post-translational modifications. However, typically less than half of the proteins in a complex sample can be successfully identified. The low identification coverage is largely due to the presence of various protein modifications, which usually lead to incorrect protein identifications by existing methods. Therefore, how to effectively detect protein modifications simultaneously with protein identification is crucial for improving the identification coverage and accuracy. We have developed a new hybrid probability-based protein identification method to address this issue. Our method applies a new two-stage algorithmic framework that incorporates (i) spectra library searching and (ii) a more sophisticated scoring model. In the first stage, fast spectra library searching and simplified database searching are utilised to determine a reduced search space, which in the second stage is comprehensively explored to find the most likely protein and its modifications. Evaluated on large public datasets, our method is shown to identify more proteins and protein modifications than other popular protein identification engines.