利用非冗余序列的理化特征改进致敏蛋白预测

Sher Singh, Jr-Rou Chiu, Kuei-Ling Sun, E. C. Su
{"title":"利用非冗余序列的理化特征改进致敏蛋白预测","authors":"Sher Singh, Jr-Rou Chiu, Kuei-Ling Sun, E. C. Su","doi":"10.1109/ICMLC48188.2019.8949197","DOIUrl":null,"url":null,"abstract":"Despite extensive studies in allergen prediction, current approaches still have room for performance improvement and suffer from the problem of lack of interpretable biological features. Thus, developments of allergen prediction method from sequences have become highly important to facilitate in silico vaccine design. In this study, we propose a systematic approach to predict allergenic proteins by incorporating sequence and physicochemical properties in machine learning algorithms. In addition, predictive performance of previous studies could be overestimated due to high redundancy in the data sets. Therefore, we reduce sequence redundancy in the data set and experiment results show that we achieve better predictive performance when compared with other approaches. This study can help discover new prophylactic and therapeutic vaccines for diseases. Moreover, we analyze immunological features that can provide valuable insights into immunotherapies of allergy and autoimmune diseases in translational bioinformatics.","PeriodicalId":221349,"journal":{"name":"2019 International Conference on Machine Learning and Cybernetics (ICMLC)","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Improving Allergenic Protein Prediction Using Physicochemical Features on Non-Redundant Sequences\",\"authors\":\"Sher Singh, Jr-Rou Chiu, Kuei-Ling Sun, E. C. Su\",\"doi\":\"10.1109/ICMLC48188.2019.8949197\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Despite extensive studies in allergen prediction, current approaches still have room for performance improvement and suffer from the problem of lack of interpretable biological features. Thus, developments of allergen prediction method from sequences have become highly important to facilitate in silico vaccine design. In this study, we propose a systematic approach to predict allergenic proteins by incorporating sequence and physicochemical properties in machine learning algorithms. In addition, predictive performance of previous studies could be overestimated due to high redundancy in the data sets. Therefore, we reduce sequence redundancy in the data set and experiment results show that we achieve better predictive performance when compared with other approaches. This study can help discover new prophylactic and therapeutic vaccines for diseases. Moreover, we analyze immunological features that can provide valuable insights into immunotherapies of allergy and autoimmune diseases in translational bioinformatics.\",\"PeriodicalId\":221349,\"journal\":{\"name\":\"2019 International Conference on Machine Learning and Cybernetics (ICMLC)\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 International Conference on Machine Learning and Cybernetics (ICMLC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICMLC48188.2019.8949197\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 International Conference on Machine Learning and Cybernetics (ICMLC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICMLC48188.2019.8949197","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

尽管在过敏原预测方面进行了广泛的研究,但目前的方法仍然存在性能改进的空间,并且存在缺乏可解释的生物学特征的问题。因此,基于序列的过敏原预测方法的发展对于促进硅疫苗的设计变得非常重要。在这项研究中,我们提出了一种系统的方法,通过在机器学习算法中结合序列和物理化学性质来预测致敏蛋白。此外,由于数据集的高冗余,以往研究的预测性能可能被高估。因此,我们减少了数据集中的序列冗余,实验结果表明,与其他方法相比,我们取得了更好的预测性能。这项研究有助于发现新的预防和治疗疾病的疫苗。此外,我们分析了免疫学特征,可以为转化生物信息学中过敏和自身免疫性疾病的免疫治疗提供有价值的见解。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Improving Allergenic Protein Prediction Using Physicochemical Features on Non-Redundant Sequences
Despite extensive studies in allergen prediction, current approaches still have room for performance improvement and suffer from the problem of lack of interpretable biological features. Thus, developments of allergen prediction method from sequences have become highly important to facilitate in silico vaccine design. In this study, we propose a systematic approach to predict allergenic proteins by incorporating sequence and physicochemical properties in machine learning algorithms. In addition, predictive performance of previous studies could be overestimated due to high redundancy in the data sets. Therefore, we reduce sequence redundancy in the data set and experiment results show that we achieve better predictive performance when compared with other approaches. This study can help discover new prophylactic and therapeutic vaccines for diseases. Moreover, we analyze immunological features that can provide valuable insights into immunotherapies of allergy and autoimmune diseases in translational bioinformatics.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信