199: Blossom AI:一个基于随机森林算法预测多重蛋白相互作用复合物热点的新型药物发现应用程序

Stephanie Zhang, Minsoo Kang
{"title":"199: Blossom AI:一个基于随机森林算法预测多重蛋白相互作用复合物热点的新型药物发现应用程序","authors":"Stephanie Zhang, Minsoo Kang","doi":"10.1158/1538-7445.AM2021-199","DOIUrl":null,"url":null,"abstract":"Protein protein interactions (PPIs) form the backbone of signal transduction pathways in diverse physiological processes, mediating the transmission and regulation of oncogenic signals essential to cellular proliferation and survival, thus representing a potential new class of drug targets for anticancer therapeutic discovery. However, several challenges face the targeting of PPIs, including large PPI interface areas, a lack of deep pockets, the presence of noncontiguous binding sites, and a general lack of natural ligands. The presence of hot spots (small subsets of amino acid residues that contribute significantly to free binding energy) makes PPIs amenable to small molecule perturbations, playing essential roles in the stability of protein binding. Effectively identifying which specific interface residues of protein protein complexes form the hot spots is critical for understanding the principles of protein interactions and has broad application prospects in protein design and drug development. This project presents Blossom AI, a novel, user friendly mobile app developed in XCode and CoreML that uses random forest decision tree algorithms (RF) to computationally predict the presence of hotspots on protein complexes within seconds, aiding the design of small molecule and peptide drugs that target protein-protein interactions, particularly for anticancer therapy. Leveraging features such as solvent accessible surface area (ASA), blocks substitution matrix, physicochemical properties (hydrophobicity, polarity, polarizability, propensities), position specific scoring matrix (PSSM) and solvent exposure, the RF is trained through a dataset of 313 mutated interface residues (133 hotspot residues and 180 non hotspot residues) from over 60 protein complexes to produce a training accuracy of 88.75%, validation accuracy of 92.86%, specificity of 87.18%, sensitivity of 75.38%, PPV 94.23%, NPV 86.61%. Blossom is high speed, low cost, and user friendly with significantly improved accuracy over the standard of alanine scanning mutagenesis. Citation Format: Stephanie Zhang, Minsoo Kang. Blossom AI: A novel drug discovery app for the prediction of hotspots on multiplex protein protein interaction complexes using random forest algorithms [abstract]. In: Proceedings of the American Association for Cancer Research Annual Meeting 2021; 2021 Apr 10-15 and May 17-21. Philadelphia (PA): AACR; Cancer Res 2021;81(13_Suppl):Abstract nr 199.","PeriodicalId":73617,"journal":{"name":"Journal of bioinformatics and systems biology : Open access","volume":"37 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2021-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Abstract 199: Blossom AI: A novel drug discovery app for the prediction of hotspots on multiplex protein protein interaction complexes using random forest algorithms\",\"authors\":\"Stephanie Zhang, Minsoo Kang\",\"doi\":\"10.1158/1538-7445.AM2021-199\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Protein protein interactions (PPIs) form the backbone of signal transduction pathways in diverse physiological processes, mediating the transmission and regulation of oncogenic signals essential to cellular proliferation and survival, thus representing a potential new class of drug targets for anticancer therapeutic discovery. However, several challenges face the targeting of PPIs, including large PPI interface areas, a lack of deep pockets, the presence of noncontiguous binding sites, and a general lack of natural ligands. The presence of hot spots (small subsets of amino acid residues that contribute significantly to free binding energy) makes PPIs amenable to small molecule perturbations, playing essential roles in the stability of protein binding. Effectively identifying which specific interface residues of protein protein complexes form the hot spots is critical for understanding the principles of protein interactions and has broad application prospects in protein design and drug development. This project presents Blossom AI, a novel, user friendly mobile app developed in XCode and CoreML that uses random forest decision tree algorithms (RF) to computationally predict the presence of hotspots on protein complexes within seconds, aiding the design of small molecule and peptide drugs that target protein-protein interactions, particularly for anticancer therapy. Leveraging features such as solvent accessible surface area (ASA), blocks substitution matrix, physicochemical properties (hydrophobicity, polarity, polarizability, propensities), position specific scoring matrix (PSSM) and solvent exposure, the RF is trained through a dataset of 313 mutated interface residues (133 hotspot residues and 180 non hotspot residues) from over 60 protein complexes to produce a training accuracy of 88.75%, validation accuracy of 92.86%, specificity of 87.18%, sensitivity of 75.38%, PPV 94.23%, NPV 86.61%. Blossom is high speed, low cost, and user friendly with significantly improved accuracy over the standard of alanine scanning mutagenesis. Citation Format: Stephanie Zhang, Minsoo Kang. Blossom AI: A novel drug discovery app for the prediction of hotspots on multiplex protein protein interaction complexes using random forest algorithms [abstract]. In: Proceedings of the American Association for Cancer Research Annual Meeting 2021; 2021 Apr 10-15 and May 17-21. Philadelphia (PA): AACR; Cancer Res 2021;81(13_Suppl):Abstract nr 199.\",\"PeriodicalId\":73617,\"journal\":{\"name\":\"Journal of bioinformatics and systems biology : Open access\",\"volume\":\"37 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of bioinformatics and systems biology : Open access\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1158/1538-7445.AM2021-199\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of bioinformatics and systems biology : Open access","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1158/1538-7445.AM2021-199","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

蛋白-蛋白相互作用(PPIs)是多种生理过程中信号转导通路的主干,介导细胞增殖和存活所必需的致癌信号的传递和调控,因此代表了抗癌治疗发现的潜在新一类药物靶点。然而,靶向PPI面临着一些挑战,包括大的PPI界面区域,缺乏深度的资金,存在不连续的结合位点,以及普遍缺乏天然配体。热点(对自由结合能有显著贡献的一小部分氨基酸残基)的存在使PPIs能够适应小分子扰动,在蛋白质结合的稳定性中起着至关重要的作用。有效识别蛋白质复合物中哪些特定的界面残基形成热点是理解蛋白质相互作用原理的关键,在蛋白质设计和药物开发中具有广阔的应用前景。该项目介绍了Blossom AI,这是一个用XCode和CoreML开发的新颖的用户友好的移动应用程序,它使用随机森林决策树算法(RF)在几秒钟内计算预测蛋白质复合物上热点的存在,帮助设计针对蛋白质-蛋白质相互作用的小分子和肽药物,特别是用于抗癌治疗。利用溶剂可达表面积(ASA)、块取代矩阵、物理化学性质(疏水性、极性、极化性、倾向)、位置特异性分数矩阵(PSSM)和溶剂暴露等特征,RF通过来自60多个蛋白质复合物的313个突变界面残基(133个热点残基和180个非热点残基)数据集进行训练,训练准确率为88.75%,验证准确率为92.86%,特异性为87.18%。灵敏度75.38%,PPV 94.23%, NPV 86.61%。Blossom是高速度,低成本,和用户友好与显著提高准确性比标准的丙氨酸扫描诱变。引文格式:Stephanie Zhang, Minsoo Kang。Blossom AI:一款新型药物发现应用程序,用于使用随机森林算法预测多重蛋白质蛋白质相互作用复合物的热点[摘要]。见:美国癌症研究协会2021年年会论文集;2021年4月10日至15日和5月17日至21日。费城(PA): AACR;癌症杂志,2021;81(13 -增刊):摘要第199期。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Abstract 199: Blossom AI: A novel drug discovery app for the prediction of hotspots on multiplex protein protein interaction complexes using random forest algorithms
Protein protein interactions (PPIs) form the backbone of signal transduction pathways in diverse physiological processes, mediating the transmission and regulation of oncogenic signals essential to cellular proliferation and survival, thus representing a potential new class of drug targets for anticancer therapeutic discovery. However, several challenges face the targeting of PPIs, including large PPI interface areas, a lack of deep pockets, the presence of noncontiguous binding sites, and a general lack of natural ligands. The presence of hot spots (small subsets of amino acid residues that contribute significantly to free binding energy) makes PPIs amenable to small molecule perturbations, playing essential roles in the stability of protein binding. Effectively identifying which specific interface residues of protein protein complexes form the hot spots is critical for understanding the principles of protein interactions and has broad application prospects in protein design and drug development. This project presents Blossom AI, a novel, user friendly mobile app developed in XCode and CoreML that uses random forest decision tree algorithms (RF) to computationally predict the presence of hotspots on protein complexes within seconds, aiding the design of small molecule and peptide drugs that target protein-protein interactions, particularly for anticancer therapy. Leveraging features such as solvent accessible surface area (ASA), blocks substitution matrix, physicochemical properties (hydrophobicity, polarity, polarizability, propensities), position specific scoring matrix (PSSM) and solvent exposure, the RF is trained through a dataset of 313 mutated interface residues (133 hotspot residues and 180 non hotspot residues) from over 60 protein complexes to produce a training accuracy of 88.75%, validation accuracy of 92.86%, specificity of 87.18%, sensitivity of 75.38%, PPV 94.23%, NPV 86.61%. Blossom is high speed, low cost, and user friendly with significantly improved accuracy over the standard of alanine scanning mutagenesis. Citation Format: Stephanie Zhang, Minsoo Kang. Blossom AI: A novel drug discovery app for the prediction of hotspots on multiplex protein protein interaction complexes using random forest algorithms [abstract]. In: Proceedings of the American Association for Cancer Research Annual Meeting 2021; 2021 Apr 10-15 and May 17-21. Philadelphia (PA): AACR; Cancer Res 2021;81(13_Suppl):Abstract nr 199.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信