CIAA: Integrated Proteomics and Structural Modeling for Understanding Cysteine Reactivity with Iodoacetamide Alkyne.

IF 3.8 2区 生物学 Q2 BIOCHEMISTRY & MOLECULAR BIOLOGY
ACS Chemical Biology Pub Date : 2025-07-18 Epub Date: 2025-06-29 DOI:10.1021/acschembio.5c00225
Lisa M Boatner, Jerome Eberhardt, Flowreen Shikwana, Matthew Holcomb, Peiyuan Lee, Kendall N Houk, Stefano Forli, Keriann M Backus
{"title":"CIAA: Integrated Proteomics and Structural Modeling for Understanding Cysteine Reactivity with Iodoacetamide Alkyne.","authors":"Lisa M Boatner, Jerome Eberhardt, Flowreen Shikwana, Matthew Holcomb, Peiyuan Lee, Kendall N Houk, Stefano Forli, Keriann M Backus","doi":"10.1021/acschembio.5c00225","DOIUrl":null,"url":null,"abstract":"<p><p>Cysteine residues play key roles in protein structure and function and can serve as targets for chemical probes and even drugs. Chemoproteomic studies have revealed that heightened cysteine reactivity toward electrophilic probes, such as iodoacetamide alkyne (IAA), is indicative of likely residue functionality. However, while the cysteine coverage of chemoproteomic studies has increased substantially, these methods still provide only a partial assessment of proteome-wide cysteine reactivity, with cysteines from low-abundance proteins and tough-to-detect peptides still largely refractory to chemoproteomic analysis. Here, we integrate cysteine chemoproteomic reactivity data sets with structure-guided computational analysis to delineate key structural features of proteins that favor elevated cysteine reactivity toward IAA. We first generated and aggregated multiple descriptors of cysteine microenvironment, including amino acid content, solvent accessibility, residue proximity, secondary structure, and predicted p<i>K</i><sub>a</sub>. We find that no single feature is sufficient to accurately predict the reactivity. Therefore, we developed the CIAA (Cysteine reactivity toward IodoAcetamide Alkyne) method, which utilizes a Random Forest model to assess cysteine reactivity by incorporating descriptors that characterize the three-dimensional (3D) structural properties of thiol microenvironments. We trained the CIAA model on existing and newly generated cysteine chemoproteomic reactivity data paired with high-resolution crystal structures from the Protein Data Bank (PDB), with cross-validation against an external data set. CIAA analysis reveals key features driving cysteine reactivity, such as backbone hydrogen bond donor atoms, and reveals still underserved needs in the area of computational predictions of cysteine reactivity, including challenges surrounding protein structure selection data set curation. Thus, our work provides a strong foundation for deploying artificial intelligence (AI) on cysteine chemoproteomic data sets.</p>","PeriodicalId":11,"journal":{"name":"ACS Chemical Biology","volume":" ","pages":"1669-1682"},"PeriodicalIF":3.8000,"publicationDate":"2025-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACS Chemical Biology","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1021/acschembio.5c00225","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/6/29 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Cysteine residues play key roles in protein structure and function and can serve as targets for chemical probes and even drugs. Chemoproteomic studies have revealed that heightened cysteine reactivity toward electrophilic probes, such as iodoacetamide alkyne (IAA), is indicative of likely residue functionality. However, while the cysteine coverage of chemoproteomic studies has increased substantially, these methods still provide only a partial assessment of proteome-wide cysteine reactivity, with cysteines from low-abundance proteins and tough-to-detect peptides still largely refractory to chemoproteomic analysis. Here, we integrate cysteine chemoproteomic reactivity data sets with structure-guided computational analysis to delineate key structural features of proteins that favor elevated cysteine reactivity toward IAA. We first generated and aggregated multiple descriptors of cysteine microenvironment, including amino acid content, solvent accessibility, residue proximity, secondary structure, and predicted pKa. We find that no single feature is sufficient to accurately predict the reactivity. Therefore, we developed the CIAA (Cysteine reactivity toward IodoAcetamide Alkyne) method, which utilizes a Random Forest model to assess cysteine reactivity by incorporating descriptors that characterize the three-dimensional (3D) structural properties of thiol microenvironments. We trained the CIAA model on existing and newly generated cysteine chemoproteomic reactivity data paired with high-resolution crystal structures from the Protein Data Bank (PDB), with cross-validation against an external data set. CIAA analysis reveals key features driving cysteine reactivity, such as backbone hydrogen bond donor atoms, and reveals still underserved needs in the area of computational predictions of cysteine reactivity, including challenges surrounding protein structure selection data set curation. Thus, our work provides a strong foundation for deploying artificial intelligence (AI) on cysteine chemoproteomic data sets.

CIAA:了解半胱氨酸与碘乙酰胺炔反应性的集成蛋白质组学和结构建模。
半胱氨酸残基在蛋白质结构和功能中起着关键作用,可以作为化学探针甚至药物的靶标。化学蛋白质组学研究表明,半胱氨酸对亲电探针(如碘乙酰胺炔(IAA))的反应性增强表明可能存在残基功能。然而,尽管半胱氨酸在化学蛋白质组学研究中的覆盖范围已经大大增加,但这些方法仍然只能对全蛋白质组的半胱氨酸反应性进行部分评估,来自低丰度蛋白质和难以检测的肽段的半胱氨酸在很大程度上仍然难以进行化学蛋白质组学分析。在这里,我们将半胱氨酸化学蛋白质组学反应性数据集与结构指导的计算分析相结合,以描绘有利于提高半胱氨酸对IAA反应性的蛋白质的关键结构特征。我们首先生成并聚合了半胱氨酸微环境的多个描述符,包括氨基酸含量、溶剂可及性、残基接近性、二级结构和预测pKa。我们发现没有单一的特征足以准确地预测反应性。因此,我们开发了CIAA(半胱氨酸对碘乙酰胺炔的反应性)方法,该方法利用随机森林模型通过结合表征硫醇微环境三维(3D)结构特性的描述子来评估半胱氨酸的反应性。我们将现有的和新生成的半胱氨酸化学蛋白质组学反应性数据与来自蛋白质数据库(PDB)的高分辨率晶体结构配对,对CIAA模型进行训练,并对外部数据集进行交叉验证。CIAA分析揭示了驱动半胱氨酸反应性的关键特征,如主氢键供体原子,并揭示了半胱氨酸反应性计算预测领域仍未满足的需求,包括围绕蛋白质结构选择数据集管理的挑战。因此,我们的工作为在半胱氨酸化学蛋白质组学数据集上部署人工智能(AI)提供了坚实的基础。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
ACS Chemical Biology
ACS Chemical Biology 生物-生化与分子生物学
CiteScore
7.50
自引率
5.00%
发文量
353
审稿时长
3.3 months
期刊介绍: ACS Chemical Biology provides an international forum for the rapid communication of research that broadly embraces the interface between chemistry and biology. The journal also serves as a forum to facilitate the communication between biologists and chemists that will translate into new research opportunities and discoveries. Results will be published in which molecular reasoning has been used to probe questions through in vitro investigations, cell biological methods, or organismic studies. We welcome mechanistic studies on proteins, nucleic acids, sugars, lipids, and nonbiological polymers. The journal serves a large scientific community, exploring cellular function from both chemical and biological perspectives. It is understood that submitted work is based upon original results and has not been published previously.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信