基于网络的异常检测算法揭示了在人体组织中起重要作用的蛋白质。

IF 11.8 2区 生物学 Q1 MULTIDISCIPLINARY SCIENCES
Dima Kagan, Juman Jubran, Esti Yeger-Lotem, Michael Fire
{"title":"基于网络的异常检测算法揭示了在人体组织中起重要作用的蛋白质。","authors":"Dima Kagan, Juman Jubran, Esti Yeger-Lotem, Michael Fire","doi":"10.1093/gigascience/giaf034","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Proteins act through physical interactions with other molecules to maintain organismal health. Protein-protein interaction (PPI) networks have proved to be a powerful framework for obtaining insight into protein functions, cellular organization, response to signals, and disease states. In multicellular organisms, protein content varies between tissues, influencing tissue morphology and function. Weighted PPI networks, reflecting the likelihood of interactions in specific tissues, offer insights into tissue-specific processes and disease mechanisms. We hypothesized that detecting anomalous nodes in these networks could reveal proteins with key tissue-specific functions.</p><p><strong>Results: </strong>Here, we introduce Weighted Graph Anomalous Node Detection (WGAND), a novel machine-learning algorithm to identify anomalous nodes in weighted graphs. WGAND estimates expected edge weights and uses deviations to generate anomaly detection features, which are then used to score network nodes. We applied WGAND to weighted PPI networks of 17 human tissues. High-ranking anomalous nodes were enriched for proteins associated with tissue-specific diseases and tissue-specific biological processes, such as neuron signaling in the brain and spermatogenesis in the testis. WGAND outperformed other methods in terms of area under the ROC curve and precision at K, highlighting its effectiveness in uncovering biologically meaningful anomalies.</p><p><strong>Conclusions: </strong>Our findings demonstrate WGAND's potential as a powerful tool for detecting anomalous proteins with significant biological roles. By identifying proteins involved in critical tissue-specific processes and diseases, WGAND offers valuable insights for discovering novel biomarkers and therapeutic targets. Its versatile algorithm is suitable for any weighted graph and is broadly applicable across various fields. The WGAND algorithm is available as an open-source Python library at https://github.com/data4goodlab/wgand.</p>","PeriodicalId":12581,"journal":{"name":"GigaScience","volume":"14 ","pages":""},"PeriodicalIF":11.8000,"publicationDate":"2025-01-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11976396/pdf/","citationCount":"0","resultStr":"{\"title\":\"Network-based anomaly detection algorithm reveals proteins with major roles in human tissues.\",\"authors\":\"Dima Kagan, Juman Jubran, Esti Yeger-Lotem, Michael Fire\",\"doi\":\"10.1093/gigascience/giaf034\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Proteins act through physical interactions with other molecules to maintain organismal health. Protein-protein interaction (PPI) networks have proved to be a powerful framework for obtaining insight into protein functions, cellular organization, response to signals, and disease states. In multicellular organisms, protein content varies between tissues, influencing tissue morphology and function. Weighted PPI networks, reflecting the likelihood of interactions in specific tissues, offer insights into tissue-specific processes and disease mechanisms. We hypothesized that detecting anomalous nodes in these networks could reveal proteins with key tissue-specific functions.</p><p><strong>Results: </strong>Here, we introduce Weighted Graph Anomalous Node Detection (WGAND), a novel machine-learning algorithm to identify anomalous nodes in weighted graphs. WGAND estimates expected edge weights and uses deviations to generate anomaly detection features, which are then used to score network nodes. We applied WGAND to weighted PPI networks of 17 human tissues. High-ranking anomalous nodes were enriched for proteins associated with tissue-specific diseases and tissue-specific biological processes, such as neuron signaling in the brain and spermatogenesis in the testis. WGAND outperformed other methods in terms of area under the ROC curve and precision at K, highlighting its effectiveness in uncovering biologically meaningful anomalies.</p><p><strong>Conclusions: </strong>Our findings demonstrate WGAND's potential as a powerful tool for detecting anomalous proteins with significant biological roles. By identifying proteins involved in critical tissue-specific processes and diseases, WGAND offers valuable insights for discovering novel biomarkers and therapeutic targets. Its versatile algorithm is suitable for any weighted graph and is broadly applicable across various fields. The WGAND algorithm is available as an open-source Python library at https://github.com/data4goodlab/wgand.</p>\",\"PeriodicalId\":12581,\"journal\":{\"name\":\"GigaScience\",\"volume\":\"14 \",\"pages\":\"\"},\"PeriodicalIF\":11.8000,\"publicationDate\":\"2025-01-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11976396/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"GigaScience\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.1093/gigascience/giaf034\",\"RegionNum\":2,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"MULTIDISCIPLINARY SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"GigaScience","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1093/gigascience/giaf034","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0

摘要

背景:蛋白质通过与其他分子的物理相互作用来维持机体健康。蛋白质-蛋白质相互作用(PPI)网络已被证明是深入了解蛋白质功能、细胞组织、对信号的反应和疾病状态的强大框架。在多细胞生物中,蛋白质含量因组织而异,影响组织形态和功能。加权PPI网络反映了特定组织中相互作用的可能性,为组织特异性过程和疾病机制提供了见解。我们假设检测这些网络中的异常节点可以揭示具有关键组织特异性功能的蛋白质。结果:在这里,我们引入了加权图异常节点检测(WGAND),这是一种新的机器学习算法,用于识别加权图中的异常节点。WGAND估计预期的边缘权重,并使用偏差生成异常检测特征,然后使用这些特征对网络节点进行评分。我们将WGAND应用于17个人体组织的加权PPI网络。高级别异常淋巴结富含与组织特异性疾病和组织特异性生物过程相关的蛋白质,如大脑中的神经元信号传导和睾丸中的精子发生。WGAND在ROC曲线下的面积和K点的精度方面优于其他方法,突出了其在发现具有生物学意义的异常方面的有效性。结论:我们的发现证明了WGAND作为检测具有重要生物学作用的异常蛋白的强大工具的潜力。通过识别参与关键组织特异性过程和疾病的蛋白质,WGAND为发现新的生物标志物和治疗靶点提供了有价值的见解。它的通用算法适用于任何加权图,广泛适用于各个领域。WGAND算法可以在https://github.com/data4goodlab/wgand上作为开源Python库获得。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Network-based anomaly detection algorithm reveals proteins with major roles in human tissues.

Background: Proteins act through physical interactions with other molecules to maintain organismal health. Protein-protein interaction (PPI) networks have proved to be a powerful framework for obtaining insight into protein functions, cellular organization, response to signals, and disease states. In multicellular organisms, protein content varies between tissues, influencing tissue morphology and function. Weighted PPI networks, reflecting the likelihood of interactions in specific tissues, offer insights into tissue-specific processes and disease mechanisms. We hypothesized that detecting anomalous nodes in these networks could reveal proteins with key tissue-specific functions.

Results: Here, we introduce Weighted Graph Anomalous Node Detection (WGAND), a novel machine-learning algorithm to identify anomalous nodes in weighted graphs. WGAND estimates expected edge weights and uses deviations to generate anomaly detection features, which are then used to score network nodes. We applied WGAND to weighted PPI networks of 17 human tissues. High-ranking anomalous nodes were enriched for proteins associated with tissue-specific diseases and tissue-specific biological processes, such as neuron signaling in the brain and spermatogenesis in the testis. WGAND outperformed other methods in terms of area under the ROC curve and precision at K, highlighting its effectiveness in uncovering biologically meaningful anomalies.

Conclusions: Our findings demonstrate WGAND's potential as a powerful tool for detecting anomalous proteins with significant biological roles. By identifying proteins involved in critical tissue-specific processes and diseases, WGAND offers valuable insights for discovering novel biomarkers and therapeutic targets. Its versatile algorithm is suitable for any weighted graph and is broadly applicable across various fields. The WGAND algorithm is available as an open-source Python library at https://github.com/data4goodlab/wgand.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
GigaScience
GigaScience MULTIDISCIPLINARY SCIENCES-
CiteScore
15.50
自引率
1.10%
发文量
119
审稿时长
1 weeks
期刊介绍: GigaScience seeks to transform data dissemination and utilization in the life and biomedical sciences. As an online open-access open-data journal, it specializes in publishing "big-data" studies encompassing various fields. Its scope includes not only "omic" type data and the fields of high-throughput biology currently serviced by large public repositories, but also the growing range of more difficult-to-access data, such as imaging, neuroscience, ecology, cohort data, systems biology and other new types of large-scale shareable data.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信