推论:对疾病相关基因的统计推断揭示了组织与疾病的关联。

IF 2.8 Q1 GENETICS & HEREDITY
NAR Genomics and Bioinformatics Pub Date : 2025-12-31 eCollection Date: 2025-12-01 DOI:10.1093/nargab/lqaf205
Boqi Wang, Jiayi Wang, Ammar Aleem Rashied, Bo Meng, Jesse Zhang, Jun S Liu, Jie Jiang, Zhaohui S Qin
{"title":"推论:对疾病相关基因的统计推断揭示了组织与疾病的关联。","authors":"Boqi Wang, Jiayi Wang, Ammar Aleem Rashied, Bo Meng, Jesse Zhang, Jun S Liu, Jie Jiang, Zhaohui S Qin","doi":"10.1093/nargab/lqaf205","DOIUrl":null,"url":null,"abstract":"<p><p>Accurate identification of affected tissues of human diseases is important for the derivation of disease etiology and the development of new treatment strategies. In this study, we develop a logistic regression-based method named DEDUCE (disease tissue detection using logistic regression) that combines genomics big data and machine learning to address this important problem. The central hypothesis is that most disease-associated genes are expressed specifically in affected tissues. DEDUCE takes advantage of newly emerged data on disease-related genes as well as tissue-specific gene expression data. The unique feature of DEDUCE is that it takes into account the strength of gene-disease associations. When we applied DEDUCE to a total of 3261, 324 gene-disease associations collected from DisGeNET covering 30,170 diseases and 21,666 genes, we identified 216 significant tissue-disease pairs composed of 120 unique diseases and 37 unique tissues. Many of them shed light on potential explanations for disease pathogenesis. The results showed great consistency with previous findings and were proven effective by empirical plots and gene set enrichment analysis. Overall, DEDUCE has shown great potential in uncovering novel pathogenesis mechanisms of complex diseases. In-depth analysis and experimental validation were required to fully understand these discovered tissue-trait associations and their enriched genes.</p>","PeriodicalId":33994,"journal":{"name":"NAR Genomics and Bioinformatics","volume":"7 4","pages":"lqaf205"},"PeriodicalIF":2.8000,"publicationDate":"2025-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12754781/pdf/","citationCount":"0","resultStr":"{\"title\":\"DEDUCE: statistical inference on disease-associated genes uncovers tissue-disease associations.\",\"authors\":\"Boqi Wang, Jiayi Wang, Ammar Aleem Rashied, Bo Meng, Jesse Zhang, Jun S Liu, Jie Jiang, Zhaohui S Qin\",\"doi\":\"10.1093/nargab/lqaf205\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Accurate identification of affected tissues of human diseases is important for the derivation of disease etiology and the development of new treatment strategies. In this study, we develop a logistic regression-based method named DEDUCE (disease tissue detection using logistic regression) that combines genomics big data and machine learning to address this important problem. The central hypothesis is that most disease-associated genes are expressed specifically in affected tissues. DEDUCE takes advantage of newly emerged data on disease-related genes as well as tissue-specific gene expression data. The unique feature of DEDUCE is that it takes into account the strength of gene-disease associations. When we applied DEDUCE to a total of 3261, 324 gene-disease associations collected from DisGeNET covering 30,170 diseases and 21,666 genes, we identified 216 significant tissue-disease pairs composed of 120 unique diseases and 37 unique tissues. Many of them shed light on potential explanations for disease pathogenesis. The results showed great consistency with previous findings and were proven effective by empirical plots and gene set enrichment analysis. Overall, DEDUCE has shown great potential in uncovering novel pathogenesis mechanisms of complex diseases. In-depth analysis and experimental validation were required to fully understand these discovered tissue-trait associations and their enriched genes.</p>\",\"PeriodicalId\":33994,\"journal\":{\"name\":\"NAR Genomics and Bioinformatics\",\"volume\":\"7 4\",\"pages\":\"lqaf205\"},\"PeriodicalIF\":2.8000,\"publicationDate\":\"2025-12-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12754781/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"NAR Genomics and Bioinformatics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1093/nargab/lqaf205\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/12/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"Q1\",\"JCRName\":\"GENETICS & HEREDITY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"NAR Genomics and Bioinformatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/nargab/lqaf205","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/12/1 0:00:00","PubModel":"eCollection","JCR":"Q1","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 0

摘要

准确识别人类疾病的受影响组织对于疾病病因的推导和新的治疗策略的发展是重要的。在本研究中,我们开发了一种基于逻辑回归的方法,名为推导(利用逻辑回归进行疾病组织检测),该方法结合了基因组学大数据和机器学习来解决这一重要问题。中心假设是大多数疾病相关基因在受影响组织中特异性表达。演绎利用新出现的数据对疾病相关基因以及组织特异性基因表达数据。演绎的独特之处在于它考虑了基因-疾病关联的强度。当我们将推导结果应用于从DisGeNET收集的3261,324个基因-疾病关联,涵盖30,170种疾病和21,666个基因时,我们确定了216个重要的组织-疾病对,由120种独特疾病和37种独特组织组成。其中许多研究揭示了疾病发病机制的潜在解释。结果与前人的研究结果一致,并通过实验图和基因集富集分析证明了该方法的有效性。总的来说,演绎在揭示复杂疾病的新发病机制方面显示出巨大的潜力。为了充分理解这些发现的组织性状关联及其富集的基因,需要进行深入的分析和实验验证。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

DEDUCE: statistical inference on disease-associated genes uncovers tissue-disease associations.

DEDUCE: statistical inference on disease-associated genes uncovers tissue-disease associations.

DEDUCE: statistical inference on disease-associated genes uncovers tissue-disease associations.

DEDUCE: statistical inference on disease-associated genes uncovers tissue-disease associations.

Accurate identification of affected tissues of human diseases is important for the derivation of disease etiology and the development of new treatment strategies. In this study, we develop a logistic regression-based method named DEDUCE (disease tissue detection using logistic regression) that combines genomics big data and machine learning to address this important problem. The central hypothesis is that most disease-associated genes are expressed specifically in affected tissues. DEDUCE takes advantage of newly emerged data on disease-related genes as well as tissue-specific gene expression data. The unique feature of DEDUCE is that it takes into account the strength of gene-disease associations. When we applied DEDUCE to a total of 3261, 324 gene-disease associations collected from DisGeNET covering 30,170 diseases and 21,666 genes, we identified 216 significant tissue-disease pairs composed of 120 unique diseases and 37 unique tissues. Many of them shed light on potential explanations for disease pathogenesis. The results showed great consistency with previous findings and were proven effective by empirical plots and gene set enrichment analysis. Overall, DEDUCE has shown great potential in uncovering novel pathogenesis mechanisms of complex diseases. In-depth analysis and experimental validation were required to fully understand these discovered tissue-trait associations and their enriched genes.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
8.00
自引率
2.20%
发文量
95
审稿时长
15 weeks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信
小红书