Interpretable AI for inference of causal molecular relationships from omics data.

IF 12.5 1区 综合性期刊 Q1 MULTIDISCIPLINARY SCIENCES
Payam Dibaeinia, Abhishek Ojha, Saurabh Sinha
{"title":"Interpretable AI for inference of causal molecular relationships from omics data.","authors":"Payam Dibaeinia, Abhishek Ojha, Saurabh Sinha","doi":"10.1126/sciadv.adk0837","DOIUrl":null,"url":null,"abstract":"<p><p>The discovery of molecular relationships from high-dimensional data is a major open problem in bioinformatics. Machine learning and feature attribution models have shown great promise in this context but lack causal interpretation. Here, we show that a popular feature attribution model, under certain assumptions, estimates an average of a causal quantity reflecting the direct influence of one variable on another. We leverage this insight to propose a precise definition of a gene regulatory relationship and implement a new tool, CIMLA (Counterfactual Inference by Machine Learning and Attribution Models), to identify differences in gene regulatory networks between biological conditions, a problem that has received great attention in recent years. Using extensive benchmarking on simulated data, we show that CIMLA is more robust to confounding variables and is more accurate than leading methods. Last, we use CIMLA to analyze a previously published single-cell RNA sequencing dataset from subjects with and without Alzheimer's disease (AD), discovering several potential regulators of AD.</p>","PeriodicalId":21609,"journal":{"name":"Science Advances","volume":"11 7","pages":"eadk0837"},"PeriodicalIF":12.5000,"publicationDate":"2025-02-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11827637/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Science Advances","FirstCategoryId":"103","ListUrlMain":"https://doi.org/10.1126/sciadv.adk0837","RegionNum":1,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0

Abstract

The discovery of molecular relationships from high-dimensional data is a major open problem in bioinformatics. Machine learning and feature attribution models have shown great promise in this context but lack causal interpretation. Here, we show that a popular feature attribution model, under certain assumptions, estimates an average of a causal quantity reflecting the direct influence of one variable on another. We leverage this insight to propose a precise definition of a gene regulatory relationship and implement a new tool, CIMLA (Counterfactual Inference by Machine Learning and Attribution Models), to identify differences in gene regulatory networks between biological conditions, a problem that has received great attention in recent years. Using extensive benchmarking on simulated data, we show that CIMLA is more robust to confounding variables and is more accurate than leading methods. Last, we use CIMLA to analyze a previously published single-cell RNA sequencing dataset from subjects with and without Alzheimer's disease (AD), discovering several potential regulators of AD.

从 omics 数据推断因果分子关系的可解释人工智能。
从高维数据中发现分子关系是生物信息学中的一个重大开放问题。机器学习和特征归因模型在这种情况下显示出很大的希望,但缺乏因果解释。在这里,我们展示了一个流行的特征归因模型,在某些假设下,估计反映一个变量对另一个变量的直接影响的因果量的平均值。我们利用这一见解提出了基因调控关系的精确定义,并实施了一个新的工具,CIMLA(机器学习和归因模型反事实推理),以识别生物条件之间基因调控网络的差异,这是近年来受到极大关注的一个问题。通过对模拟数据进行广泛的基准测试,我们表明CIMLA对混杂变量具有更强的鲁棒性,并且比领先的方法更准确。最后,我们使用CIMLA分析了先前发表的来自阿尔茨海默病(AD)患者和非AD患者的单细胞RNA测序数据集,发现了AD的几个潜在调节因子。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Science Advances
Science Advances 综合性期刊-综合性期刊
CiteScore
21.40
自引率
1.50%
发文量
1937
审稿时长
29 weeks
期刊介绍: Science Advances, an open-access journal by AAAS, publishes impactful research in diverse scientific areas. It aims for fair, fast, and expert peer review, providing freely accessible research to readers. Led by distinguished scientists, the journal supports AAAS's mission by extending Science magazine's capacity to identify and promote significant advances. Evolving digital publishing technologies play a crucial role in advancing AAAS's global mission for science communication and benefitting humankind.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信