利用权威医学本体论中的专业知识改进因果贝叶斯网络

ACM transactions on computing for healthcare Pub Date : 2023-06-17 DOI:10.1145/3604561

Hengyi Hu, L. Kerschberg

{"title":"利用权威医学本体论中的专业知识改进因果贝叶斯网络","authors":"Hengyi Hu, L. Kerschberg","doi":"10.1145/3604561","DOIUrl":null,"url":null,"abstract":"Discovering causal relationships among symptoms is a topical issue in the analysis of observational patient datasets. A Causal Bayesian Network (CBN) is a popular analytical framework for causal inference. While there are many methods and algorithms capable of learning a Bayesian network, they are reliant on the complexity and thoroughness of the algorithm and do not consider prior expertise from authoritative sources. This paper proposes a novel method of extracting prior causal knowledge contained in Authoritative Medical Ontologies (AMOs) and using this prior knowledge to orient arcs in a CBN learned from observational patient data. Since AMOs are robust biomedical ontologies containing the collective knowledge of the experts who created them, utilizing the ordering information contained within them produces improved CBNs which provide additional insight into the disease domain. To demonstrate our method, we obtained prior causal ordering information among symptoms from three AMOs: 1) the Medical Dictionary for Regulatory Activities Terminology (MedDRA), 2) the International Classification of Diseases Version 10 Clinical Modification (ICD-10-CM), and 3) Systematized Nomenclature of Medicine Clinical Terms (SNOMED CT). The prior ontological knowledge from these three AMOs is then used to orient arcs in a series of CBNs learned from the National Institutes of Mental Health study on Sequenced Treatment Alternatives to Relieve Depression (STAR*D) patient dataset using the Max-Min Hill-Climbing (MMHC) algorithm. Six distinct CBNs are generated using MMHC: an unmodified baseline model using only the algorithm, three CBNs oriented with ordered-variable pairs from MedDRA, ICD-10-CM, and SNOMED CT, and two more with ordered pairs from a combination of these AMOs. The resulting CBNs modified using ordered-variable pairs significantly change the structure of the network. The agreement between the Modified networks and the Baseline ranges from 50% to 90%. A modified network using ordering information from all ontologies obtained an agreement of 50% (10 out of 20 arcs exist in both the Baseline and Modified models) while maintaining comparable predictive accuracy. This indicates that the Modified CBN reflects the causal claims in the AMOs and agrees with both the AMOs and the observational STAR*D dataset. Furthermore, the Modified models discovered new potentially causal relationships among symptoms in the model, while eliminating weaker edges in a qualitative analysis of the significance of these relationships in existing epidemiological research.","PeriodicalId":72043,"journal":{"name":"ACM transactions on computing for healthcare","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Improving Causal Bayesian Networks using Expertise in Authoritative Medical Ontologies\",\"authors\":\"Hengyi Hu, L. Kerschberg\",\"doi\":\"10.1145/3604561\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Discovering causal relationships among symptoms is a topical issue in the analysis of observational patient datasets. A Causal Bayesian Network (CBN) is a popular analytical framework for causal inference. While there are many methods and algorithms capable of learning a Bayesian network, they are reliant on the complexity and thoroughness of the algorithm and do not consider prior expertise from authoritative sources. This paper proposes a novel method of extracting prior causal knowledge contained in Authoritative Medical Ontologies (AMOs) and using this prior knowledge to orient arcs in a CBN learned from observational patient data. Since AMOs are robust biomedical ontologies containing the collective knowledge of the experts who created them, utilizing the ordering information contained within them produces improved CBNs which provide additional insight into the disease domain. To demonstrate our method, we obtained prior causal ordering information among symptoms from three AMOs: 1) the Medical Dictionary for Regulatory Activities Terminology (MedDRA), 2) the International Classification of Diseases Version 10 Clinical Modification (ICD-10-CM), and 3) Systematized Nomenclature of Medicine Clinical Terms (SNOMED CT). The prior ontological knowledge from these three AMOs is then used to orient arcs in a series of CBNs learned from the National Institutes of Mental Health study on Sequenced Treatment Alternatives to Relieve Depression (STAR*D) patient dataset using the Max-Min Hill-Climbing (MMHC) algorithm. Six distinct CBNs are generated using MMHC: an unmodified baseline model using only the algorithm, three CBNs oriented with ordered-variable pairs from MedDRA, ICD-10-CM, and SNOMED CT, and two more with ordered pairs from a combination of these AMOs. The resulting CBNs modified using ordered-variable pairs significantly change the structure of the network. The agreement between the Modified networks and the Baseline ranges from 50% to 90%. A modified network using ordering information from all ontologies obtained an agreement of 50% (10 out of 20 arcs exist in both the Baseline and Modified models) while maintaining comparable predictive accuracy. This indicates that the Modified CBN reflects the causal claims in the AMOs and agrees with both the AMOs and the observational STAR*D dataset. Furthermore, the Modified models discovered new potentially causal relationships among symptoms in the model, while eliminating weaker edges in a qualitative analysis of the significance of these relationships in existing epidemiological research.\",\"PeriodicalId\":72043,\"journal\":{\"name\":\"ACM transactions on computing for healthcare\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-06-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"ACM transactions on computing for healthcare\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3604561\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACM transactions on computing for healthcare","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3604561","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

发现症状之间的因果关系是观察患者数据集分析中的一个热门问题。因果贝叶斯网络（CBN）是一种流行的因果推理分析框架。虽然有许多方法和算法能够学习贝叶斯网络，但它们依赖于算法的复杂性和彻底性，并且不考虑来自权威来源的先验专业知识。本文提出了一种提取权威医学本体论（AMOs）中包含的先验因果知识的新方法，并使用该先验知识来确定从观察患者数据中学习的CBN中的弧的方向。由于AMO是强大的生物医学本体，包含创建它们的专家的集体知识，因此利用其中包含的排序信息可以产生改进的CBN，从而提供对疾病领域的额外见解。为了证明我们的方法，我们从三个AMO中获得了症状之间的先验因果排序信息：1）医学活动术语词典（MedDRA），2）国际疾病分类第10版临床修改（ICD-10-CM），以及3）医学临床术语系统命名法（SNOMED CT）。然后，来自这三个AMO的先验本体论知识被用于确定一系列CBN中的弧的方向，这些CBN是从美国国立精神卫生研究院关于缓解抑郁的顺序治疗替代方案（STAR*D）患者数据集的研究中学习到的，使用了Max-Min爬山（MMHC）算法。使用MMHC生成了六个不同的CBN：一个仅使用算法的未修改基线模型，三个CBN使用MedDRA、ICD-10-CM和SNOMED CT的有序变量对定向，另外两个使用这些AMO组合的有序对。使用有序变量对修改的CBN显著改变了网络的结构。修改后的网络和基线之间的一致性在50%到90%之间。使用来自所有本体的排序信息的修改网络获得了50%的一致性（基线模型和修改模型中都存在20个弧中的10个），同时保持了可比较的预测精度。这表明修正后的CBN反映了AMOs中的因果关系，并与AMOs和观测STAR*D数据集一致。此外，修正模型在模型中发现了症状之间新的潜在因果关系，同时在对这些关系在现有流行病学研究中的重要性进行定性分析时消除了较弱的边缘。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Improving Causal Bayesian Networks using Expertise in Authoritative Medical Ontologies

Discovering causal relationships among symptoms is a topical issue in the analysis of observational patient datasets. A Causal Bayesian Network (CBN) is a popular analytical framework for causal inference. While there are many methods and algorithms capable of learning a Bayesian network, they are reliant on the complexity and thoroughness of the algorithm and do not consider prior expertise from authoritative sources. This paper proposes a novel method of extracting prior causal knowledge contained in Authoritative Medical Ontologies (AMOs) and using this prior knowledge to orient arcs in a CBN learned from observational patient data. Since AMOs are robust biomedical ontologies containing the collective knowledge of the experts who created them, utilizing the ordering information contained within them produces improved CBNs which provide additional insight into the disease domain. To demonstrate our method, we obtained prior causal ordering information among symptoms from three AMOs: 1) the Medical Dictionary for Regulatory Activities Terminology (MedDRA), 2) the International Classification of Diseases Version 10 Clinical Modification (ICD-10-CM), and 3) Systematized Nomenclature of Medicine Clinical Terms (SNOMED CT). The prior ontological knowledge from these three AMOs is then used to orient arcs in a series of CBNs learned from the National Institutes of Mental Health study on Sequenced Treatment Alternatives to Relieve Depression (STAR*D) patient dataset using the Max-Min Hill-Climbing (MMHC) algorithm. Six distinct CBNs are generated using MMHC: an unmodified baseline model using only the algorithm, three CBNs oriented with ordered-variable pairs from MedDRA, ICD-10-CM, and SNOMED CT, and two more with ordered pairs from a combination of these AMOs. The resulting CBNs modified using ordered-variable pairs significantly change the structure of the network. The agreement between the Modified networks and the Baseline ranges from 50% to 90%. A modified network using ordering information from all ontologies obtained an agreement of 50% (10 out of 20 arcs exist in both the Baseline and Modified models) while maintaining comparable predictive accuracy. This indicates that the Modified CBN reflects the causal claims in the AMOs and agrees with both the AMOs and the observational STAR*D dataset. Furthermore, the Modified models discovered new potentially causal relationships among symptoms in the model, while eliminating weaker edges in a qualitative analysis of the significance of these relationships in existing epidemiological research.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

ACM transactions on computing for healthcare

CiteScore

10.30

自引率

0.00%

发文量