MOHGCN: A trustworthy multi-omics data integration framework based on specificity-aware heterogeneous graph convolutional neural networks for disease diagnosis
IF 7.5 1区 计算机科学Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
Wenhao Wu , Shudong Wang , Yuanyuan Zhang , Kuijie Zhang , Wenjing Yin , Shanchen Pang
{"title":"MOHGCN: A trustworthy multi-omics data integration framework based on specificity-aware heterogeneous graph convolutional neural networks for disease diagnosis","authors":"Wenhao Wu , Shudong Wang , Yuanyuan Zhang , Kuijie Zhang , Wenjing Yin , Shanchen Pang","doi":"10.1016/j.eswa.2024.125772","DOIUrl":null,"url":null,"abstract":"<div><div>With the advancement of cutting-edge sequencing methodologies, the integration of multi-omics data provides invaluable opportunities for researchers to study complex diseases from a molecular perspective while at the same time being challenged by the deployment of safety-critical applications such as computer-aided diagnostics. However, existing methods in multi-omics data integration primarily explore interactions between omics or samples, neglecting high-order interaction information among biomolecules specific to certain diseases. In this work, we propose MOHGCN, a trustworthy multi-omics data integration framework based on specificity-aware heterogeneous graph convolutional neural networks for disease diagnosis, aiming to maximize the utilization of biomolecular interactions in patients with specific diseases for precise diagnosis to enhance the model’s credibility. In the approach, we constructed a heterogeneous graph of samples and genes and devised the HGCN graph convolution model specifically tailored to the sample–gene heterogeneous graph. Concurrently, techniques such as trustworthy attention weights and self-attention mechanisms were incorporated to unveil relationships between different omics, facilitating the efficient integration of multi-omics data. Through comprehensive experimentation on four publicly available multi-omics medical datasets, our proposed framework consistently demonstrates superior performance across various classification tasks. Simultaneously, the experimental results substantiate the model’s effectiveness in extracting features from multi-omics data and unveiling latent associations among different omics.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"263 ","pages":"Article 125772"},"PeriodicalIF":7.5000,"publicationDate":"2024-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Expert Systems with Applications","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0957417424026393","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
With the advancement of cutting-edge sequencing methodologies, the integration of multi-omics data provides invaluable opportunities for researchers to study complex diseases from a molecular perspective while at the same time being challenged by the deployment of safety-critical applications such as computer-aided diagnostics. However, existing methods in multi-omics data integration primarily explore interactions between omics or samples, neglecting high-order interaction information among biomolecules specific to certain diseases. In this work, we propose MOHGCN, a trustworthy multi-omics data integration framework based on specificity-aware heterogeneous graph convolutional neural networks for disease diagnosis, aiming to maximize the utilization of biomolecular interactions in patients with specific diseases for precise diagnosis to enhance the model’s credibility. In the approach, we constructed a heterogeneous graph of samples and genes and devised the HGCN graph convolution model specifically tailored to the sample–gene heterogeneous graph. Concurrently, techniques such as trustworthy attention weights and self-attention mechanisms were incorporated to unveil relationships between different omics, facilitating the efficient integration of multi-omics data. Through comprehensive experimentation on four publicly available multi-omics medical datasets, our proposed framework consistently demonstrates superior performance across various classification tasks. Simultaneously, the experimental results substantiate the model’s effectiveness in extracting features from multi-omics data and unveiling latent associations among different omics.
期刊介绍:
Expert Systems With Applications is an international journal dedicated to the exchange of information on expert and intelligent systems used globally in industry, government, and universities. The journal emphasizes original papers covering the design, development, testing, implementation, and management of these systems, offering practical guidelines. It spans various sectors such as finance, engineering, marketing, law, project management, information management, medicine, and more. The journal also welcomes papers on multi-agent systems, knowledge management, neural networks, knowledge discovery, data mining, and other related areas, excluding applications to military/defense systems.