Sina Tabakhi, Charlotte Vandermeulen, Ian Sudbery, Haiping Lu
{"title":"Heterogeneous graph attention network improves cancer multiomics integration","authors":"Sina Tabakhi, Charlotte Vandermeulen, Ian Sudbery, Haiping Lu","doi":"arxiv-2408.02845","DOIUrl":null,"url":null,"abstract":"The increase in high-dimensional multiomics data demands advanced integration\nmodels to capture the complexity of human diseases. Graph-based deep learning\nintegration models, despite their promise, struggle with small patient cohorts\nand high-dimensional features, often applying independent feature selection\nwithout modeling relationships among omics. Furthermore, conventional\ngraph-based omics models focus on homogeneous graphs, lacking multiple types of\nnodes and edges to capture diverse structures. We introduce a Heterogeneous\nGraph ATtention network for omics integration (HeteroGATomics) to improve\ncancer diagnosis. HeteroGATomics performs joint feature selection through a\nmulti-agent system, creating dedicated networks of feature and patient\nsimilarity for each omic modality. These networks are then combined into one\nheterogeneous graph for learning holistic omic-specific representations and\nintegrating predictions across modalities. Experiments on three cancer\nmultiomics datasets demonstrate HeteroGATomics' superior performance in cancer\ndiagnosis. Moreover, HeteroGATomics enhances interpretability by identifying\nimportant biomarkers contributing to the diagnosis outcomes.","PeriodicalId":501070,"journal":{"name":"arXiv - QuanBio - Genomics","volume":"20 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-08-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - QuanBio - Genomics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2408.02845","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The increase in high-dimensional multiomics data demands advanced integration
models to capture the complexity of human diseases. Graph-based deep learning
integration models, despite their promise, struggle with small patient cohorts
and high-dimensional features, often applying independent feature selection
without modeling relationships among omics. Furthermore, conventional
graph-based omics models focus on homogeneous graphs, lacking multiple types of
nodes and edges to capture diverse structures. We introduce a Heterogeneous
Graph ATtention network for omics integration (HeteroGATomics) to improve
cancer diagnosis. HeteroGATomics performs joint feature selection through a
multi-agent system, creating dedicated networks of feature and patient
similarity for each omic modality. These networks are then combined into one
heterogeneous graph for learning holistic omic-specific representations and
integrating predictions across modalities. Experiments on three cancer
multiomics datasets demonstrate HeteroGATomics' superior performance in cancer
diagnosis. Moreover, HeteroGATomics enhances interpretability by identifying
important biomarkers contributing to the diagnosis outcomes.