Xu Luo, Pei Shu, Ning Liu, Dong Miao, Xiuding Cai, Yu Yao, Xin Wang
{"title":"DG-MSGAT:用于预测直肠癌新辅助治疗反应的生物学信息差异基因多尺度图关注网络。","authors":"Xu Luo, Pei Shu, Ning Liu, Dong Miao, Xiuding Cai, Yu Yao, Xin Wang","doi":"10.1016/j.cmpb.2025.108974","DOIUrl":null,"url":null,"abstract":"<p><strong>Background and objective: </strong>Accurate prediction of the efficacy of neoadjuvant therapy - particularly the likelihood of achieving a pathological complete response (pCR) - is critical to improving outcomes in patients with rectal cancer. The anticipation of therapeutic response prior to surgery enables the development of personalized treatment strategies and reduces unnecessary interventions for non-responders. While genetic profiling has been integrated into predictive models to enhance response estimation, many existing approaches overlook gene-gene interactions. Furthermore, they often struggle with the high dimensionality, noise, and sparsity inherent in gene expression data. To address these limitations, we propose a biologically informed model, the Differential Gene Multi-Scale Graph Attention Network (DG-MSGAT). This model integrates differential expression signals with multi-scale gene interaction patterns to improve the accuracy of treatment response prediction.</p><p><strong>Methods: </strong>By integrating gene expression profiles with differential expression signals, we construct a patient-specific gene graph whose edges are defined based on curated protein-protein interaction data. This graph is then processed by DG-MSGAT, a multi-scale graph attention network that utilizes stacked attention layers and residual connections to model hierarchical gene dependencies and preserve feature integrity. The resulting representation is subsequently used to estimate the probability of achieving a pathological complete response.</p><p><strong>Results: </strong>In patients with locally advanced rectal cancer, the DG-MSGAT model substantially outperformed conventional algorithms - including support vector machines, decision trees, and random forests - in predicting neoadjuvant therapy efficacy. Network analysis identified key genes (e.g., TP53, EGFR, CTNNB1) and immune-related pathways that are consistent with clinically established determinants of therapeutic response.</p><p><strong>Conclusion: </strong>The DG-MSGAT model offers a promising advancement in the prediction of neoadjuvant therapy outcomes in rectal cancer. By effectively modeling gene interactions and mitigating the limitations associated with high-dimensional gene expression data, it provides a clinically relevant tool to support personalized treatment decision-making.</p>","PeriodicalId":10624,"journal":{"name":"Computer methods and programs in biomedicine","volume":"271 ","pages":"108974"},"PeriodicalIF":4.8000,"publicationDate":"2025-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"DG-MSGAT: A Biologically-informed Differential Gene Multi-Scale Graph Attention Network for predicting neoadjuvant therapy response in rectal cancer.\",\"authors\":\"Xu Luo, Pei Shu, Ning Liu, Dong Miao, Xiuding Cai, Yu Yao, Xin Wang\",\"doi\":\"10.1016/j.cmpb.2025.108974\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background and objective: </strong>Accurate prediction of the efficacy of neoadjuvant therapy - particularly the likelihood of achieving a pathological complete response (pCR) - is critical to improving outcomes in patients with rectal cancer. The anticipation of therapeutic response prior to surgery enables the development of personalized treatment strategies and reduces unnecessary interventions for non-responders. While genetic profiling has been integrated into predictive models to enhance response estimation, many existing approaches overlook gene-gene interactions. Furthermore, they often struggle with the high dimensionality, noise, and sparsity inherent in gene expression data. To address these limitations, we propose a biologically informed model, the Differential Gene Multi-Scale Graph Attention Network (DG-MSGAT). This model integrates differential expression signals with multi-scale gene interaction patterns to improve the accuracy of treatment response prediction.</p><p><strong>Methods: </strong>By integrating gene expression profiles with differential expression signals, we construct a patient-specific gene graph whose edges are defined based on curated protein-protein interaction data. This graph is then processed by DG-MSGAT, a multi-scale graph attention network that utilizes stacked attention layers and residual connections to model hierarchical gene dependencies and preserve feature integrity. The resulting representation is subsequently used to estimate the probability of achieving a pathological complete response.</p><p><strong>Results: </strong>In patients with locally advanced rectal cancer, the DG-MSGAT model substantially outperformed conventional algorithms - including support vector machines, decision trees, and random forests - in predicting neoadjuvant therapy efficacy. Network analysis identified key genes (e.g., TP53, EGFR, CTNNB1) and immune-related pathways that are consistent with clinically established determinants of therapeutic response.</p><p><strong>Conclusion: </strong>The DG-MSGAT model offers a promising advancement in the prediction of neoadjuvant therapy outcomes in rectal cancer. By effectively modeling gene interactions and mitigating the limitations associated with high-dimensional gene expression data, it provides a clinically relevant tool to support personalized treatment decision-making.</p>\",\"PeriodicalId\":10624,\"journal\":{\"name\":\"Computer methods and programs in biomedicine\",\"volume\":\"271 \",\"pages\":\"108974\"},\"PeriodicalIF\":4.8000,\"publicationDate\":\"2025-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computer methods and programs in biomedicine\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://doi.org/10.1016/j.cmpb.2025.108974\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/7/29 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer methods and programs in biomedicine","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1016/j.cmpb.2025.108974","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/7/29 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
DG-MSGAT: A Biologically-informed Differential Gene Multi-Scale Graph Attention Network for predicting neoadjuvant therapy response in rectal cancer.
Background and objective: Accurate prediction of the efficacy of neoadjuvant therapy - particularly the likelihood of achieving a pathological complete response (pCR) - is critical to improving outcomes in patients with rectal cancer. The anticipation of therapeutic response prior to surgery enables the development of personalized treatment strategies and reduces unnecessary interventions for non-responders. While genetic profiling has been integrated into predictive models to enhance response estimation, many existing approaches overlook gene-gene interactions. Furthermore, they often struggle with the high dimensionality, noise, and sparsity inherent in gene expression data. To address these limitations, we propose a biologically informed model, the Differential Gene Multi-Scale Graph Attention Network (DG-MSGAT). This model integrates differential expression signals with multi-scale gene interaction patterns to improve the accuracy of treatment response prediction.
Methods: By integrating gene expression profiles with differential expression signals, we construct a patient-specific gene graph whose edges are defined based on curated protein-protein interaction data. This graph is then processed by DG-MSGAT, a multi-scale graph attention network that utilizes stacked attention layers and residual connections to model hierarchical gene dependencies and preserve feature integrity. The resulting representation is subsequently used to estimate the probability of achieving a pathological complete response.
Results: In patients with locally advanced rectal cancer, the DG-MSGAT model substantially outperformed conventional algorithms - including support vector machines, decision trees, and random forests - in predicting neoadjuvant therapy efficacy. Network analysis identified key genes (e.g., TP53, EGFR, CTNNB1) and immune-related pathways that are consistent with clinically established determinants of therapeutic response.
Conclusion: The DG-MSGAT model offers a promising advancement in the prediction of neoadjuvant therapy outcomes in rectal cancer. By effectively modeling gene interactions and mitigating the limitations associated with high-dimensional gene expression data, it provides a clinically relevant tool to support personalized treatment decision-making.
期刊介绍:
To encourage the development of formal computing methods, and their application in biomedical research and medical practice, by illustration of fundamental principles in biomedical informatics research; to stimulate basic research into application software design; to report the state of research of biomedical information processing projects; to report new computer methodologies applied in biomedical areas; the eventual distribution of demonstrable software to avoid duplication of effort; to provide a forum for discussion and improvement of existing software; to optimize contact between national organizations and regional user groups by promoting an international exchange of information on formal methods, standards and software in biomedicine.
Computer Methods and Programs in Biomedicine covers computing methodology and software systems derived from computing science for implementation in all aspects of biomedical research and medical practice. It is designed to serve: biochemists; biologists; geneticists; immunologists; neuroscientists; pharmacologists; toxicologists; clinicians; epidemiologists; psychiatrists; psychologists; cardiologists; chemists; (radio)physicists; computer scientists; programmers and systems analysts; biomedical, clinical, electrical and other engineers; teachers of medical informatics and users of educational software.