Jiahui Mao , Wuchao Li , Xinhuan Sun , Bangkang Fu , Junjie He , Chongzhe Yan , Jianguo Zhu , Zhuxue Zhang , Jiahui Mao , Zhangxin Hong , Qi Tang , Zhen Liu , Pinhao Li , Yan Zhang , Rongpin Wang
{"title":"RPF-Net:基于CT和全片图像的非转移性ccRCC术后UISS风险分层的多模式模型","authors":"Jiahui Mao , Wuchao Li , Xinhuan Sun , Bangkang Fu , Junjie He , Chongzhe Yan , Jianguo Zhu , Zhuxue Zhang , Jiahui Mao , Zhangxin Hong , Qi Tang , Zhen Liu , Pinhao Li , Yan Zhang , Rongpin Wang","doi":"10.1016/j.cmpb.2025.108836","DOIUrl":null,"url":null,"abstract":"<div><h3>Background and objectives</h3><div>Postoperative non-metastatic clear cell renal cell carcinoma (nccRCC) patients face the risk of tumor recurrence and metastasis. However, prognosis assessment for nccRCC remains time-consuming and subjective. In the current diagnostic landscape, computed tomography (CT) images provide macro-scale anatomical information, and whole-slide images (WSIs) offer micro-scale details that are inaccessible to CT imaging. To address this gap, the study proposes a multimodal approach that leverages both CT and WSI data to develop an automated model for postoperative risk stratification in nccRCC.</div></div><div><h3>Methods</h3><div>This study proposes a multimodal model named the Radiology-Pathology Fusion Network (RPF-Net), which employs self-attention, graph-attention, and dynamic attention fusion mechanisms to integrate CT images and WSIs for classifying nccRCC patients into low-risk and intermediate-high-risk groups per the University of California, Los Angeles, Integrated Staging System (UISS) criteria. The proposed model is divided into three steps. First, the ResNet-50 and 3D ResNet-50 are used as feature extractors to respectively extract representative feature maps from WSIs and CT images. Second, a dual-branch module is designed to extract global and local features of the WSIs. Finally, a multilayer dynamic attention fusion (MDAF) module is developed to facilitate cross-modal feature interaction and predict the risk stratification results.</div></div><div><h3>Results</h3><div>The area under the curve (AUC), accuracy, precision, and F1 Score of the RPF-Net on the internal validation set are 0.949±0.013, 0.894±0.019, 0.895±0.020, and 0.894±0.019, respectively. Furthermore, the RPF-Net shows robust generalization, achieving an AUC of 0.901 on the external validation set and 0.924 on the public dataset.</div></div><div><h3>Conclusions</h3><div>The RPF-Net models the diagnostic process of multimodal data and shows strong generalization and excellent performance. This model may be a potential tool to facilitate clinical risk stratification and management for postoperative nccRCC patients.</div></div>","PeriodicalId":10624,"journal":{"name":"Computer methods and programs in biomedicine","volume":"268 ","pages":"Article 108836"},"PeriodicalIF":4.9000,"publicationDate":"2025-05-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"RPF-Net: A multimodal model for the postoperative UISS risk stratification of non-metastatic ccRCC based on CT and whole-slide images\",\"authors\":\"Jiahui Mao , Wuchao Li , Xinhuan Sun , Bangkang Fu , Junjie He , Chongzhe Yan , Jianguo Zhu , Zhuxue Zhang , Jiahui Mao , Zhangxin Hong , Qi Tang , Zhen Liu , Pinhao Li , Yan Zhang , Rongpin Wang\",\"doi\":\"10.1016/j.cmpb.2025.108836\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><h3>Background and objectives</h3><div>Postoperative non-metastatic clear cell renal cell carcinoma (nccRCC) patients face the risk of tumor recurrence and metastasis. However, prognosis assessment for nccRCC remains time-consuming and subjective. In the current diagnostic landscape, computed tomography (CT) images provide macro-scale anatomical information, and whole-slide images (WSIs) offer micro-scale details that are inaccessible to CT imaging. To address this gap, the study proposes a multimodal approach that leverages both CT and WSI data to develop an automated model for postoperative risk stratification in nccRCC.</div></div><div><h3>Methods</h3><div>This study proposes a multimodal model named the Radiology-Pathology Fusion Network (RPF-Net), which employs self-attention, graph-attention, and dynamic attention fusion mechanisms to integrate CT images and WSIs for classifying nccRCC patients into low-risk and intermediate-high-risk groups per the University of California, Los Angeles, Integrated Staging System (UISS) criteria. The proposed model is divided into three steps. First, the ResNet-50 and 3D ResNet-50 are used as feature extractors to respectively extract representative feature maps from WSIs and CT images. Second, a dual-branch module is designed to extract global and local features of the WSIs. Finally, a multilayer dynamic attention fusion (MDAF) module is developed to facilitate cross-modal feature interaction and predict the risk stratification results.</div></div><div><h3>Results</h3><div>The area under the curve (AUC), accuracy, precision, and F1 Score of the RPF-Net on the internal validation set are 0.949±0.013, 0.894±0.019, 0.895±0.020, and 0.894±0.019, respectively. Furthermore, the RPF-Net shows robust generalization, achieving an AUC of 0.901 on the external validation set and 0.924 on the public dataset.</div></div><div><h3>Conclusions</h3><div>The RPF-Net models the diagnostic process of multimodal data and shows strong generalization and excellent performance. This model may be a potential tool to facilitate clinical risk stratification and management for postoperative nccRCC patients.</div></div>\",\"PeriodicalId\":10624,\"journal\":{\"name\":\"Computer methods and programs in biomedicine\",\"volume\":\"268 \",\"pages\":\"Article 108836\"},\"PeriodicalIF\":4.9000,\"publicationDate\":\"2025-05-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computer methods and programs in biomedicine\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0169260725002536\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer methods and programs in biomedicine","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0169260725002536","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
RPF-Net: A multimodal model for the postoperative UISS risk stratification of non-metastatic ccRCC based on CT and whole-slide images
Background and objectives
Postoperative non-metastatic clear cell renal cell carcinoma (nccRCC) patients face the risk of tumor recurrence and metastasis. However, prognosis assessment for nccRCC remains time-consuming and subjective. In the current diagnostic landscape, computed tomography (CT) images provide macro-scale anatomical information, and whole-slide images (WSIs) offer micro-scale details that are inaccessible to CT imaging. To address this gap, the study proposes a multimodal approach that leverages both CT and WSI data to develop an automated model for postoperative risk stratification in nccRCC.
Methods
This study proposes a multimodal model named the Radiology-Pathology Fusion Network (RPF-Net), which employs self-attention, graph-attention, and dynamic attention fusion mechanisms to integrate CT images and WSIs for classifying nccRCC patients into low-risk and intermediate-high-risk groups per the University of California, Los Angeles, Integrated Staging System (UISS) criteria. The proposed model is divided into three steps. First, the ResNet-50 and 3D ResNet-50 are used as feature extractors to respectively extract representative feature maps from WSIs and CT images. Second, a dual-branch module is designed to extract global and local features of the WSIs. Finally, a multilayer dynamic attention fusion (MDAF) module is developed to facilitate cross-modal feature interaction and predict the risk stratification results.
Results
The area under the curve (AUC), accuracy, precision, and F1 Score of the RPF-Net on the internal validation set are 0.949±0.013, 0.894±0.019, 0.895±0.020, and 0.894±0.019, respectively. Furthermore, the RPF-Net shows robust generalization, achieving an AUC of 0.901 on the external validation set and 0.924 on the public dataset.
Conclusions
The RPF-Net models the diagnostic process of multimodal data and shows strong generalization and excellent performance. This model may be a potential tool to facilitate clinical risk stratification and management for postoperative nccRCC patients.
期刊介绍:
To encourage the development of formal computing methods, and their application in biomedical research and medical practice, by illustration of fundamental principles in biomedical informatics research; to stimulate basic research into application software design; to report the state of research of biomedical information processing projects; to report new computer methodologies applied in biomedical areas; the eventual distribution of demonstrable software to avoid duplication of effort; to provide a forum for discussion and improvement of existing software; to optimize contact between national organizations and regional user groups by promoting an international exchange of information on formal methods, standards and software in biomedicine.
Computer Methods and Programs in Biomedicine covers computing methodology and software systems derived from computing science for implementation in all aspects of biomedical research and medical practice. It is designed to serve: biochemists; biologists; geneticists; immunologists; neuroscientists; pharmacologists; toxicologists; clinicians; epidemiologists; psychiatrists; psychologists; cardiologists; chemists; (radio)physicists; computer scientists; programmers and systems analysts; biomedical, clinical, electrical and other engineers; teachers of medical informatics and users of educational software.