Agata Małgorzata Wilk , Andrzej Swierniak , Andrea d’Amico , Rafał Suwiński , Krzysztof Fujarewicz , Damian Borys
{"title":"Towards the use of multiple ROIs for radiomics-based survival modelling: Finding a strategy of aggregating lesions","authors":"Agata Małgorzata Wilk , Andrzej Swierniak , Andrea d’Amico , Rafał Suwiński , Krzysztof Fujarewicz , Damian Borys","doi":"10.1016/j.cmpb.2025.108840","DOIUrl":null,"url":null,"abstract":"<div><h3>Background:</h3><div>Radiomic features, derived from a region of interest (ROI) in medical images, are valuable as prognostic factors. Selecting an appropriate ROI is critical, and many recent studies have focused on leveraging multiple ROIs by segmenting analogous regions across patients — such as the primary tumour and peritumoral area or subregions of the tumour. These can be straightforwardly incorporated into models as additional features. However, a more complex scenario arises, for example, in a regionally disseminated disease, when multiple distinct lesions are present.</div></div><div><h3>Aim:</h3><div>This study aims to evaluate the feasibility of integrating radiomic data from multiple lesions into survival models. We explore strategies for incorporating these ROIs and hypothesize that including all available lesions can improve model performance.</div></div><div><h3>Methods:</h3><div>While each lesion produces a feature vector, the desired result is a unified prediction. We propose methods to aggregate either the feature vectors to form a representative one or the modelling results to compute a consolidated risk score. As a proof of concept, we apply these strategies to predict distant metastasis risk in a cohort of 115 non-small cell lung cancer patients, 60% of whom exhibit regionally advanced disease. Two feature sets (radiomics extracted from PET and PET interpolated to CT resolution) are tested across various survival models using a Monte Carlo Cross-Validation framework.</div></div><div><h3>Results:</h3><div>Across both feature sets, incorporating all available lesions — rather than limiting analysis to the primary tumour — consistently improved the c-index, irrespective of the survival model used. The highest c-Index obtained by a primary tumour-only model was 0.611 for the PET dataset and 0.614 for the PET_CT dataset, while by using all lesions we were able to achieve c-Indices of 0.632 and 0.634.</div></div><div><h3>Conclusion:</h3><div>Lesions beyond the primary tumour carry information that should be utilized in radiomics-based models to enhance predictive ability.</div></div>","PeriodicalId":10624,"journal":{"name":"Computer methods and programs in biomedicine","volume":"269 ","pages":"Article 108840"},"PeriodicalIF":4.9000,"publicationDate":"2025-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer methods and programs in biomedicine","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0169260725002573","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0
Abstract
Background:
Radiomic features, derived from a region of interest (ROI) in medical images, are valuable as prognostic factors. Selecting an appropriate ROI is critical, and many recent studies have focused on leveraging multiple ROIs by segmenting analogous regions across patients — such as the primary tumour and peritumoral area or subregions of the tumour. These can be straightforwardly incorporated into models as additional features. However, a more complex scenario arises, for example, in a regionally disseminated disease, when multiple distinct lesions are present.
Aim:
This study aims to evaluate the feasibility of integrating radiomic data from multiple lesions into survival models. We explore strategies for incorporating these ROIs and hypothesize that including all available lesions can improve model performance.
Methods:
While each lesion produces a feature vector, the desired result is a unified prediction. We propose methods to aggregate either the feature vectors to form a representative one or the modelling results to compute a consolidated risk score. As a proof of concept, we apply these strategies to predict distant metastasis risk in a cohort of 115 non-small cell lung cancer patients, 60% of whom exhibit regionally advanced disease. Two feature sets (radiomics extracted from PET and PET interpolated to CT resolution) are tested across various survival models using a Monte Carlo Cross-Validation framework.
Results:
Across both feature sets, incorporating all available lesions — rather than limiting analysis to the primary tumour — consistently improved the c-index, irrespective of the survival model used. The highest c-Index obtained by a primary tumour-only model was 0.611 for the PET dataset and 0.614 for the PET_CT dataset, while by using all lesions we were able to achieve c-Indices of 0.632 and 0.634.
Conclusion:
Lesions beyond the primary tumour carry information that should be utilized in radiomics-based models to enhance predictive ability.
期刊介绍:
To encourage the development of formal computing methods, and their application in biomedical research and medical practice, by illustration of fundamental principles in biomedical informatics research; to stimulate basic research into application software design; to report the state of research of biomedical information processing projects; to report new computer methodologies applied in biomedical areas; the eventual distribution of demonstrable software to avoid duplication of effort; to provide a forum for discussion and improvement of existing software; to optimize contact between national organizations and regional user groups by promoting an international exchange of information on formal methods, standards and software in biomedicine.
Computer Methods and Programs in Biomedicine covers computing methodology and software systems derived from computing science for implementation in all aspects of biomedical research and medical practice. It is designed to serve: biochemists; biologists; geneticists; immunologists; neuroscientists; pharmacologists; toxicologists; clinicians; epidemiologists; psychiatrists; psychologists; cardiologists; chemists; (radio)physicists; computer scientists; programmers and systems analysts; biomedical, clinical, electrical and other engineers; teachers of medical informatics and users of educational software.