Qian Cao , Zhaoyu Jiang , Zhixiang Wang , Leonard Wee , Andre Dekker , Zhen Zhang , Ji Zhu
{"title":"基于放射组学的二元结果预测模型的最小样本量计算:理论框架和实例。","authors":"Qian Cao , Zhaoyu Jiang , Zhixiang Wang , Leonard Wee , Andre Dekker , Zhen Zhang , Ji Zhu","doi":"10.1016/j.radonc.2025.111134","DOIUrl":null,"url":null,"abstract":"<div><h3>Background and purpose</h3><div>Determining the appropriate sample size for developing robust radiomics-based binary outcome prediction models and identifying the maximum number of predictors safely allowable within a fixed dataset size remain critical yet challenging tasks. This study aims to propose and demonstrate a structured method for addressing these issues, enhancing methodological rigor and practicality in radiomics research.</div></div><div><h3>Materials and methods</h3><div>We introduce a comprehensive sample size calculation framework for binary outcome prediction models in radiomic studies. The proposed approach integrates three key criteria: (1) maintaining a global shrinkage factor (<em>S</em>) ≥ 0.9 to control model overfitting, (2) ensuring a minimal absolute difference between apparent and adjusted performance metrics, and (3) precisely estimating the overall outcome risk. Additionally, we develop an accessible online calculation tool enabling researchers to efficiently determine either the minimum sample size or the maximum number of predictors permissible, based on clearly defined statistical parameters.</div></div><div><h3>Results</h3><div>The presented method systematically addresses model overfitting by integrating a global shrinkage factor into the calculation, providing robust estimates compared with traditional heuristic approaches (“rules of thumb”). Practical examples demonstrate that this structured method effectively balances predictive accuracy and generalizability, while the online tool provides researchers with a user-friendly platform to perform the necessary calculations.</div></div><div><h3>Conclusion</h3><div>Clear justification of sample size decisions is essential for developing reliable predictive models in radiomics research. By adopting a structured and rigorous calculation method, researchers can effectively minimize overfitting, ensure accurate risk estimation, and substantially enhance the reliability and validity of their predictive models.</div></div>","PeriodicalId":21041,"journal":{"name":"Radiotherapy and Oncology","volume":"212 ","pages":"Article 111134"},"PeriodicalIF":5.3000,"publicationDate":"2025-09-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Minimum sample size calculation for radiomics-based binary outcome prediction models: Theoretical framework and practical example\",\"authors\":\"Qian Cao , Zhaoyu Jiang , Zhixiang Wang , Leonard Wee , Andre Dekker , Zhen Zhang , Ji Zhu\",\"doi\":\"10.1016/j.radonc.2025.111134\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><h3>Background and purpose</h3><div>Determining the appropriate sample size for developing robust radiomics-based binary outcome prediction models and identifying the maximum number of predictors safely allowable within a fixed dataset size remain critical yet challenging tasks. This study aims to propose and demonstrate a structured method for addressing these issues, enhancing methodological rigor and practicality in radiomics research.</div></div><div><h3>Materials and methods</h3><div>We introduce a comprehensive sample size calculation framework for binary outcome prediction models in radiomic studies. The proposed approach integrates three key criteria: (1) maintaining a global shrinkage factor (<em>S</em>) ≥ 0.9 to control model overfitting, (2) ensuring a minimal absolute difference between apparent and adjusted performance metrics, and (3) precisely estimating the overall outcome risk. Additionally, we develop an accessible online calculation tool enabling researchers to efficiently determine either the minimum sample size or the maximum number of predictors permissible, based on clearly defined statistical parameters.</div></div><div><h3>Results</h3><div>The presented method systematically addresses model overfitting by integrating a global shrinkage factor into the calculation, providing robust estimates compared with traditional heuristic approaches (“rules of thumb”). Practical examples demonstrate that this structured method effectively balances predictive accuracy and generalizability, while the online tool provides researchers with a user-friendly platform to perform the necessary calculations.</div></div><div><h3>Conclusion</h3><div>Clear justification of sample size decisions is essential for developing reliable predictive models in radiomics research. By adopting a structured and rigorous calculation method, researchers can effectively minimize overfitting, ensure accurate risk estimation, and substantially enhance the reliability and validity of their predictive models.</div></div>\",\"PeriodicalId\":21041,\"journal\":{\"name\":\"Radiotherapy and Oncology\",\"volume\":\"212 \",\"pages\":\"Article 111134\"},\"PeriodicalIF\":5.3000,\"publicationDate\":\"2025-09-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Radiotherapy and Oncology\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0167814025046389\",\"RegionNum\":1,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ONCOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Radiotherapy and Oncology","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0167814025046389","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ONCOLOGY","Score":null,"Total":0}
Minimum sample size calculation for radiomics-based binary outcome prediction models: Theoretical framework and practical example
Background and purpose
Determining the appropriate sample size for developing robust radiomics-based binary outcome prediction models and identifying the maximum number of predictors safely allowable within a fixed dataset size remain critical yet challenging tasks. This study aims to propose and demonstrate a structured method for addressing these issues, enhancing methodological rigor and practicality in radiomics research.
Materials and methods
We introduce a comprehensive sample size calculation framework for binary outcome prediction models in radiomic studies. The proposed approach integrates three key criteria: (1) maintaining a global shrinkage factor (S) ≥ 0.9 to control model overfitting, (2) ensuring a minimal absolute difference between apparent and adjusted performance metrics, and (3) precisely estimating the overall outcome risk. Additionally, we develop an accessible online calculation tool enabling researchers to efficiently determine either the minimum sample size or the maximum number of predictors permissible, based on clearly defined statistical parameters.
Results
The presented method systematically addresses model overfitting by integrating a global shrinkage factor into the calculation, providing robust estimates compared with traditional heuristic approaches (“rules of thumb”). Practical examples demonstrate that this structured method effectively balances predictive accuracy and generalizability, while the online tool provides researchers with a user-friendly platform to perform the necessary calculations.
Conclusion
Clear justification of sample size decisions is essential for developing reliable predictive models in radiomics research. By adopting a structured and rigorous calculation method, researchers can effectively minimize overfitting, ensure accurate risk estimation, and substantially enhance the reliability and validity of their predictive models.
期刊介绍:
Radiotherapy and Oncology publishes papers describing original research as well as review articles. It covers areas of interest relating to radiation oncology. This includes: clinical radiotherapy, combined modality treatment, translational studies, epidemiological outcomes, imaging, dosimetry, and radiation therapy planning, experimental work in radiobiology, chemobiology, hyperthermia and tumour biology, as well as data science in radiation oncology and physics aspects relevant to oncology.Papers on more general aspects of interest to the radiation oncologist including chemotherapy, surgery and immunology are also published.